Measures of Variation. For discrete variables, the Index of Qualitative Variation.

Slides:



Advertisements
Similar presentations
I OWA S TATE U NIVERSITY Department of Animal Science Using Basic Graphical and Statistical Procedures (Chapter in the 8 Little SAS Book) Animal Science.
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Measures of Dispersion
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
Statistics Intro Univariate Analysis Central Tendency Dispersion.
PPA 415 – Research Methods in Public Administration Lecture 4 – Measures of Dispersion.
Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Standard Deviation Interquartile Range (IQR)
Chapter 5 – 1 Chapter 5: Measures of Variability The Importance of Measuring Variability The Range IQR (Inter-Quartile Range) Variance Standard Deviation.
Chapter 5 – 1 Chapter 5 Measures of Variability The importance of measuring variability IQV (index of qualitative variation) The range IQR (inter-quartile.
Descriptive Statistics: Overview Measures of Center Mode Median Mean * Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Week 3 Topic - Descriptive Procedures Program 3 in course notes Cody & Smith (Chapter 2)
Poverty & Human Capability 101 Introductory Class.
1 Chapter 2: Methods for Describing Sets of Data (Page 19-98) Homework:14ab, 36, 43, 45, 51, 56, 64abc, 71, 79, 85, 89, 96.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
European Growth Phases, Period Real GDP Real GDP per capita
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Chapter 5 – 1 Chapter 5: Measures of Variability The Importance of Measuring Variability IQV (Index of Qualitative Variation) The Range IQR (Inter-Quartile.
Countries of Europe France Spain Italy Germany Which country is this?
Review Measures of central tendency
Table of Contents 1. Standard Deviation
Corrected page numbers for optional readings in SAS ® Languages and Procedures Aug 30pp and Sept 6pp Sept 13pp Sept 27pp
T- and Z-Tests for Hypotheses about the Difference between Two Subsamples.
Skewness & Kurtosis: Reference
Measures of Dispersion How far the data is spread out.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
INVESTIGATION 1.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Chapter 5: Measures of Variability  The Importance of Measuring Variability  IQV (Index of Qualitative Variation)  The Range  IQR (Inter-Quartile Range)
Measures of Spread 1. Range: the distance from the lowest to the highest score * Problem of clustering differences ** Problem of outliers.
Stem and Leaf Bingo. Pick 8 from the list
INVESTIGATION Data Colllection Data Presentation Tabulation Diagrams Graphs Descriptive Statistics Measures of Location Measures of Dispersion Measures.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
Chapter 3: Averages and Variation Section 2: Measures of Dispersion.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Unit 3: Averages and Variations Week 6 Ms. Sanchez.
Summary Statistics: Measures of Location and Dispersion.
The Distribution of Single Variables. Two Types of Variables Continuous variables – Equal intervals of measurement – Known zero-point that is meaningful.
METHODS OF SPATIAL ECONOMIC ANALYSIS LECTURE 02 Δρ. Μαρί-Νοέλ Ντυκέν, Αναπληρώτρια Καθηγήτρια, Τηλ Γραφείο Γ.6 UNIVERSITY.
Today: March 7 Data Transformations Rank Tests for Non-Normal data Solutions for Assignment 4.
Psychology 202a Advanced Psychological Statistics September 8, 2015.
Other Types of t-tests Recapitulation Recapitulation 1. Still dealing with random samples. 2. However, they are partitioned into two subsamples. 3. Interest.
LIS 570 Summarising and presenting data - Univariate analysis.
Math 310 Section 8.1 & 8.2 Statistics. Centers and Spread A goal in statistics is to determine how data is centered and spread. There are many different.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 5. Measuring Dispersion or Spread in a Distribution of Scores.
Customize SAS Output Using ODS Joan Dong. The Output Delivery System (ODS) gives you greater flexibility in generating, storing, and reproducing SAS procedure.
CHAPTER 2: Basic Summary Statistics
Data Presentation Data Presentation Frances A. Pfab.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Chapter 4: Measures of Variability
6.1 - Measures of Central Tendency and Variation
제 5장 기술통계 및 추론 PROC MEANS 절차 PROC MEANS <options> ;
Practice Page Practice Page Positive Skew.
Descriptive Statistics: Overview
Description of Data (Summary and Variability measures)
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Warm Up # 3: Answer each question to the best of your knowledge.
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Descriptive Statistics
CHAPTER 2: Basic Summary Statistics
Lecture 4 Psyc 300A.
Prodcom Statistics in Focus
Presentation transcript:

Measures of Variation

For discrete variables, the Index of Qualitative Variation

Religious Preference Percent Protestant 65.6 Catholic 24.2 Jewish 2.3 Other 1.2 None 6.1 No Answer 0.5

where p = proportion of cases and K = # of categories

Religious Preference Percent Proportion Protestant Catholic Jewish Other None No Answer

Religious Preference Percent Proportion Proportion 2 Protestant Catholic Jewish Other None No Answer

Religious Preference Percent Proportion Proportion 2 Protestant Catholic Jewish Other None No Answer

IQV = ( ) / [(6 - 1) / 6] = (0.506) / (5 / 6) = (0.506) / (0.833) = 0.61

What does this mean? When there is perfect dispersion, IQV = 1.00 When there is no dispersion, IQV = 0.00

Religious Preference Percent Proportion Proportion 2 Protestant Catholic Jewish Other None No Answer IQV = ( ) / [(6 - 1) / 6] = (0.833) / (5 / 6) = (0.833) / (0.833) = 1.00

Religious Preference Percent Proportion Proportion 2 Protestant Catholic Jewish Other None No Answer IQV = ( ) / [(6 - 1) / 6] = (0.000) / (5 / 6) = (0.000) / (0.833) = 0.00

For continuous variables, 1. range 2. interquartile range 3. standard deviation 4. variance

The Range The distance across 100% of scores Range = H – L + 1

For example, take the following 12 values (N = 12): 5, 2, 27, 32, 3, 5, 35, 7, 31, 42, 37, 39 To determine any of the so-called quantile statistics such as the range, the scores first must be ranked or ordered, here in descending order: 1st th 2

[42.5] 1st th 2 [1.5] Range = 42 – = 41.0

The Interquartile Range The distance across the middle 50% of scores IQR = Q 3 – Q 1

1st42 2nd39 3rd37 4th35 5th32 6th31 7th27 8th15 9th 7 10th 5 11th 3 12th 2

Univariate and EDA Statistics PPD 404 Stem Leaf # Boxplot * * | *-----* Multiply Stem.Leaf by 10**+3

1st42 2nd39 3rd th35 5th32 6th31 7th27 8th15 9th th 5 11th 3 12th 2

1st42 2nd39 3rd Q 3 4th35 5th32 6th31 7th27 8th15 9th Q 1 10th 5 11th 3 12th 2

1st42 2nd39 3rd Q 3 = ( )/2 = th35 5th32 6th31 7th27 8th15 9th Q 1 = ( )/2 = th 5 11th 3 12th 2

1st42 2nd39 3rd Q 3 = ( )/2 = th35 5th32 6th31 7th27 8th15 9th Q 1 = ( )/2 = th 5 11th 3 12th 2 IQR = Q 3 – Q 1 = 36.0 – 6.0 = 30.0

The Standard Deviation

  (0.003)

The sum of the deviations will always be zero (except for rounding error)

The Sum of the Deviations 1—2—3—4—5 ^ Mean = ————— ——— +1 0 Sum = (-2) + (+2) + (-1) + (+1) + (0) = 0.0

 =  =

The Variance

 =  = s y 2 = / (6 - 1) =

The standard deviation Simply the square root of the variance s Y = 8.914

Z-scores pure numbers with mean of 0.0 and standard deviation of 1.00 z 1 = ( ) / 6.45 = (-2.00) / 6.45 = z 1 = ( ) / = (-2.00) / =

Using SAS to Produce Z-Scores libname old 'a:\'; libname library 'a:\'; options ps=66 nodate nonumber; data temp1; set old.cities; popstd=populat; run; proc standard data=temp1 mean=0.0 std=1.0 out=temp2; var popstd; run; proc print data=temp2; id populat; var popstd; title1 'Z-Scores Produced by PROC STANDARD'; title2; title3 'PPD 404'; run;

Z-Scores Produced by PROC STANDARD PPD 404 POPULAT POPSTD

libname mydata 'a:\'; libname library 'a:\'; options ps=66 nodate nonumber; proc univariate data=mydata.cities; var populat; title1 'Univariate Statistics'; run;

Univariate Statistics PPD 404 Univariate Procedure Variable=POPULAT NUMBER OF RESIDENTS, IN 1,000S Moments N 63 Sum Wgts 63 Mean Sum Std Dev Variance Skewness Kurtosis USS CSS CV Std Mean T:Mean= Pr>|T| Num ^= 0 63 Num > 0 63 M(Sign) 31.5 Pr>=|M| Sgn Rank 1008 Pr>=|S| W:Normal Pr<W

Quantiles(Def=5) 100% Max % % Q % % Med % % Q % 72 0% Min 56 5% 60 1% 56 Range 7840 Q3-Q1 541 Mode 56 Extremes Lowest Obs Highest Obs 56( 30) 1511( 56) 56( 24) 1949( 55) 58( 46) 2816( 54) 60( 21) 3367( 53) 65( 51) 7896( 52)

Calculate the INDEX OF QUALITATIVE VARIATION for the data in the following table. =============================================================== Service Branch FrequencyPP Air Force 56 Army 166 Marine Corps 14 Merchant Marines 1 Navy Total

=============================================================== Service Branch Frequency P P Air Force Army Marine Corps Merchant Marines Navy Total INDEX OF QUALITATIVE VARIATION = 0.776

Here are data once again from 16 European countries. ============================================================================== Gross Domestic Percent in Crude Birth Nation Product (GDP) Agriculture Rate per (in billion$) 1, Austria Belgium Denmark Finland France Germany Great Britain Greece Ireland Italy Netherlands Norway Portugal Spain Sweden Switzerland What is the RANGE for the PERCENT IN AGRICULTURE? What is the INTERQUARTILE RANGE for the PERCENT IN AGRICULTURE?

First, rank the values in descending order. Find the difference between the HIGHEST and LOWEST values (and add 1) RANGE = H – L + 1 = 48 – = 44.0

Having ranked the values in descending order, determine the value at the location dividing the upper 4 values from the lower 12 values. Then determine the value at the location dividing the upper 12 values from the lower 4 values. Find the difference between these two values Q 3 = ( ) / 2 = Q 1 = ( ) / 2 = IQR = Q 3 – Q 1 = 37.0 – 14.0 = 23.0

============================================================================== Gross Domestic Percent in Crude Birth Nation Product (GDP) Agriculture Rate per (in billion$) 1, Austria Belgium Denmark Finland France Germany Great Britain Greece Ireland Italy Netherlands Norway Portugal Spain Sweden Switzerland What is the STANDARD DEVIATION for GDP? What is Germany’s Z-SCORE for GDP?

First, determine the value of the mean.

Next, determine the deviations and squared deviations for each value

What is Germany’s Z-SCORE for GDP? Germany’s GDP = 112 Mean GDP =