Measures of Variability Variability. Measure of Variability (Dispersion, Spread) Variance, standard deviation Range Inter-Quartile Range Pseudo-standard.

Slides:



Advertisements
Similar presentations
CHAPTER 1 Exploring Data
Advertisements

Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Measures of Dispersion
Measures of Dispersion or Measures of Variability
1. 2 BIOSTATISTICS TOPIC 5.4 MEASURES OF DISPERSION.
1.2 Describing Distributions with Numbers. Center and spread are the most basic descriptions of what a data set “looks like.” They are intuitively meant.
Measures of Variability or Dispersion Range - high and lows. –Limitations: Based on only two extreme observations Interquartile range - measures variablility.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
1.2: Describing Distributions
2.3. Measures of Dispersion (Variation):
1 Numerical Summary Measures Lecture 03: Measures of Variation and Interpretation, and Measures of Relative Position.
Descriptive Statistics for Numeric Variables Types of Measures: measures of location measures of spread measures of shape measures of relative standing.
Chapter 2 Describing Data with Numerical Measurements
Describing distributions with numbers
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Chapter-2 Statistical description of quantitative variable.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
1 Excursions in Modern Mathematics Sixth Edition Peter Tannenbaum.
1.3: Describing Quantitative Data with Numbers
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Measures of Dispersion
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Measures of Variability Variability. Measure of Variability (Dispersion, Spread) Variance, standard deviation Range Inter-Quartile Range Pseudo-standard.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describing distributions with numbers
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
INVESTIGATION 1.
Measure of Variability (Dispersion, Spread) 1.Range 2.Inter-Quartile Range 3.Variance, standard deviation 4.Pseudo-standard deviation.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Variability Pick up little assignments from Wed. class.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Compare the following heights in inches: BoysGirls
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
CHAPTER 2: Basic Summary Statistics
COMPUTATIONAL FORMULAS AND IQR’S. Compare the following heights in inches: BoysGirls
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Parameter, Statistic and Random Samples
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 2: Basic Summary Statistics
Chapter 1: Exploring Data
The Five-Number Summary
Chapter 1: Exploring Data
Measures of Variability
Chapter 1: Exploring Data
2.3. Measures of Dispersion (Variation):
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Measures of Dispersion
Chapter 1: Exploring Data
Presentation transcript:

Measures of Variability Variability

Measure of Variability (Dispersion, Spread) Variance, standard deviation Range Inter-Quartile Range Pseudo-standard deviation

Range

Definition Let min = the smallest observation Let max = the largest observation Then Range =max - min Range

Inter-Quartile Range (IQR)

Definition Let Q 1 = the first quartile, Q 3 = the third quartile Then the Inter-Quartile Range = IQR = Q 3 - Q 1

Q1Q1 Q3Q3 25% 50% Inter-Quartile Range

Example The data Verbal IQ on n = 23 students arranged in increasing order is:

Example The data Verbal IQ on n = 23 students arranged in increasing order is: Q 2 = 96Q 1 = 89 Q 3 = 105 min = 80max = 119

Range Range = max – min = 119 – 80 = 39 Inter-Quartile Range = IQR = Q 3 - Q 1 = 105 – 89 = 16

Some Comments Range and Inter-quartile range are relatively easy to compute. Range slightly easier to compute than the Inter-quartile range. Range is very sensitive to outliers (extreme observations)

Variance and Standard deviation

Sample Variance Let x 1, x 2, x 3, … x n denote a set of n numbers. Recall the mean of the n numbers is defined as:

The numbers are called deviations from the the mean

The sum is called the sum of squares of deviations from the the mean. Writing it out in full: or

The Sample Variance Is defined as the quantity: and is denoted by the symbol Comment One might think that the divisor in variance should be n. For certain reasons it was found that a divisor of n – 1, resulted in a estimator with a particular desirable property – unbiasedness

Example Let x 1, x 2, x 3, x 3, x 4, x 5 denote a set of 5 denote the set of numbers in the following table. i12345 xixi

Then = x 1 + x 2 + x 3 + x 4 + x 5 = = 66 and

The deviations from the mean d 1, d 2, d 3, d 4, d 5 are given in the following table. i12345 xixi didi

The sum and

The Sample Standard Deviation s Definition: The Sample Standard Deviation is defined by: Hence the Sample Standard Deviation, s, is the square root of the sample variance.

In the last example

Interpretations of s In Normal distributions –Approximately 2/3 of the observations will lie within one standard deviation of the mean –Approximately 95% of the observations lie within two standard deviations of the mean –In a histogram of the Normal distribution, the standard deviation is approximately the distance from the mode to the inflection point

s Inflection point Mode

s 2/3 s

2s

Example A researcher collected data on 1500 males aged The variable measured was cholesterol and blood pressure. –The mean blood pressure was 155 with a standard deviation of 12. –The mean cholesterol level was 230 with a standard deviation of 15 –In both cases the data was normally distributed

Interpretation of these numbers Blood pressure levels vary about the value 155 in males aged Cholesterol levels vary about the value 230 in males aged

2/3 of males aged have blood pressure within 12 of 155. Ii.e. between =143 and = /3 of males aged have Cholesterol within 15 of 230. i.e. between =215 and = 245.

95% of males aged have blood pressure within 2(12) = 24 of 155. Ii.e. between =131 and = % of males aged have Cholesterol within 2(15) = 30 of 230. i.e. between =200 and = 260.

A Computing formula for: Sum of squares of deviations from the the mean : The difficulty with this formula is that will have many decimals. The result will be that each term in the above sum will also have many decimals.

The sum of squares of deviations from the the mean can also be computed using the following identity:

To use this identity we need to compute:

Then:

Example The data Verbal IQ on n = 23 students arranged in increasing order is:

= = 2244 = =

Then:

A quick (rough) calculation of s The reason for this is that approximately all (95%) of the observations are between and Thus

Example Verbal IQ on n = 23 students min = 80and max = 119 This compares with the exact value of s which is The rough method is useful for checking your calculation of s.

The Pseudo Standard Deviation (PSD)

Definition: The Pseudo Standard Deviation (PSD) is defined by:

Properties For Normal distributions the magnitude of the pseudo standard deviation (PSD) and the standard deviation (s) will be approximately the same value For leptokurtic distributions the standard deviation (s) will be larger than the pseudo standard deviation (PSD) For platykurtic distributions the standard deviation (s) will be smaller than the pseudo standard deviation (PSD)

Example Verbal IQ on n = 23 students Inter-Quartile Range = IQR = Q 3 - Q 1 = 105 – 89 = 16 Pseudo standard deviation This compares with the standard deviation