Measures of Variability. Why are measures of variability important? Why not just stick with the mean?  Ratings of attractiveness (out of 10) – Mean =

Slides:



Advertisements
Similar presentations
Quantitative Methods in HPELS 440:210
Advertisements

C. D. Toliver AP Statistics
Additional Measures of Center and Spread
Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
University of Durham D Dr Robert Coe University of Durham School of Education Tel: (+44 / 0) Fax: (+44 / 0)
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Descriptive Statistics Statistical Notation Measures of Central Tendency Measures of Variability Estimating Population Values.
Jan Shapes of distributions… “Statistics” for one quantitative variable… Mean and median Percentiles Standard deviations Transforming data… Rescale:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Measures of Central Tendency. Central Tendency = values that summarize/ represent the majority of scores in a distribution Central Tendency = values that.
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 3: Central Tendency And Dispersion.
Measures of Variability. Why are measures of variability important? Why not just stick with the mean?  Ratings of attractiveness (out of 10) – Mean =
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 4: The Normal Distribution and Z-Scores.
Descriptive Statistics: Overview Measures of Center Mode Median Mean * Measures of Symmetry Skewness Measures of Spread Range Inter-quartile Range Variance.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
EDUC 200C Section 1– Describing Data Melissa Kemmerle September 28, 2012.
Central Tendency and Variability Chapter 4. Central Tendency >Mean: arithmetic average Add up all scores, divide by number of scores >Median: middle score.
Measures of Central Tendency
Formula Compute a standard deviation with the Raw-Score Method Previously learned the deviation formula Good to see “what's going on” Raw score formula.
Summary statistics Using a single value to summarize some characteristic of a dataset. For example, the arithmetic mean (or average) is a summary statistic.
POPULATION DYNAMICS Required background knowledge:
Chapter 3 Averages and Variations
Statistics Recording the results from our studies.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Review Measures of central tendency
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Table of Contents 1. Standard Deviation
Psyc 235: Introduction to Statistics Lecture Format New Content/Conceptual Info Questions & Work through problems.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
1 Review Mean—arithmetic average, sum of all scores divided by the number of scores Median—balance point of the data, exact middle of the distribution,
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Page 1 Chapter 3 Variability. Page 2 Central tendency tells us about the similarity between scores Variability tells us about the differences between.
Lecture 3 Describing Data Using Numerical Measures.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Do Now Find the mean, median, mode, and range of each data set and then state which measure of central tendency best represents the data. 1)2, 3, 3, 3,
Chapter 5 Measures of Variability. 2 Measures of Variability Major Points The general problem The general problem Range and related statistics Range and.
Chapter 3 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Chapter 3: Measures of Central Tendency and Variability Imagine that a researcher.
Practice Page 65 –2.1 Positive Skew Note Slides online.
Measures of Spread 1. Range: the distance from the lowest to the highest score * Problem of clustering differences ** Problem of outliers.
Measures of Dispersion. Introduction Measures of central tendency are incomplete and need to be paired with measures of dispersion Measures of dispersion.
CHAPTER 3  Descriptive Statistics Measures of Central Tendency 1.
Numerical Measures of Variability
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Central Tendency & Dispersion
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
Summary Statistics: Measures of Location and Dispersion.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Measures of Central Tendency: Averages or other measures of “location” that find a single number that reflects the middle of the distribution of scores—
MR. MARK ANTHONY GARCIA, M.S. MATHEMATICS DEPARTMENT DE LA SALLE UNIVERSITY.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Notes 13.2 Measures of Center & Spread
One-Variable Statistics
Practice Page Practice Page Positive Skew.
Descriptive Statistics (Part 2)
Descriptive Statistics: Overview
Central Tendency and Variability
Chapter 3 Describing Data Using Numerical Measures
Box and Whisker Plots Algebra 2.
Descriptive Statistics
Warm up Find the Mean Median Mode 0, 23, 12, 5, 8, 5, 13, 5, 15.
Presentation transcript:

Measures of Variability

Why are measures of variability important? Why not just stick with the mean?  Ratings of attractiveness (out of 10) – Mean = 5  Everyone rated you a 5 (low variability) What could we conclude about attractiveness from this?  People’s ratings fell into a range from 1 – 10, that averaged a 5 (high variability) What could we conclude about attractiveness from this?

Measures of Variability Range Interquartile Range Average Deviation Variance Standard Deviation

Measures of Variability Range  The difference between the highest and lowest values in a dataset  Heavily biased by outliers Dataset #1: Range = 6 Dataset #2: million Range = 10,999,995

Measures of Variability Interquartile Range  The difference between the highest and lowest values in the middle 50% of a dataset  Less biased by outliers than the Range  Based on sample with upper and lower 25% of the data “trimmed”  However this kind of trimming essentially ignores half of your data – better to trim top and bottom 1 or 5%

Measures of Variability Average Deviation  For each score, calculate deviation from the mean, then sum all of theses scores  However, this score will always equal zero Dataset: 19, 16, 20, 17, 20, 19, 7, 11, 10, 19, 14, 11, 6, 11, 14, 19, 20, 17, 4, 11  X = /20 = 14.25

 Diff. from Mean = 0 Mean = 0/N = 0 DataMeanDiff. from Mean

Measures of Variability Variance  Sample Variance (s 2 ) =  (X - ) 2 /(N-1)  Population Variance (σ 2 ) =  (X - ) 2 /N Note the use of squared units! Gets rid of the positive and negative values in our “Diff. from Mean” column before that added up to 0 However, because we’re squaring our values they will not be in the metric of our original scale  If we calculate the variance for a test out of 100, a variance of 100 is actually average variability of 10 pts. (  100 = 10) about the mean of the test

 (Diff. from Mean) 2 = Variance = /(20-1) = DataMeanDiff. from Mean (Diff. from Mean)

Measures of Variability Standard Deviation  Sample Standard Deviation (s) = √ [  (X - ) 2 /(N-1)]  Population Standard Deviation (σ) = √ [  (X - ) 2 /N] Note that the formula is identical to the Variance except that after everything else you take the square-root! You can interpret the standard deviation without doing any mental math, like you did with the variance  Variance =  Standard Deviation = √(25.99) = 5.10

Measures of Variability Standard Deviation  Example: Bush/Cheaney – 55% Kerry/Edwards – 40%  Margin of Error = 30% Bush/Cheaney – 25% – 85% Kerry/Edwards – 10% - 70%

Computational Formula for Variability Definitional Formula  designed more to illustrate how the formula relates to the concept it underlies Computational Formula  identical to the definitional formula, but different in form  allows you to compute your variable with less effort  particularly useful with large datasets

Computational Formula for Variability Definitional Formula for Variance:  s 2 = (X – ) 2 N – 1 Computational Formula for Variance:  s 2 = All you need to plug in here is  X 2 and  X Standard deviation still = √ s 2, no matter how it is calculated

Computational Formula for Variability Definitional Formula for Standard Deviation:  s = √ [(X – ) 2 ] [ N – 1 ] Computational Formula for Standard Deviation  s = √ ( )

Computational Formula for Variability Example:  For the following dataset, compute the variance and standard deviation   X = 23   X 2 = 77  s 2 = 77 – (23) 2 _____8__ 8 – 1 s 2 = 77 – = Data (X)X2X

Measures of Variability What do you think will happen to the standard deviation if we add a constant (say 4) to all of our scores? What if we multiply all the scores by a constant?

Measures of Variability Characteristics of the Standard Deviation  Adding a constant to each score will not alter the standard deviation i.e. add 3 to all scores in a sample and your s.d. will remain unchanged Let’s say our scores originally ranged from 1 – 10  Add 5 to all scores, the new data ranges from 6 – 15  In both cases the range is 9

Measures of Variability  However, multiplying or dividing each score by a constant causes the s.d. to be similarly multiplied or divided by that constant i.e. divide each score by 2 and your s.d. will decrease from 10 to 5 in multiplication, higher numbers increase more than lower ones do, increasing the distance between the highest and lowest score, which increases the variability  i.e. 2 x 5 = 10 – difference of 8 pts. 5 x 5 = 25 – difference of 20 pts.

Graphically Depicting Variability Boxplot/Box-and- Whisker Plot  Median  Hinges/1 st & 3 rd Quartiles  H-Spread  Whisker  Outlier

Graphically Depicting Variability Boxplot/Box-and- Whisker Plot  Median  Hinges/1 st & 3 rd Quartiles  H-Spread  Whisker  Outlier

Graphically Depicting Variability Boxplot/Box-and- Whisker Plot  Median  Hinges/1 st & 3 rd Quartiles  H-Spread  Whisker  Outlier {

Graphically Depicting Variability Boxplot/Box-and- Whisker Plot  Median  Hinges/1 st & 3 rd Quartiles  H-Spread  Whisker  Outlier

Graphically Depicting Variability Boxplot/Box-and- Whisker Plot  Median  Hinges/1 st & 3 rd Quartiles  H-Spread  Whisker  Outlier

Graphically Depicting Variability Interpreting a Boxplot/Box-and- Whisker Plot  Off-center median = Non- symmetry  Longer top whisker = Positively-skewed distribution  Longer bottom whisker = Negatively-skewed distribution

Graphically Depicting Variability

Boxplot/Box-and-Whisker Plot  Hinge/Quartile Location = (Median Location+1)/2  Data: Median Location = (16+1)/2 = 8.5 Hinge Location = (8.5+1)/2 = 4.75 (4 since we drop the fraction) Hinges = 5 and 18

Graphically Depicting Variability  H-Spread = Upper Hinge – Lower Hinge H-Spread = 18-5 = 13  Whisker = H-Spread x 1.5 Since the whisker always ends at an actual data point, if we, say calculated the whisker to end at a value of 12, but the data only has a 10 and a 15, we would end the whisker at the 10. Whiskers = 12x1.5 = 19.5 Lower whisker from 5 to 1 Higher whisker from 18 to 21  Outliers Value of 40 extends beyond upper whisker

Graphically Depicting Variability

Review Questions Central Tendency  For the following data set 1, 3, 3, 5, 5, 5, 7, 7, 9  What is the mode?  What is the mean?  What is the median?

Review Questions  Answer: 1, 3, 3, 5, 5, 5, 7, 7, 9  Mode = 5  Mean =  X = 45/9 = 5  Median: Median Location = (9 +1)/2 = 5 Median = 5

Review Questions Variability  For the following data set: What is the range? What is the variance? What is the standard deviation?

Review Questions  Answer Range = 1 – 5 Variance:  Mean = 2.875

Review Questions  What is the variance?  (Diff. from Mean) 2 = /N – 1 = 1.55 DataMeanDiff.from Mean(Diff. from Mean)

Review Questions  What is the standard deviation? √1.55 = 1.25