NUMERICAL DESCRIPTIVE STATISTICS Measures of Variability.

Slides:



Advertisements
Similar presentations
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Advertisements

Measures of Variability or Dispersion Range - high and lows. –Limitations: Based on only two extreme observations Interquartile range - measures variablility.
Calculating & Reporting Healthcare Statistics
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
NUMERICAL DESCRIPTIVE STATISTICS Measures of Central Tendency Measures of Central Tendency.
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
NUMERICAL DESCRIPTIVE STATISTICS Measures of Variability.
Intro to Descriptive Statistics
Measures of Variability
2.3. Measures of Dispersion (Variation):
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Learning Objectives In this chapter you will learn about the importance of variation how to measure variation range variance standard deviation.
Measures of Variability: Range, Variance, and Standard Deviation
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 4 Summarizing Data.
Describing Data: Numerical
Economics 173 Business Statistics Lecture 2 Fall, 2001 Professor J. Petry
MGQ 201 WEEK 4 VICTORIA LOJACONO. Help Me Solve This Tool.
SECTION 3.2 MEASURES OF SPREAD Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Numerical Descriptive Techniques
Chapter 3 – Descriptive Statistics
Slide 1 Lecture 4: Measures of Variation Given a stem –and-leaf plot Be able to find »Mean ( * * )/10=46.7 »Median (50+51)/2=50.5 »mode.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Unit 3 Section 3-3 – Day : Measures of Variation  Range – the highest value minus the lowest value.  The symbol R is used for range.  Variance.
Business Statistics: Communicating with Numbers
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
M07-Numerical Summaries 1 1  Department of ISM, University of Alabama, Lesson Objectives  Learn when each measure of a “typical value” is appropriate.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Measures of Dispersion
Descriptive Statistics: Presenting and Describing Data.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
Describing Quantitative Data Numerically Symmetric Distributions Mean, Variance, and Standard Deviation.
Standard Deviation Lecture 18 Sec Tue, Feb 15, 2005.
Part II Sigma Freud and Descriptive Statistics Chapter 3 Vive La Différence: Understanding Variability.
BIA 2610 – Statistical Methods Chapter 3 – Descriptive Statistics: Numerical Measures.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
Descriptive Statistics for one variable. Statistics has two major chapters: Descriptive Statistics Inferential statistics.
CHAPTER 2: Basic Summary Statistics
Averages and Variability
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Bio-Statistic KUEU 3146 & KBEB 3153 Bio-Statistic Data grouping and presentations Part II: Summarizing Data.
Measures of Variation. Range, Variance, & Standard Deviation.
Copyright © Cengage Learning. All rights reserved. Probability and Statistics.
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
Copyright © Cengage Learning. All rights reserved.
Measures of Central Tendency and Variability
Measures of Dispersion
Copyright © Cengage Learning. All rights reserved.
Introduction to Elementary Statistics
Central Tendency and Variability
Descriptive Statistics: Presenting and Describing Data
BUS7010 Quant Prep Statistics in Business and Economics
Numerical Descriptive Measures
Describing Quantitative Data Numerically
Hypothesis Testing and Confidence Intervals
Numerical Descriptive Measures
CHAPTER 2: Basic Summary Statistics
St. Edward’s University
Statistics 5/19/2019.
2.3. Measures of Dispersion (Variation):
Lecture 4 Psyc 300A.
Numerical Descriptive Measures
Presentation transcript:

NUMERICAL DESCRIPTIVE STATISTICS Measures of Variability

Another Description of the Data -- Variability For Data Set A below, the mean of the 10 observations is SET A:4,2,3,3,2,2,1,4,3,2 But each of the following two data sets with 10 observations also has a mean of 2.60 SET B:2,2,2,2,3,3,3,3,3,3 SET C:0,0,1,1,4,4,4,4,4,4 Although sets A, B, an C all have the same mean, the “spread” of the data differs from set to set.

The “Spread” of the Data Data Set A Data Set B Data Set C Most “spread”Least “spread”

Measures of Variability Population –Variance  2 – Standard Deviation  Sample – Range – Variance s 2 – Standard Deviation s

The Range When we are talking about a sample, the range is the difference between the highest and lowest observation In the sample there were some A’s (4’s), and the lowest value in the sample was a D (1) –Sample range = = 3

Another Approach to Variability The range only takes into account the two most extreme values A better approach –Look at the variability of all the data In some sense find the “average” deviation from the mean The value of an observation minus the mean can be positive or negative The plusses and minuses cancel each other out giving an average value of 0 –Need another measure

How to Average Only Positive Deviations MEAN ABSOLUTE DEVIATION (MAD)MEAN ABSOLUTE DEVIATION (MAD) –Averages the absolute values of these differences –Used in quality control/inventory analyses –But this quantity is hard to work with algebraically and analytically POPULATION VARIANCE (σ 2 )POPULATION VARIANCE (σ 2 ) –Averages the squares of the differences from the mean

Population Variance Formulas

EXAMPLE Calculation of σ 2 Using the numbers from the population of 2000 GPA’s: 4,2,1,3,3,3,2,… 2

Standard Deviation But the unit of measurement for σ 2 is: –Square Grade Points (???) What is a square grade point? To get back to the original units (grade points), take the square root of σ 2 STANDARD DEVIATION (  )STANDARD DEVIATION (  ) – the square root of the variance, σ 2

Calculation of the Standard Deviation (σ) For the grade point data:

Estimating σ 2 SAMPLE VARIANCE (s 2 )SAMPLE VARIANCE (s 2 ) –Best estimate for is  2 is s 2 s 2 is found by using the sample data and using the formula for  2 except:

Sample Variance Formulas

Calculations for s 2 The data from the sample is: 4,2,3,3,2,2,1,4,3,2

Sample Standard Deviation, s The best estimate for  is denoted: s It is called the sample standard deviation s is found by taking the square root of s 2

s 2 for Grouped Data For the grade point example –4 occurs 2 times –3 occurs 3 times –2 occurs 4 times –1 occurs 1 time To calculate the sample variance, s 2, rather than write the term down each time: – Multiply the squared deviations by their class frequencies

Calculation of s 2 -Grouped Data

Extension to Class Data Sometimes data is given in classes instead of individual observations approximationIn this case, the midpoints of the classes can be used to get an approximation of the mean and standard deviation using the grouped data approach

Example Suppose a sample of 100 salaries of CFO’s of small market firms is summarized below. Salary Range Midpoint Frequency Salary Range Midpoint Frequency $ 80,000-$100,000 $ 90, $100,000-$120,000 $110, $120,000-$140,000 $130, $140,000-$160,000 $150, Total = 100

_ Approximations for x and s

Interpreting s (Mound Shaped Distribution) If data forms a mound shaped distribution –Within  1s from the mean Approximately 68% of the measurements –Within  2s from the mean Approximately 95% of the measurements –Within  3s from the mean Approximately all of the measurements

Interpreting s (Any Distribution) If data is not mound shaped ( or shape is unknown) Within  2s from the mean At least 75% of the measurements –Within  3s from the mean At least 88.9% of the measurements –Within  ks from the mean (k > 1) At least 1 -1/k 2 of the measurements

Coefficient of Variation Coefficient of Variation (CV)Another measure of variability that is frequently used to compare different data sets (even if measured in different units) is the: Coefficient of Variation (CV) CV = (Standard Deviation/Mean) x 100%

CV Comparison of Return on Investment $1000 Invested in Stock Fund A: –Average Weekly Return = $1.90 –Standard Deviation = $0.38 $5000 Invested in Stock Fund B: –Average Weekly Return = $12.50 –Standard Deviation = $1.75 CV A = (.38/1.90)x100% = 20% CV B = (1.75/12.50)x100% = 14% Conclusion: Conclusion: Even though the risk (as measured by s) is larger for Stock fund B, the variability of the returns for Stock fund B (as measured by the CV) is less.

Range Approximation for σ If data is relatively mound-shaped a “good” approximation for s is: σ  (range)/4 Sometimes, when one is more certain that the sample range captures the entire population of data statisticians use, σ  (range)/6

Using Excel Suppose population data is in cells A2 to A2001 Population variance (  2 ) = VARP(A2:A2001) Population standard dev. (  ) =STDEVP(A2:A2001) Suppose sample data is in cells A2 to A11 Sample variance (s 2 ) =VAR(A2:A11) Sample standard dev. (s) =STDEV(A2:A11) Data Analysis

Where data values are stored Check Labels Check both: Summary Statistics Confidence Level Enter Name of Output Worksheet

Drag to make Column A wider Sample Standard Deviation Sample Variance

Using EXCEL for Grouped Data The Excel function SUMPRODUCT multiplies the respective numbers of two columns together then adds them up, e.g. =SUMPRODUCT(A2:A4,B2:B4) is equivalent to =A2*B2+A3*B3+A4*B4 This function aids in calculating the values in the following worksheet for calculating the mean, variance, and standard deviation for grouped data

Enter midpoints and frequencies Total Frequencies Square midpoint in Cell A2: =A2^2 Drag down to D6 =SUMPRODUCT(A2:A6,B2:B6)/B8 =(SUMPRODUCT(B2:B6,D2:D6)-200*B10^2)/199 =SQRT(B11)

Review Measures of variability for Populations and Samples –Range –Variance –Standard Deviation Formulas for –Complete Data –Frequency Data Excel –Functions –Data Analysis