©2003 Thomson/South-Western 1 Chapter 3 – Data Summary Using Descriptive Measures Slides prepared by Jeff Heyl, Lincoln University ©2003 South-Western/Thomson.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

Measures of Dispersion
Descriptive Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Data Summary Using Descriptive Measures Introduction to Business Statistics, 5e Kvanli/Guynes/Pavur (c)2000 South-Western College Publishing.
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Measures of Dispersion
Numerical Descriptive Measures
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 3 – Descriptive Statistics
Chapter 3 Averages and Variations
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
© The McGraw-Hill Companies, Inc., Chapter 3 Data Description.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Review Measures of central tendency
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Skewness & Kurtosis: Reference
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 3 Descriptive Statistics II: Additional Descriptive Measures and Data Displays.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
MATH 1107 Elementary Statistics Lecture 3 Describing and Exploring Data – Central Tendency, Variation and Relative Standing.
Descriptive Statistics(Summary and Variability measures)
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics: Numerical Methods
Describing, Exploring and Comparing Data
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Averages and Variation
Descriptive Statistics
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Quartile Measures DCOVA
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Presentation transcript:

©2003 Thomson/South-Western 1 Chapter 3 – Data Summary Using Descriptive Measures Slides prepared by Jeff Heyl, Lincoln University ©2003 South-Western/Thomson Learning™ Introduction to Business Statistics, 6e Kvanli, Pavur, Keeling

©2003 Thomson/South-Western 2 Types of Descriptive Measures  Measures of central tendency  Measures of variation  Measures of position  Measures of shape

©2003 Thomson/South-Western 3 Measures of Central Tendency  The Mean  The Median  The Midrange  The Mode

©2003 Thomson/South-Western 4 The Mean The Mean is simply the average of the data Each value in the sample is represented by x. Thus to get the mean simply add all the values in the sample and divide by the number of values in the sample (n) A Sample Mean x =x =x =x = xxnnxxnnn xxnnxxnnn

©2003 Thomson/South-Western 5 The Population Mean Each value in the population is represented by x. Thus to get the population mean (  ) simply add all the values in the population and divide by the number of values in the population (N)  = = = = xxNNxxNNN xxNNxxNNN

©2003 Thomson/South-Western 6 The Accident Data Set The Accident Data Set x = = x = = If we remove the last value from the data set, then

©2003 Thomson/South-Western 7 The Median The Median (Md) of a set of data is the value in the center of the data values when they are arranged from lowest to highest

©2003 Thomson/South-Western 8 Accident Data Ordered array: 5, 6, 7, 9, 23 The value that has an equal number of items to the right and left is the median If n is an odd number, Md is the center data value of the ordered data set Md = st ordered value n Md = 7

©2003 Thomson/South-Western 9 Even Numbered Data Ordered array: 3, 8, 12, 14 The value that has an equal number of items to the right and left is the median If n is an even number, Md is the average of the two center values of the ordered data set Md = (8 + 12)/2 = 10

©2003 Thomson/South-Western 10 The Midrange The Midrange (Mr) provides an easy- to-grasp measure of central tendency Mr = L + H 2

©2003 Thomson/South-Western 11 Accident Data Ordered array: 5, 6, 7, 9, 23 Mr = = Note: that the Midrange is severely affected by outliers Compare Mr to x = 10 and Md = 7

©2003 Thomson/South-Western 12 The Mode  The Mode (Mo) of a data set is the value that occurs more than once and the most often  The Mode is not always a measure of central tendency; this value need not occur in the center of the data

©2003 Thomson/South-Western 13 Bellaire College Example Figure 3.2

©2003 Thomson/South-Western 14 Bellaire College Example Figure 3.3

©2003 Thomson/South-Western 15 Bellaire College Example Figure 3.4

©2003 Thomson/South-Western 16 Level of Measurement and Measure of Central Tendency Summary of levels of measurement and appropriate measure of central tendency. A “Y” indicates this measure can be used with the corresponding level of measurement. Measure of Central TendencyNominalOrdinalIntervalRatio MeanYY MedianYYY MidrangeYY ModeYYYY Level of Measurement Table 3.1

©2003 Thomson/South-Western 17 Measures of Variation  Homogeneity refers to the degree of similarity within a set of data  The more homogeneous a set of data is, the better the mean will represent a typical value  Variation is the tendency of data values to scatter about the mean, x

©2003 Thomson/South-Western 18 Common Measures of Variation  Range  Variance  Standard Deviation  Coefficient of Variation

©2003 Thomson/South-Western 19 The Range For the Accident data: Range = H - L = = 18 Rather crude measure but easy to calculate and contains valuable information in some situations

©2003 Thomson/South-Western 20 The Variance and Standard Deviation Both measures describe the variation of the values about the mean  (x - x ) = 0  (x - x ) 2 = 220  (x - x ) = 0  (x - x ) 2 = 220 Data Value (x)(x - x )(x - x ) 2

©2003 Thomson/South-Western 21 Sample Variance s2 =s2 =s2 =s2 =  (x - x ) 2 n - 1 Using the accident data: s 2 = = =

©2003 Thomson/South-Western 22 Sample Standard Deviation s =s =s =s =  (x - x ) 2 n - 1 Using the accident data: s = 55.0 = 7.416

©2003 Thomson/South-Western 23 Population Variance and Standard Deviation  = = = =  (x -  ) 2 N 2 =2 =2 =2 = N

©2003 Thomson/South-Western 24 The Coefficient of Variation The Coefficient of Variation (CV) is used to compare the variation of two or more data sets where the values of the data differ greatly CV =  100 sx

©2003 Thomson/South-Western 25 Machined Parts Example Figure 3.6

©2003 Thomson/South-Western 26 Measures of Position  Percentile (Quartile)  Most common measure of position  Quartiles are percentiles with the data divided into quarters  Z-Score  The relative position of a data value expressed in terms of the number of standard deviations above or below the mean

©2003 Thomson/South-Western 27 Percentile Example The 35th Percentile (P 35 ) is that value such that at most 35% of the data values are less than P 35 and at most 65% of the data values are greater than P 35.

©2003 Thomson/South-Western 28 Aptitude Test Scores Table 3.2Ordered array of aptitude test scores for 50 applicants (x = 60.36, s = 18.61)

©2003 Thomson/South-Western 29 Percentile Texon Industries Data 17.5 represents the position of the 35th percentile n = = 17.5 P100 Number of data values, n = 50 Percentile, P = 35

©2003 Thomson/South-Western 30 Percentile Location Rules Rule 1:If n  P/100 is not a counting number, round it up, and the Pth percentile will be the value in this position of the ordered data Rule 2:If n  P/100 is a counting number, the Pth percentile is the average of the number in this location (of the ordered data) and the number in the next largest location

©2003 Thomson/South-Western 31 Aptitude Scores Example Ms. Jensen received a score of 83 on the aptitude test. What is her percentile value? 83 is the 45th largest value out of 50. A guess of the percentile would be: P = 100 = Examining the surrounding values clarifies the true percentile P(n P)/100P th Percentile = 44( )/2 = = th value = = 45( )/2 = 84 Example 3.5

©2003 Thomson/South-Western 32 Quartiles Quartiles are merely particular percentiles that divide the data into quarters, namely: Q 1 = 1st quartile = 25th percentile (P 25 ) Q 2 = 2nd quartile = 50th percentile = median (P 50 ) Q 3 = 3rd quartile = 75th percentile (P 75 )

©2003 Thomson/South-Western 33 Quartile Example Using the applicant data, the first quartile is: Rounded up Q 1 = 13th ordered value = 46 Similarly the third quartile is: P100 n = (50)(.75) = 37.5 ≈ 38 and Q 3 = 75 n = (50)(.25) = 12.5 P100

©2003 Thomson/South-Western 34 Interquartile Range The interquartile range (IQR) is essentially the middle 50% of the data set IQR = Q 3 - Q 1 Using the applicant data, the IQR is: IQR = = 29

©2003 Thomson/South-Western 35 Z-Scores  Z-score determines the relative position of any particular data value x and is based on the mean and standard deviation of the data set  The Z-score is expresses the number of standard deviations the value x is from the mean  A negative Z-score implies that x is to the left of the mean and a positive Z-score implies that x is to the right of the mean

©2003 Thomson/South-Western 36 Z Score Equation z =z =z =z = x - x s For a score of 83 from the aptitude data set, z = = For a score of 35 from the aptitude data set, z = =

©2003 Thomson/South-Western 37 Standardizing Sample Data The process of subtracting the mean and dividing by the standard deviation is referred to as standardizing the sample data. The corresponding z-score is the standardized score.

©2003 Thomson/South-Western 38 Measures of Shape  Skewness  Skewness measures the tendency of a distribution to stretch out in a particular direction  Kurtosis  Kurtosis measures the peakedness of the distribution

©2003 Thomson/South-Western 39 Skewness  In a symmetrical distribution the mean, median, and mode would all be the same value and Sk = 0  A positive Sk number implies a shape which is skewed right and the mode < median < mean  In a data set with a negative Sk value the mean < median < mode

©2003 Thomson/South-Western 40 Skewness Calculation Pearsonian coefficient of skewness Sk = 3(x - Md) s Values of Sk will always fall between -3 and 3

©2003 Thomson/South-Western 41 Histogram of Symmetric Data Frequency x = Md = Mo Figure 3.7

©2003 Thomson/South-Western 42 Histogram with Right (Positive) Skew Relative Frequency Mode(Mo)Median(Md) Sk > 0 Mean (x ) Figure 3.8

©2003 Thomson/South-Western 43 Histogram with Left (Negative) Skew Mode(Mo)Median(Md) Relative Frequency Sk < 0 Mean (x ) Figure 3.9

©2003 Thomson/South-Western 44 Kurtosis  Kurtosis is a measure of the peakedness of a distribution  Large values occur when there is a high frequency of data near the mean and in the tails  The calculation is cumbersome and the measure is used infrequently

©2003 Thomson/South-Western 45 Chebyshev’s Inequality 1.At least 75% of the data values are between x - 2s and x + 2s, or At least 75% of the data values have a z- score value between -2 and 2 3.In general, at least (1-1/k 2 ) x 100% of the data values lie between x - ks and x + ks for any k>1 2.At least 89% of the data values are between x - 3s and x + 3s, or At least 75% of the data values have a z- score value between -3 and 3

©2003 Thomson/South-Western 46 Empirical Rule Under the assumption of a bell shaped population: 1.Approximately 68% of the data values lie between x - s and x + s (have z-scores between -1 and 1) 2.Approximately 95% of the data values lie between x - 2s and x + 2s (have z-scores between -2 and 2) 3.Approximately 99.7% of the data values lie between x - 3s and x + 3s (have z-scores between -3 and 3)

©2003 Thomson/South-Western 47 A Bell-Shaped (Normal) Population Figure 3.10

©2003 Thomson/South-Western 48 Chebyshev’s Versus Empirical Chebyshev’s Actual InequalityEmpirical Rule BetweenPercentagePercentagePercentage x - s and x + s66%—≈ 68% (33 out of 50) x - 2s and x + 2s98%≥ 75%≈ 95% (49 out of 50) x - 3s and x + 3s100%≥ 89%≈ 100% (50 out of 50) Table 3.3 Md = 62 Sk = -.26

©2003 Thomson/South-Western 49 Allied Manufacturing Example Is the Empirical Rule applicable to this data? Probably yes. Histogram is approximately bell shaped. x - 2s = and x + 2s = of the 100 data values fall between these limits closely approximating the 95% called for by the Empirical Rule

©2003 Thomson/South-Western 50 Grouped Data Class NumberClass (Age in years)Frequency 120 and under and under and under and under and under Table 3.4 When raw data are not available Estimate x by assuming data values are equal to the midpoint of their class

©2003 Thomson/South-Western 51 Grouped Data When raw data are not available Estimate x by assuming data values are equal to the midpoint of their class 5 values at ( )/2= values at ( )/2= 35 9 values at ( )/5= 45 6 values at ( )/2= 55 2 values at ( )/2= 65 x = x = = 41.1 (5)(25) + (14)(35) + (9)(45) + (6)(55) + (2)(65)

©2003 Thomson/South-Western 52 Grouped Data When raw data are not available Estimate s 2 by assuming data values are equal to the midpoint of their class and using the normal method s2 =s2 =s2 =s2 = ∑(each data value) 2 - ∑(each data value) 2 /n n - 1 s 2 = = s = = ,100 - (1480) 2 /36 35

©2003 Thomson/South-Western 53 Grouped Data Table 3.5 Summary of calculations Class NumberClassfmf mf m and under , and under , and under , and under , and under ,450 36∑f m = 1,480∑f m 2 = 65,100

©2003 Thomson/South-Western 54 Grouped Data Figure 3.11

©2003 Thomson/South-Western 55 Box Plots Box plots are graphical representations of data sets that illustrate the lowest data value (L), the first quartile (Q 1 ), the median (Q 2, MD), the third quartile (Q 3 ), the interquartile range (IQR), and the highest data value (H)

©2003 Thomson/South-Western 56 Box Plots Given the aptitude test data: L= 22Q 3 = 75 Q 1 = 46IQR= = 29 Q 2 = Md = 62H= 96 ||||||||| L = 22 Q 1 = 46 Md = 62 Q 3 = 75 H = 96 Figure 3.12 x x

©2003 Thomson/South-Western 57 Box Plots Figure 3.13

©2003 Thomson/South-Western 58 Box Plots Figure 3.14

©2003 Thomson/South-Western 59 Box Plots Figure 3.15

©2003 Thomson/South-Western 60 Box Plots Figure 3.16a

©2003 Thomson/South-Western 61 Box Plots Figure 3.16b

©2003 Thomson/South-Western 62 Box Plots Figure Apptitude Score Box Plots for Aptitude Scores Sample 12