Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Chapter 3: Descriptive Statistics
Measures of Dispersion
Descriptive Statistics
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
DESCRIBING DATA: 2. Numerical summaries of data using measures of central tendency and dispersion.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Analysis of Research Data
Slides by JOHN LOUCKS St. Edward’s University.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Describing Data Using Numerical Measures
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Numerical Descriptive Techniques
Methods for Describing Sets of Data
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 2 Describing Data.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Averages and Variation Chapter 3 Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Summary Statistics: Measures of Location and Dispersion.
Chapter 3 Averages and Variation Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
LIS 570 Summarising and presenting data - Univariate analysis.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
COMPLETE BUSINESS STATISTICS
Exploratory Data Analysis
Business and Economics 6th Edition
Chapter 3 Describing Data Using Numerical Measures
Descriptive measures Capture the main 4 basic Ch.Ch. of the sample distribution: Central tendency Variability (variance) Skewness kurtosis.
Chapter 3 Created by Bethany Stubbe and Stephan Kogitz.
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Averages and Variation
NUMERICAL DESCRIPTIVE MEASURES
Descriptive Statistics
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Business and Economics 7th Edition
Presentation transcript:

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-2 Learning Objectives Distinguish between measures of central tendency, measures of variability, measures of shape, and measures of association. Understand the meanings of mean, median, mode, quartile, percentile, and range. Compute mean, median, mode, percentile, quartile, range, variance, standard deviation, and mean absolute deviation on ungrouped data. Differentiate between sample and population variance and standard deviation.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-3 Learning Objectives -- Continued Understand the meaning of standard deviation as it is applied by using the empirical rule and Chebyshev’s theorem. Compute the mean, median, standard deviation, and variance on grouped data. Understand box and whisker plots, skewness, and kurtosis. Compute a coefficient of correlation and interpret it.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-4 Measures of Central Tendency: Ungrouped Data Measures of central tendency yield information about “particular places or locations in a group of numbers.” Common Measures of Location –Mode –Median –Mean –Percentiles –Quartiles

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-5 Mode The most frequently occurring value in a data set Applicable to all levels of data measurement (nominal, ordinal, interval, and ratio) Bimodal -- Data sets that have two modes Multimodal -- Data sets that contain more than two modes

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-6 The mode is 44. There are more 44s than any other value Mode -- Example

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-7 Median Middle value in an ordered array of numbers. Applicable for ordinal, interval, and ratio data Not applicable for nominal data Unaffected by extremely large and extremely small values.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-8 Median: Computational Procedure First Procedure –Arrange the observations in an ordered array. –If there is an odd number of terms, the median is the middle term of the ordered array. –If there is an even number of terms, the median is the average of the middle two terms. Second Procedure –The median’s position in an ordered array is given by (n+1)/2.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-9 Median: Example with an Odd Number of Terms Ordered Array There are 17 terms in the ordered array. Position of median = (n+1)/2 = (17+1)/2 = 9 The median is the 9th term, 15. If the 22 is replaced by 100, the median is 15. If the 3 is replaced by -103, the median is 15.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Median: Example with an Even Number of Terms Ordered Array There are 16 terms in the ordered array. Position of median = (n+1)/2 = (16+1)/2 = 8.5 The median is between the 8th and 9th terms, If the 21 is replaced by 100, the median is If the 3 is replaced by -88, the median is 14.5.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Arithmetic Mean Commonly called ‘the mean’ is the average of a group of numbers Applicable for interval and ratio data Not applicable for nominal or ordinal data Affected by each value in the data set, including extreme values Computed by summing all values in the data set and dividing the sum by the number of values in the data set

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Population Mean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Sample Mean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Percentiles Measures of central tendency that divide a group of data into 100 parts At least n% of the data lie below the nth percentile, and at most (100 - n)% of the data lie above the nth percentile Example: 90th percentile indicates that at least 90% of the data lie below it, and at most 10% of the data lie above it The median and the 50th percentile have the same value. Applicable for ordinal, interval, and ratio data Not applicable for nominal data

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Percentiles: Computational Procedure Organize the data into an ascending ordered array. Calculate the percentile location: Determine the percentile’s location and its value. If i is a whole number, the percentile is the average of the values at the i and (i+1) positions. If i is not a whole number, the percentile is at the (i+1) position in the ordered array.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Percentiles: Example Raw Data: 14, 12, 19, 23, 5, 13, 28, 17 Ordered Array: 5, 12, 13, 14, 17, 19, 23, 28 Location of 30th percentile: The location index, i, is not a whole number; i+1 = 2.4+1=3.4; the whole number portion is 3; the 30th percentile is at the 3rd location of the array; the 30th percentile is 13.

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Quartiles Measures of central tendency that divide a group of data into four subgroups Q 1 : 25% of the data set is below the first quartile Q 2 : 50% of the data set is below the second quartile Q 3 : 75% of the data set is below the third quartile Q 1 is equal to the 25th percentile Q 2 is located at 50th percentile and equals the median Q 3 is equal to the 75th percentile Quartile values are not necessarily members of the data set

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Quartiles 25% Q3Q3 Q2Q2 Q1Q1

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Ordered array: 106, 109, 114, 116, 121, 122, 125, 129 Q 1 Q 2 : Q 3 : Quartiles: Example

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Variability Mean No Variability in Cash Flow Variability in Cash FlowMean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Variability No Variability Variability

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Measures of Variability: Ungrouped Data Measures of variability describe the spread or the dispersion of a set of data. Common Measures of Variability –Range –Interquartile Range –Mean Absolute Deviation –Variance –Standard Deviation – Z scores –Coefficient of Variation

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Range The difference between the largest and the smallest values in a set of data Simple to compute Ignores all data points except the two extremes Example: Range = Largest - Smallest= =

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Interquartile Range Range of values between the first and third quartiles Range of the “middle half” Less influenced by extremes

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Deviation from the Mean Data set: 5, 9, 16, 17, 18 Mean: Deviations from the mean: -8, -4, 3, 4,

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Mean Absolute Deviation Average of the absolute deviations from the mean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Population Variance Average of the squared deviations from the arithmetic mean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Population Standard Deviation Square root of the variance

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Sample Variance Average of the squared deviations from the arithmetic mean 2,398 1,844 1,539 1,311 7, ,625 5,041 54, , ,866

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Sample Standard Deviation Square root of the sample variance 2,398 1,844 1,539 1,311 7, ,625 5,041 54, , ,866

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Uses of Standard Deviation Indicator of financial risk Quality Control –construction of quality control charts –process capability studies Comparing populations –household incomes in two cities –employee absenteeism at two plants

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Standard Deviation as an Indicator of Financial Risk Annualized Rate of Return Financial Security  A15%3% B15%7%

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Empirical Rule Data are normally distributed (or approximately normal) Distance from the Mean Percentage of Values Falling Within Distance

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Chebyshev’s Theorem Applies to all distributions

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Chebyshev’s Theorem Applies to all distributions 1-1/3 2 = /2 2 = 0.75 Distance from the Mean Minimum Proportion of Values Falling Within Distance Number of Standard Deviations K = 2 K = 3 K = 4 1-1/4 2 = 0.94

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Coefficient of Variation Ratio of the standard deviation to the mean, expressed as a percentage Measurement of relative dispersion

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Coefficient of Variation

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Measures of Central Tendency and Variability: Grouped Data Measures of Central Tendency –Mean –Median –Mode Measures of Variability –Variance –Standard Deviation

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Mean of Grouped Data Weighted average of class midpoints Class frequencies are the weights

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Calculation of Grouped Mean Class IntervalFrequencyClass Midpoint fM 20-under under under under under under

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Median of Grouped Data

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Median of Grouped Data -- Example Cumulative Class IntervalFrequency Frequency 20-under under under under under under N = 50

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Mode of Grouped Data Midpoint of the modal class Modal class has the greatest frequency Class IntervalFrequency 20-under under under under under under 80 1

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Variance and Standard Deviation of Grouped Data PopulationSample

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Population Variance and Standard Deviation of Grouped Data under under under under under under

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Measures of Shape Skewness –Absence of symmetry –Extreme values in one side of a distribution Kurtosis –Peakedness of a distribution –Leptokurtic: high and thin –Mesokurtic: normal shape –Platykurtic: flat and spread out Box and Whisker Plots –Graphic display of a distribution –Reveals skewness

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Skewness Negatively Skewed Positively Skewed Symmetric (Not Skewed)

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Skewness Negatively Skewed Mode Median Mean Symmetric (Not Skewed) Mean Median Mode Positively Skewed Mode Median Mean

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Coefficient of Skewness Summary measure for skewness If S < 0, the distribution is negatively skewed (skewed to the left). If S = 0, the distribution is symmetric (not skewed). If S > 0, the distribution is positively skewed (skewed to the right).

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Coefficient of Skewness

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Kurtosis Peakedness of a distribution –Leptokurtic: high and thin –Mesokurtic: normal in shape –Platykurtic: flat and spread out Leptokurtic Mesokurtic Platykurtic

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Box and Whisker Plot Five secific values are used: –Median, Q 2 –First quartile, Q 1 –Third quartile, Q 3 –Minimum value in the data set –Maximum value in the data set Inner Fences –IQR = Q 3 - Q 1 –Lower inner fence = Q IQR –Upper inner fence = Q IQR Outer Fences –Lower outer fence = Q IQR –Upper outer fence = Q IQR

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Box and Whisker Plot Q1Q1 Q3Q3 Q2Q2 MinimumMaximum

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Skewness: Box and Whisker Plots, and Coefficient of Skewness Negatively Skewed Positively Skewed Symmetric (Not Skewed) S < 0 S = 0 S > 0

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Pearson Product-Moment Correlation Coefficient

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Three Degrees of Correlation r < 0r > 0 r = 0

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Computation of r for the Economics Example (Part 1) Day Interest X Futures Index Y ,8411, ,2841, ,0761, ,6251, ,1761, ,7291, ,7291, ,0761, ,0761, ,2251, ,2891, ,0811, Summations92.932, ,20721, X2X2 Y2Y2 XY

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Computation of r for the Economics Example (Part 2)

Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons Scatter Plot and Correlation Matrix for the Economics Example Interest Futures Index InterestFutures Index Interest1 Futures Index