Chapter 3 Describing Data Using Numerical Measures

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Measures of Dispersion
Statistics for Managers using Microsoft Excel 6th Edition
Descriptive Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
© 2002 Prentice-Hall, Inc.Chap 3-1 Basic Business Statistics (8 th Edition) Chapter 3 Numerical Descriptive Measures.
Chapter 3 Describing Data Using Numerical Measures
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 3-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
Statistics for Managers Using Microsoft® Excel 4th Edition
Slides by JOHN LOUCKS St. Edward’s University.
1 Business 260: Managerial Decision Analysis Professor David Mease Lecture 1 Agenda: 1) Course web page 2) Greensheet 3) Numerical Descriptive Measures.
Basic Business Statistics 10th Edition
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Statistics for Managers using Microsoft Excel 6th Global Edition
Business and Economics 7th Edition
Coefficient of Variation
© 2003 Prentice-Hall, Inc.Chap 3-1 Business Statistics: A First Course (3 rd Edition) Chapter 3 Numerical Descriptive Measures.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
UDM MSC COURSE IN EDUCATION & DEVELOPMENT 2013 –
Statistics: Use Graphs to Show Data Box Plots.
Describing Data: Numerical
Chapter 2 Describing Data with Numerical Measurements
Describing Data Using Numerical Measures
Numerical Descriptive Measures
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Chapter 2 Describing Data.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
Lecture 3 Describing Data Using Numerical Measures.
Variation This presentation should be read by students at home to be able to solve problems.
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
Basic Business Statistics Chapter 3: Numerical Descriptive Measures Assoc. Prof. Dr. Mustafa Yüzükırmızı.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures (Summary Measures) Basic Business Statistics.
Describing Data Descriptive Statistics: Central Tendency and Variation.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Chapter 2 Describing Data: Numerical
Yandell – Econ 216 Chap 3-1 Chapter 3 Numerical Descriptive Measures.
Descriptive Statistics ( )
Statistics for Managers Using Microsoft® Excel 5th Edition
Business and Economics 6th Edition
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Quartile Measures DCOVA
Business and Economics 7th Edition
Presentation transcript:

Chapter 3 Describing Data Using Numerical Measures Business Statistics: A Decision-Making Approach 7th Edition Chapter 3 Describing Data Using Numerical Measures

Chapter Goals After completing this chapter, you should be able to: Compute and interpret the mean, median, and mode for a set of data Compute the range, variance, and standard deviation and know what these values mean Construct and interpret a box and whisker graph Compute and explain the coefficient of variation and z scores Use numerical measures along with graphs, charts, and tables to describe data

Chapter Topics Measures of Center and Location Mean, median, mode Other measures of Location Weighted mean, percentiles, quartiles Measures of Variation Range, interquartile range, variance and standard deviation, coefficient of variation Using the mean and standard deviation together Coefficient of variation, z-scores

Summary Measures Describing Data Numerically Center and Location Other Measures of Location Variation Mean Range Percentiles Median Interquartile Range Quartiles Mode Variance Weighted Mean Standard Deviation Coefficient of Variation

Measures of Center and Location Mean Median Mode Weighted Mean

Mean (Average) The most common measure of central tendency (continued) The most common measure of central tendency Mean = sum of values divided by the number of values Affected by extreme values (outliers) 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Mean = 3 Mean = 4

Median In an ordered array, the median is the “middle” number, i.e., the number that splits the distribution in half The median is not affected by extreme values Useful when data are highly skewed 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Median = 3 Median = 3

Which measure of location is the “best”? Mean is generally used, unless extreme values (outliers) exist Then Median is often used, since the median is not sensitive to extreme values. Example: Realtors say “Median Home Prices”….not average – less sensitive to outliers

Shape of a Distribution based on Mean and Median Describes how data is distributed Symmetric or skewed Symmetric Right-Skewed Left-Skewed Mean < Median Mean = Median Median < Mean (Longer tail extends to left) (Longer tail extends to right)

Mode The value that is repeated most often Not affected by extreme values Used for either numerical or categorical data There may be no mode There may be several modes 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 1 2 3 4 5 6 Mode = 5 No Mode

Implication Five houses on a hill by the beach House Prices: $2,000,000 500,000 300,000 100,000 100,000

Implication Mean: ($3,000,000/5) = $600,000 = $600,000 Median: middle value of ranked data = $300,000 Mode: most frequent value = $100,000 House Prices: $2,000,000 500,000 300,000 100,000 100,000 Sum 3,000,000

Practice From the class website, download “describing numerical data using Excel.” Try followings: Mean vs. Median Hockey Shoes

Weighted Mean Used when values are grouped by frequency or relative importance (i.e., GPA) Example: Sample of 26 Repair Projects Weighted Mean Days to Complete: Days to Complete Frequency 5 4 6 12 7 8 2

Other Location Measures Other Measures of Location Percentiles Quartiles A percentile is a measure that tells us what percent of the total frequency scored at or below that measure.  SAT score 90th percentile: the score is as high as or higher than 90% of other students who took the SAT test. When statistical data is ordered sequentially, the data can be divided into 4 groups of data. Each of the groups of data is separated by a line called a Quartile. Example 3-8 on pp. 99

Percentiles Example 1: Find the 60th percentile in an ordered array of 19 values. So, use value in the i = 12th position If i is not an integer, round off to the next higher integer value. If i is an integer, the pth percentile is the average of the values in position i and i+1.

Quartiles Quartiles split the ranked data into 4 equal groups: Note that the second quartile (the 50th percentile) is the median 25% 25% 25% 25% Q1 Q2 Q3

Quartiles 25 Example: Find the first quartile (25th percentile) Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22 (n = 9) Q1 = 25th percentile, so find i : i = (9) = 2.25 so round up and use the value in the 3rd position: Q1 = 13 25 100

Practice Use “describing numerical data using Excel” file, try following: Salary

Box and Whisker Plot Statistics assumes that your data points (the numbers in your list) are clustered around some central value. The "box" in the box-and-whisker plot contains and highlights the middle half of data points. It is useful for quickly finding outliers - data points out of line with the rest of the data set. The Box and Whisker plot can be shown in either vertical or horizontal format. Example 3-9 on pp.100

Box and Whisker Plot Outliers are plotted outside the calculated limits and the center box extends from Q1 to Q3. 25% 25% 25% 25% * * Outliers Lower 1st Median 3rd Upper Limit Quartile Quartile Limit The lower limit is Q1 – 1.5 (Q3 – Q1) The upper limit is Q3 + 1.5 (Q3 – Q1)

Box-and-Whisker Plot Example Below is a Box-and-Whisker plot for the following data: 0 2 2 2 3 3 4 5 6 11 27 This data is right skewed, as the plot depicts Min Q1 Q2 Q3 Max * 0 2 3 6 12 27 Upper limit = Q3 + 1.5 (Q3 – Q1) = 6 + 1.5 (6 – 2) = 12 27 is above the upper limit so is shown as an outlier

Distribution Shape and Box and Whisker Plot Left-Skewed Symmetric Right-Skewed Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3

Practice Using Free Online Tool Using the data set on the website, develop a Box-and-Whisker Plot (free online tool).

Coefficient of Variation Measures of Variation Variation Range Variance Standard Deviation Coefficient of Variation Population Standard Deviation Interquartile Range Population Variance Sample Variance Sample Standard Deviation

Variation Measures of variation give information on the spread of the data values. Same center, different variation

Range (refer to Measures of Variability) Simplest measure of variation Difference between the largest and the smallest observations: Range = xmaximum – xminimum Example: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Range = 14 - 1 = 13

Disadvantages of the Range Ignores the way in which data are distributed Sensitive to outliers (extreme values) 7 8 9 10 11 12 7 8 9 10 11 12 Range = 12 - 7 = 5 Range = 12 - 7 = 5 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5 Range = 5 - 1 = 4 1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120 Range = 120 - 1 = 119

Interquartile Range (refer to Measures of Variability) Eliminate range’s susceptibility to extreme values Can eliminate some outlier problems Eliminate some high-and low-valued observations and calculate the range from the remaining values. Interquartile range = 3rd quartile – 1st quartile Example 3-10 on pp.108

Interquartile Range Example Median (Q2) X X Q1 Q3 maximum minimum 25% 25% 25% 25% 12 30 45 57 70 Interquartile range = 57 – 30 = 27

Variance Average of squared deviations of values from the mean Population variance: Sample variance:

Standard Deviation Most commonly used measure of variation Shows variation about the mean Has the same units as the original data Population standard deviation: Sample standard deviation:

Comparing Standard Deviations Same mean, but different standard deviations: Data A Mean = 15.5 s = 3.338 11 12 13 14 15 16 17 18 19 20 21 Data B Mean = 15.5 s = .9258 11 12 13 14 15 16 17 18 19 20 21 Data C Mean = 15.5 s = 4.57 11 12 13 14 15 16 17 18 19 20 21

Practice Try the “Variance Vs. Standard Deviation” link on the class website Use “describing numerical data using Excel” file, try following: Std.

Coefficient of Variation The coefficient of variation measures the relative variation for distributions with different means. In finance, CV measures the relative risk of a stock portfolio It is usually expressed as a percentage. Population Sample

Comparing Coefficients of Variation Stock A: Average price last year = $50 Standard deviation = $5 Stock B: Average price last year = $100 Both stocks have the same standard deviation, but stock B is less variable relative to its price

The Empirical Rule If the data distribution is bell-shaped, then the interval: contains about 68% of the values in the population or the sample 68%

The Empirical Rule contains about 95% of the values in the population or the sample contains about 99.7% of the values in the population or the sample 95% 99.7%

Practice: Getting Descriptive Statistics Using Excel Descriptive Statistics are easy to obtain from Microsoft Excel Use menu choice: Data / data analysis / descriptive statistics Enter details in dialog box

Open Excel and then enter the data Select: Data / data analysis / descriptive statistics

Using Excel Enter dialog box details Check box for summary statistics (continued) Enter dialog box details Check box for summary statistics Click OK

Excel output Microsoft Excel descriptive statistics output, using the house price data: How to read “Skewness”? House Prices: $2,000,000 500,000 300,000 100,000 100,000

Chapter Summary Described measures of center and location Mean, median, mode, weighted mean Discussed percentiles and quartiles Created Box and Whisker Plots Illustrated distribution shapes Symmetric, skewed

Chapter Summary Described measure of variation (continued) Described measure of variation Range, interquartile range, variance, standard deviation, coefficient of variation Discussed Tchebysheff’s Theorem Calculated standardized data values