Chapter 3 Describing Data Using Numerical Measures

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Measures of Dispersion
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
© 2002 Prentice-Hall, Inc.Chap 3-1 Basic Business Statistics (8 th Edition) Chapter 3 Numerical Descriptive Measures.
Ka-fu Wong © 2007 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics A.A. Elimam College of Business San Francisco State University.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 3-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter Two Treatment of Data.
Statistics for Managers Using Microsoft® Excel 4th Edition
Slides by JOHN LOUCKS St. Edward’s University.
Basic Business Statistics 10th Edition
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Statistics for Managers using Microsoft Excel 6th Global Edition
Business and Economics 7th Edition
Coefficient of Variation
© 2003 Prentice-Hall, Inc.Chap 3-1 Business Statistics: A First Course (3 rd Edition) Chapter 3 Numerical Descriptive Measures.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 3 Describing Data Using Numerical Measures.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
UDM MSC COURSE IN EDUCATION & DEVELOPMENT 2013 –
Describing Data: Numerical
Numerical Descriptive Measures
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Statistics for Managers.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 3 – Descriptive Statistics
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Descriptive Statistics1 LSSG Green Belt Training Descriptive Statistics.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
Skewness & Kurtosis: Reference
Variation This presentation should be read by students at home to be able to solve problems.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
Basic Business Statistics Chapter 3: Numerical Descriptive Measures Assoc. Prof. Dr. Mustafa Yüzükırmızı.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Basic Business Statistics 11 th Edition.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Statistical Methods © 2004 Prentice-Hall, Inc. Week 3-1 Week 3 Numerical Descriptive Measures Statistical Methods.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 2 Describing Data: Numerical
Yandell – Econ 216 Chap 3-1 Chapter 3 Numerical Descriptive Measures.
Statistics for Managers Using Microsoft® Excel 5th Edition
Business and Economics 6th Edition
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
Description of Data (Summary and Variability measures)
Chapter 3 Describing Data Using Numerical Measures
Numerical Descriptive Measures
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Numerical Descriptive Measures
Business and Economics 7th Edition
Presentation transcript:

Chapter 3 Describing Data Using Numerical Measures Business Statistics: A Decision-Making Approach 6th Edition Chapter 3 Describing Data Using Numerical Measures Fundamentals of Business Statistics - Murali Shanker

Chapter Goals To establish the usefulness of summary measures of data. The Scientific Method 1. Formulate a theory 2. Collect data to test the theory 3. Analyze the results 4. Interpret the results, and make decisions Murali Shanker – Fundamentals of Business Statistics

Summary Measures Describing Data Numerically Center and Location Other Measures of Location Variation Mean Range Percentiles Median Interquartile Range Quartiles Mode Variance Weighted Mean Standard Deviation Coefficient of Variation Murali Shanker – Fundamentals of Business Statistics

Measures of Center and Location Overview Center and Location Mean Median Mode Weighted Mean Murali Shanker – Fundamentals of Business Statistics

Mean (Arithmetic Average) The mean of a set of quantitative data, X1,X2,…,Xn, is equal to the sum of the measurements divided by the number of measurements. Sample mean Population mean n = Sample Size N = Population Size Murali Shanker – Fundamentals of Business Statistics

Example Find the mean of the following 5 numbers 5, 3, 8, 5, 6 Murali Shanker – Fundamentals of Business Statistics

Mean (Arithmetic Average) (continued) Affected by extreme values (outliers) For non-symmetrical distributions, the mean is located away from the concentration of items. 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Mean = 3 Mean = 4 Murali Shanker – Fundamentals of Business Statistics

YDI 5.1 and 5.2 Kim’s test scores are 7, 98, 25, 19, and 26. Calculate Kim’s mean test score. Does the mean do a good job of capturing Kim’s test scores? The mean score for 3 students is 54, and the mean score for 4 different students is 76. What is the mean score for all 7 students? Murali Shanker – Fundamentals of Business Statistics

Median The median Md of a data set is the middle number when the measurements are arranged in ascending (or descending) order. Calculating the Median: Arrange the n measurements from the smallest to the largest. If n is odd, the median is the middle number. If n is even, the median is the mean (average) of the middle two numbers. Example: Calculate the median of 5, 3, 8, 5, 6 Murali Shanker – Fundamentals of Business Statistics

Median Not affected by extreme values In an ordered array, the median is the “middle” number. What if the values in the data set are repeated? 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 Median = 3 Median = 3 Murali Shanker – Fundamentals of Business Statistics

Mode Mode is the measurement that occurs with the greatest frequency Example: 5, 3, 8, 6, 6 The modal class in a frequency distribution with equal class intervals is the class with the largest frequency. If the frequency polygon has only a single peak, it is said to be unimodal. If the frequency polygon has two peaks, it is said to be bimodal. Murali Shanker – Fundamentals of Business Statistics

Review Example Five houses on a hill by the beach House Prices: $2,000,000 500,000 300,000 100,000 100,000 Murali Shanker – Fundamentals of Business Statistics

Summary Statistics Mean: Median: Mode: House Prices: $2,000,000 500,000 300,000 100,000 100,000 Sum 3,000,000 Mean: Median: Mode: Murali Shanker – Fundamentals of Business Statistics

Which measure of location is the “best”? Mean is generally used, unless extreme values (outliers) exist Then median is often used, since the median is not sensitive to extreme values. Example: Median home prices may be reported for a region – less sensitive to outliers Murali Shanker – Fundamentals of Business Statistics

Shape of a Distribution Describes how data is distributed Symmetric or skewed Symmetric Right-Skewed Left-Skewed Mean = Median = Mode Mode < Median < Mean Mean < Median < Mode (Longer tail extends to left) (Longer tail extends to right) Murali Shanker – Fundamentals of Business Statistics

Other Location Measures Other Measures of Location Percentiles Quartiles 1st quartile = 25th percentile 2nd quartile = 50th percentile = median 3rd quartile = 75th percentile Let x1, x2,⋯, xn be a set of n measurements arranged in increasing (or decreasing) order. The pth percentile is a number x such that p% of the measurements fall below the pth percentile. Murali Shanker – Fundamentals of Business Statistics

Quartiles Quartiles split the ranked data into 4 equal groups 25% 25% Example: Find the first quartile Sample Data in Ordered Array: 11 12 13 16 16 17 18 21 22 Murali Shanker – Fundamentals of Business Statistics

Box and Whisker Plot A Graphical display of data using 5-number summary: Minimum -- Q1 -- Median -- Q3 -- Maximum Example: 25% 25% 25% 25% Murali Shanker – Fundamentals of Business Statistics

Shape of Box and Whisker Plots The Box and central line are centered between the endpoints if data is symmetric around the median A Box and Whisker plot can be shown in either vertical or horizontal format Murali Shanker – Fundamentals of Business Statistics

Distribution Shape and Box and Whisker Plot Left-Skewed Symmetric Right-Skewed Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3 Murali Shanker – Fundamentals of Business Statistics

Box-and-Whisker Plot Example Below is a Box-and-Whisker plot for the following data: 0 2 2 2 3 3 4 5 5 10 27 This data is very right skewed, as the plot depicts Min Q1 Q2 Q3 Max 0 2 3 5 27 Murali Shanker – Fundamentals of Business Statistics

Coefficient of Variation Measures of Variation Variation Range Variance Standard Deviation Coefficient of Variation Population Standard Deviation Interquartile Range Population Variance Sample Variance Sample Standard Deviation Murali Shanker – Fundamentals of Business Statistics

Variation Measures of variation give information on the spread or variability of the data values. Same center, different variation Murali Shanker – Fundamentals of Business Statistics

Range = xmaximum – xminimum Simplest measure of variation Difference between the largest and the smallest observations: Range = xmaximum – xminimum Example: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 Murali Shanker – Fundamentals of Business Statistics

Disadvantages of the Range Considers only extreme values With a frequency distribution, the range of original data cannot be determined exactly. Murali Shanker – Fundamentals of Business Statistics

Interquartile Range Can eliminate some outlier problems by using the interquartile range Eliminate some high-and low-valued observations and calculate the range from the remaining values. Interquartile range = 3rd quartile – 1st quartile Murali Shanker – Fundamentals of Business Statistics

Interquartile Range Example: Median (Q2) X X Q1 Q3 maximum minimum 25% 25% 25% 25% 12 30 45 57 70 Murali Shanker – Fundamentals of Business Statistics

YDI 5.8 Consider three sampling designs to estimate the true population mean (the total sample size is the same for all three designs): simple random sampling stratified random sampling taking equal sample sizes from the two strata stratified random sampling taking most units from one strata, but sampling a few units from the other strata For which population will design (1) and (2) be comparably effective? For which population will design (2) be the best? For which population will design (3) be the best? Which stratum in this population should have the higher sample size? Murali Shanker – Fundamentals of Business Statistics

Variance Average of squared deviations of values from the mean Sample variance: Example: 5, 3, 8, 5, 6 Murali Shanker – Fundamentals of Business Statistics

Variance The greater the variability of the values in a data set, the greater the variance is. If there is no variability of the values — that is, if all are equal and hence all are equal to the mean — then s2 = 0. The variance s2 is expressed in units that are the square of the units of measure of the characteristic under study. Often, it is desirable to return to the original units of measure which is provided by the standard deviation. The positive square root of the variance is called the sample standard deviation and is denoted by s Murali Shanker – Fundamentals of Business Statistics

Population Variance Population variance: Population Standard Deviation Murali Shanker – Fundamentals of Business Statistics

Comparing Standard Deviations Data A Mean = 15.5 s = 3.338 11 12 13 14 15 16 17 18 19 20 21 Data B Mean = 15.5 s = .9258 11 12 13 14 15 16 17 18 19 20 21 Data C Mean = 15.5 s = 4.57 11 12 13 14 15 16 17 18 19 20 21 Murali Shanker – Fundamentals of Business Statistics

Coefficient of Variation Measures relative variation Always in percentage (%) Shows variation relative to mean Is used to compare two or more sets of data measured in different units Population Sample Murali Shanker – Fundamentals of Business Statistics

YDI Stock A: Average price last year = $50 Standard deviation = $5 Stock B: Average price last year = $100 Murali Shanker – Fundamentals of Business Statistics

Linear Transformations The data on the number of children in a neighborhood of 10 households is as follows: 2, 3, 0, 2, 1, 0, 3, 0, 1, 4. If there are two adults in each of the above households, what is the mean and standard deviation of the number of people (children + adults) living in each household? If each child gets an allowance of $3, what is the mean and standard deviation of the amount of allowance in each household in this neighborhood? Murali Shanker – Fundamentals of Business Statistics

Definitions Let X be the variable representing a set of values, and sx and be the standard deviation and mean of X, respectively. Let Y = aX + b, where a and b are constants. Then, the mean and standard deviation of Y are given by Murali Shanker – Fundamentals of Business Statistics

Standardized Data Values A standardized data value refers to the number of standard deviations a value is from the mean Standardized data values are sometimes referred to as z-scores Murali Shanker – Fundamentals of Business Statistics

Standardized Values A standardized variable Z has a mean of 0 and a standard deviation of 1. where: x = original data value x = sample mean s = sample standard deviation z = standard score (number of standard deviations x is from the mean) Murali Shanker – Fundamentals of Business Statistics

YDI During a recent week in Europe, the temperature X in Celsius was as follows: Based on this Calculate the mean and standard deviation in Fahrenheit. Calculate the standardized score. Day M T W H F S X 40 41 39 38 Murali Shanker – Fundamentals of Business Statistics