Describing Data Using Numerical Measures

Slides:



Advertisements
Similar presentations
Descriptive Statistics
Advertisements

Chapter 3 Describing Data Using Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chap 3-1 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 3 Describing Data: Numerical.
Descriptive Statistics – Central Tendency & Variability Chapter 3 (Part 2) MSIS 111 Prof. Nick Dedeke.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Slides by JOHN LOUCKS St. Edward’s University.
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Measures of Dispersion
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Describing Data: Numerical
Chapter 2 Describing Data with Numerical Measurements
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
Numerical Descriptive Techniques
Methods for Describing Sets of Data
© Copyright McGraw-Hill CHAPTER 3 Data Description.
© The McGraw-Hill Companies, Inc., Chapter 3 Data Description.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Measures of Variation variation A set of data exhibits variation if all of the data are not the same value.
1 MATB344 Applied Statistics Chapter 2 Describing Data with Numerical Measures.
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Chapter 2 Describing Data.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Lecture 3 Describing Data Using Numerical Measures.
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
1 Elementary Statistics Larson Farber Descriptive Statistics Chapter 2.
Larson/Farber Ch 2 1 Elementary Statistics Larson Farber 2 Descriptive Statistics.
INVESTIGATION 1.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill Education.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Cumulative frequency Cumulative frequency graph
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Descriptive Statistics(Summary and Variability measures)
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Chapter 3 Section 3 Measures of variation. Measures of Variation Example 3 – 18 Suppose we wish to test two experimental brands of outdoor paint to see.
Business and Economics 6th Edition
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
St. Edward’s University
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Describing Data with Numerical Measures
St. Edward’s University
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
Business and Economics 7th Edition
Presentation transcript:

Describing Data Using Numerical Measures Chapter 3 Describing Data Using Numerical Measures

Chapter 3 - Chapter Outcomes After studying the material in this chapter, you should be able to: • Compute the mean, median, and mode for a set of data and understand what these values represent. • Compute the range, variance, and standard deviation and know what these values mean. • Know how to construct a box and whiskers graph and be able to interpret it.

Chapter 3 - Chapter Outcomes (continued) After studying the material in this chapter, you should be able to: • Compute the coefficient of variation and z scores and understand how they are applied in decision-making situations. • Use numerical measures along with graphs, charts, and tables to effectively describe data.

Parameters and Statistics A parameter is a measure computed from the entire population. As long as the population does not change, the value of the parameter will not change.

Parameters and Statistics A statistic is a measure computed from a sample that has been selected from the population. The value of the statistic will depend on which sample is selected.

Mean The mean is a numerical measure of the center of a set of quantitative measures computed by dividing the sum of the values by the number of values in the data set.

N = number of data values xi = ith individual value of variable x Population Mean where:  = population mean (mu) N = number of data values xi = ith individual value of variable x

Population Mean Example 3-1 Table 3-1: Foster City Hotel Data

Population Mean Example 3-1 The population mean for the number of rooms rented is computed as follows:

Sample Mean where: = sample mean (pronounced “x-bar”) n = sample size xi = ith individual value of variable x

Sample Mean Housing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000}

Median The median is the center value that divides data that have been arranged in numerical order (i.e. an ordered array) into two halves.

Median Housing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000} Ordered array: $98,000; 100,000; 144,000; 155,000; 177,000; 204,000; 316,000 Middle Value Median = 155,000

Median Another Housing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000; 177,000; 177,000; 170,000} Ordered array: $98,000; 100,000; 144,000; 155,000; 170,000; 177,000; 177,000; 177,000; 204,000; 316,000 Middle Values Median = (170,000 + 177,000)/2 = 173,500

Skewed Data • Right-skewed data: Data are right-skewed if the mean for the data is larger than the median. • Left-skewed data: Data are left -skewed if the mean for the data is smaller than the median.

Skewed Data (Figure 3-3) a) Right-Skewed b) Left-Skewed c) Symmetric Median Mean Mean Median Mean = Median a) Right-Skewed b) Left-Skewed c) Symmetric

Mode The mode is the value in a data set that occurs most frequently. A data set may have more than one mode if two or more values tie for the highest frequency. A data set might not have a mode at all if no value occurs more than one time.

Mode Housing Prices Example {xi} = {house prices} = {$144,000; 98,000; 204,000; 177,000; 155,000; 316,000; 100,000; 177,000; 177,000; 170,000} Data array: $98,000; 100,000; 144,000; 155,000; 170,000; 177,000; 177,000; 177,000; 204,000; 316,000 Mean = 1,718,000/10 = 171,800 Median = 173,500 Mode = 177,000

The median is the 50th percentile. Percentiles The pth percentile in a data array is a value that divides the data into two parts. The lower segment contains at least p% and the upper segment contains at least (100 - p)% of the data. The median is the 50th percentile.

The median corresponds to the second quartile. Quartiles Quartiles in a data array are those values that divide the data set into four equal-sized groups. The median corresponds to the second quartile.

Measures of Variation A set of data exhibits variation if all of the data are not the same value.

R = Maximum Value - Minimum Value Range The range is a measure of variation that is computed by finding the difference between the maximum and minimum values in the data set. R = Maximum Value - Minimum Value

Interquartile Range = Third Quartile - First Quartile The interquartile range is a measure of variation that is determined by computing the difference between the first and third quartiles. Interquartile Range = Third Quartile - First Quartile

Variance & Standard Deviation The population variance is the average of the squared distances of the data values from the population mean. The standard deviation is the positive square root of the variance.

2 = population variance (sigma squared) where: = population mean N = population size 2 = population variance (sigma squared)

Population Variance (Bryce Lumber Example)

Population Standard Deviation (Bryce Lumber Example)

where: = sample mean n = sample size s2 = sample variance

Sample Standard Deviation where: = sample mean n = sample size s = sample standard deviation

Coefficient of Variation The coefficient of variation is the ratio of the standard deviation to the mean expressed as a percentage. The coefficient of variation is used to measure the relative variation in the data.

Coefficient of Variation Population Coefficient of Variation Sample Coefficient of Variation

The Empirical Rule If the data distribution is bell-shaped, then the interval: contains approximately 68% of the values in the population or the sample contains approximately 95% of the values in the population or the sample contains virtually all of the data values in the population or the sample

The Empirical Rule (Figure 3-11) 95% 68% X

Tchebysheff’s Theorem Regardless of how the data are distributed, at least (1 - 1/k2) of the values will fall within k standard deviations of the mean. For example: At least (1 - 1/12) = 0% of the values will fall within k=1 standard deviation of the mean At least (1 - 1/22) = 3/4 = 75% of the values will fall within k=2 standard deviation of the mean At least (1 - 1/32) = 8/9 = 89% of the values will fall within k=3 standard deviation of the mean

Standardized Data Values A standardized data value refers to the number of standard deviations a value is from the mean. The standardized data values are sometimes referred to as z-scores.

Standardized Data Values STANDARDIZED POPULATION DATA where: x = original data value  = population mean  = population standard deviation z = standard score (number of standard deviations x is from )

Standardized Data Values STANDARDIZED SAMPLE DATA where: x = original data value = sample mean s = sample standard deviation z = standard score

Key Terms • Coefficient of Variation • Data Array • Empirical Rule • Interquartile Range • Left-Skewed Data • Mean • Median • Parameter • Percentiles • Quartiles • Range • Right-Skewed Data • Skewed Data • Standard Deviation • Standardized Data Values • Statistic

Key Terms (continued) • Symmetric Data • Tchebysheff’s Theorem • Variance • Variation