Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures.

Slides:



Advertisements
Similar presentations
Measures of Location and Dispersion
Advertisements

Describing Data: Measures of Central Tendency
Lecture 2 Part a: Numerical Measures
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Calculating & Reporting Healthcare Statistics
Chapters 3 & 4 Alan D. Smith Descriptive Statistics -
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson2-1 Lesson 2: Descriptive Statistics.
Slides by JOHN LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Describing Data: Numerical Measures
Central Tendency and Variability
Describing Data: Numerical Measures
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
3 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
Describing Data: Numerical Measures
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
STATISTIC & INFORMATION THEORY (CSNB134) MODULE 2 NUMERICAL DATA REPRESENTATION.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3 – Descriptive Statistics
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
1 1 Slide © 2003 Thomson/South-Western. 2 2 Slide © 2003 Thomson/South-Western Chapter 3 Descriptive Statistics: Numerical Methods Part A n Measures of.
Describing Data: Numerical Measures Chapter 03 McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
4 - 1 Copyright © 2004 by The McGraw-Hill Companies, Inc. All rights reserved.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Describing Data: Numerical Measures. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics,
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
INVESTIGATION 1.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Business Statistics Spring 2005 Summarizing and Describing Numerical Data.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
Chapter 3 Descriptive Statistics: Numerical Methods.
Describing Data: Numerical Measures Chapter 03 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
Day 2. GOALS 1.Calculate the arithmetic mean, weighted mean, median, mode, and geometric mean. 2.Explain the characteristics, uses, advantages, and disadvantages.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Numerical Measures Chapter 3.
Describing Data: Numerical Measures Chapter 3 McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved.
Describing Data: Numerical Measures
Statistics for Business
Chapter 3 Describing Data Basic Statistics for Business and Economics
Chapter 3 Created by Bethany Stubbe and Stephan Kogitz.
Descriptive Statistics
Characteristics of the Mean
Summary descriptive statistics: means and standard deviations:
Describing Data: Numerical Measures
Describing Data with Numerical Measures
Summary descriptive statistics: means and standard deviations:
LESSON 3: CENTRAL TENDENCY
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Describing Data: Numerical Measures
St. Edward’s University
Describing Data: Numerical Measures
Describing Data: Numerical Measures
Presentation transcript:

Chapter Three McGraw-Hill/Irwin © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved. Describing Data: Numerical Measures

In this Chapter (3), we learn to describe data using 2 numerical techniques: 1. Measures of Location 2. Measures of Dispersion

3- 3 Measure of location Often, we want to know in a set of collected data: What is a representative data or the typical value ? OR What is the center/average of the distribution ? Egs. US Family income, Price of a house in LA, Rainfall in Seattle, Batting scores. Read the inset ‘Statistics in action’ on page 58.

3 types of ‘averages’. Mean Median Mode

where  µ “mu” is the population mean  N is the total number of observations. (Note the ‘Capital’ N)  X is a particular raw data value.   “sigma” indicates the operation of adding. Population Mean Population Mean is the sum of all the population values divided by the total number of population values: 3- 5 Mean=Average=Arithmetic Mean (synonyms)

Two terms you should know: i) Parameter - is a measurable characteristic of a population. Hence, Population Mean μ is a Parameter. Ii) Statistic - is a measurable characteristic of a sample. Hence, Sample Mean x is a Statistic.

Example 1 Find the mean mileage for the cars. Parameter A Parameter is a measurable characteristic of a population. The Kiers family owns four cars. The following is the current mileage on each of the four cars. 56,000 23,000 42,000 73,

where n is the total number of values in the sample. ( Note the ‘small’ n ) Sample Mean is the sum of all the sample values divided by the number of sample values: 3- 8 “X bar” (not μ!) Statistic A Statistic is a measurable characteristic of a sample.

A sample of five executives received the following bonus last year ($000): 14.0, 15.0, 17.0, 16.0, The sample mean here is a Statistic (ie, not a Parameter).

Every set of interval-level and ratio-level data has a mean. A set of data has a unique mean. Sum of deviations of each value from the mean is zero*. All values included in computing the mean (a good thing).** The mean is affected by unusually large or small outlier data values (a shortcoming). Properties of the Mean * see next slide ** not true of Median or Mode

Example 3 mean Consider the set of values: 3, 8, and 4. The mean is 5. Illustrating the fifth property. ie, sum of deviations is zero

For an even set of values, the median will be the arithmetic average of the two middle numbers and is found around (n+1)/2 of data. Median The Median is the value at the middle location after all the data have been ordered from the smallest to the largest For an odd set of values, the median will be the middle number and is found at (n+1)/2 of data.

The ages for a sample of five college students are: 21, 25, 19, 20, 22. Arrange the data in ascending order The median is at location (5+1)/2 19, 20, 21, 22, Question: Calculate the median if the age of the 5 th student is 60 years (and not 25).

 Arrange the data in ascending order : 73, 75, 76, 80  The median is around (4+1)/2 = 2.5 th location  Take the mean of 2 nd & 3 rd observation  Thus the median is 75.5 The heights of four basketball players, in inches, are: 76, 73, 80, 75.

 Unique to each data set  Not affected by extremely large or small values (avoids influence of outlier values)  Can be computed for ordinal, interval and ratio level data Properties of the Median Eg. A good measure for Housing Prices

Examples: Examples: The exam scores for ten students are: 81, 93, 84, 75, 68, 87, 81, 75, 81, 87. Because the score of 81 occurs the most often, it is the mode. Mode can help you with making decisions! Prof. Beatle gives out more “B”s than any other grade. If you have excess production capacity, you may make more of the product that sells most. Mode The Mode is another measure of location and represents the value of the observation that appears most frequently

Properties of the Mode Can be used for all levels of data (nominal, ordinal, interval and ratio). Not affected by extreme values Problems: If every data value is unique, there is no mode! You can have equal number of different values leading to multiple modes eg. Bimodal, trimodal, etc.

Practice Problem #62 Page State how you would do (a) & (b)? What is the answer to (c)

Dispersion - spread or variability in the data If you are told the river ahead has an average depth of 4 feet, would you begin crossing it? Mean by itself is not reliable if dispersion is high Useful in comparing two sets of data with same mean value

Measures of dispersion  Range  Variance  Standard deviation

The following represents the current year’s Return on Equity of the 25 companies in an investor’s portfolio. Highest value: 22.1Lowest value: -8.1 Range = Highest value – lowest value = 22.1-(-8.1) = Range Uses just 2 values! Not useful if Range is wide

Variance: - average of the squared deviations from the mean. - larger deviations are given higher weight when squared. Standard deviation Standard deviation: - square root of the variance - brings the variance to the same unit as the data 3- 22

Population Variance Population Variance formula: X is the value of an observation in the population μ is the arithmetic mean of the population N is the number of observations in the population 3- 23

Example – Page 78 Fill me in σ is called the Population Standard Deviation (has the same unit of measure as the original data)

Sample variance (s 2 ) Sample standard deviation (s) (n-1) & NOT n Guaranteed question in any stat test! Watch out if data is a Population ( σ ) or Sample (s).

The hourly wages earned by a sample of five students are: $7, $5, $11, $8, $6. Find the sample variance and standard deviation Practice Time!

Chebyshev’s theorem: For any set of observations, the minimum proportion of the values that lie within k standard deviations of the mean is at least: where k is any constant greater than Say, you conduct a study of heights of all students in class. You compute mean and standard deviation. Now you decided to compute, how many students are within mean ± 1 s.d. how many students are within mean ± 2 s.d. how many students are within mean ± 3 s.d., … etc.

Practice! Page 84 Problems: 49, 50

Empirical Rule Empirical Rule : For any symmetrical, bell-shaped distribution:  About 68% of the observations will lie within 1 s.d. the mean  About 95% of the observations will lie within 2 s.d. of the mean  Virtually all the observations will be within 3 s.d. of the mean 3- 29

See Page 496: Z Column – Values 1, 2, 3 Empirical Rule