 Roughly describes where the center of the data is in the set.  2 Important measures of center: a) Mean b) Median.

Slides:



Advertisements
Similar presentations
CHAPTER 1 Exploring Data
Advertisements

Chapter 3 Describing Data Using Numerical Measures
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Intro to Descriptive Statistics
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
CHAPTER 2: Describing Distributions with Numbers
Measures of Central Tendency
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
Describing Distributions With Numbers Section 1.3 (mean, median, range, quartiles, IQR) Target Goal: I can analyze data using shape, center and spread.
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Statistics: Concepts and Controversies
Chapter 12, Part 2 STA 291 Summer I Mean and Standard Deviation The five-number summary is not the most common way to describe a distribution numerically.
Histograms. Histogram: Class width Range = Largest – Smallest
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Lecture 3 Describing Data Using Numerical Measures.
BUS304 – Data Charaterization1 Other Numerical Measures  Median  Mode  Range  Percentiles  Quartiles, Interquartile range.
Mean, Median, and Mode 3.1 Measures of Central Tendency.
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
Chapter 3 Looking at Data: Distributions Chapter Three
Describing Quantitative Data with Numbers Section 1.3.
Means & Medians Chapter 4. Parameter - Fixed value about a population Typical unknown.
1.2 Describing Distributions with Numbers Is the mean a good measure of center? Ex. Roger Maris’s yearly homerun production:
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
1 Chapter 4 Numerical Methods for Describing Data.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
# credit card returns: (on the calculator…)
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers.
Copyright © Cengage Learning. All rights reserved. Averages and Variation 3.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
CHAPTER 1 Exploring Data
Measures of Central Tendency
Business and Economics 6th Edition
3 Averages and Variation
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 2: Describing Distributions with Numbers
Chapter 3 Describing Data Using Numerical Measures
Measures of Central Tendency: Mode, Median, and Mean
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Measures of Central Tendency
SWBAT: Measure center with the mean and median and spread with interquartile range. Do Now:
CHAPTER 1 Exploring Data
1.3 Describing Quantitative Data with Numbers
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
3 Averages and Variation
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

 Roughly describes where the center of the data is in the set.  2 Important measures of center: a) Mean b) Median

SamplePopulation

23, 25, 26, 29, 39, 42, 50 Show Your Work – Including the Formula that you used!

 The 50 states plus the District of Columbia have a total of 3137 counties. There are a total of 248,709,873 people in each of these counties. Find the average population per county.

Sample 1Sample 2Sample 3 20,09528,89516, ,97810, ,38416,17473,478 13,931959,27514,798 24,96030,79713,859 Do you suppose these are close to the values we’d get if we could use the population?

 We will study this further later on to see how to be able to use samples to predict the populations better.

The mean tells us how large each observation in the data set would be if the total were split equally among all the observations.

 A group of elementary school children was asked how many pets they have. Here are there responses. Find the mean and explain what it means We can look at it this way: If every child in the group had the same number of pets, each would have 5 pets.

 In the last unit, we introduced the median as an informal measure of center that described the “midpoint” of a distribution.  Now, it is time to offer an official “rule” for calculating the median.

 The median is the value in the middle!  50% of the data is above and below this value.  Steps: 1. Put numbers in order. 2. If the # of numbers is odd – median is the middle number. 3. If the # of numbers is even – median is the average of the two #’s in the middle.

Median = 12

“In the sample of New York workers, about half of the people reported traveling less than 22.5 minutes to work, and about half reported traveling more.”

Stemplot:

 The median travel time is 20 minutes.  The mean travel time is higher, 22.5 minutes.  The mean is pulled toward the right tail of this right-skewed distribution.  The median, unlike the mean, is resistant.  If the longest travel time were 600 minutes rather than 60 minutes, the mean would get higher, but the median would not change at all! Mean = 22.5 Median = 20

 The mean and median of a roughly symmetric distribution are close together.  If the distribution is exactly symmetric, the mean and median are exactly the same.  In a skewed distribution, the mean is usually farther out in the long tail than is the median.

 The mean is greatly affected by outliers – it is very sensitive to them – which means it is pulled towards the outlier.  The median is insensitive to outliers. It is often used more because it is more stable.

 Salaries for MLB players or NFL players  Scores on a test when there’s one that has NOT been made up yet.  Home prices  Personal Incomes  College tuitions

 Based only on the stemplot, would you expect the mean travel time to be less than, about the same as, or larger than the median? Why? Since the distribution is skewed to the right, we would expect the mean to be larger than the median.

 Use your calculator to find the mean and median travel time. Was your answer to Question 1 correct? The mean is minutes, which is bigger than the median of 22.5 minutes.

 Interpret your result from Question 2 in context without using the words “mean” or “average.” If we divided the travel time up evenly among all 20 people, each would have a minute travel time.

 Would the mean or median be a more appropriate summary of the center of this distribution of drive times? Justify your answer. Since the distribution is skewed, the median would be a better measure of the center.

 M F F F M F M F M M F F F M For a Population use “P”

 A disadvantage of the mean is that it can be affected by extremely high or low values.  One way to make the mean more resistant to exceptional values and still sensitive to specific data values is to do a trimmed mean.

 Order the data – delete a selected number of values from each end of the list then average the remaining values.  Trimming Percentage: The percent of values trimmed from the list.

Example a) Compute the mean for the entire sample. b) Compute a 5% trimmed mean.

Example c) Compute the median for the entire sample. d) Compute a 5% trimmed median. The median is still e) Is the trimmed mean or the original mean closer to the median? Trimmed Mean