Measuring Variation – The Five-Number Summary

Presentation transcript:

Measuring Variation – The Five-Number Summary Lecture 18 Sec. 5.3.1 – 5.3.3 Mon, Feb 19, 2007

The Five-Number Summary A five-number summary of a sample or population consists of five numbers that divide the sample or population into four equal parts. These numbers are called the quartiles. 0th Quartile = minimum. 1st Quartile = Q1. 2nd Quartile = median. 3rd Quartile = Q3. 4th Quartile = maximum.

Example If the distribution were uniform from 0 to 10, what would be the five-number summary? [Figure: uniform density over 0 to 10. The median splits the area into two 50% halves; Q1, the median, and Q3 split it into four 25% parts.]

Example Where would the median and quartiles be in the following non-uniform distribution? [Figure: non-uniform density over a horizontal axis marked from 1 to 7.]

Example For the uniform distribution from 0 to 10, the five-number summary is Minimum = 0, Q1 = 2.5, Median = 5, Q3 = 7.5, Maximum = 10. If the distribution is not uniform, or if we are dealing with a list of numbers, the answer will not be so clear.

Quartiles – TI-83’s Method To find the quartiles, first find the median (2nd quartile). Then the 1st quartile is the “median” of all the numbers that are listed before the median. The 3rd quartile is the “median” of all the numbers that are listed after the median.

Example Find the quartiles of the sample 5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32. The median is 19, the middle (6th) value. Q1 = 10 is the median of 5, 8, 10, 15, 17, and Q3 = 25 is the median of 20, 24, 25, 30, 32. Min = 5, Q1 = 10, Median = 19, Q3 = 25, Max = 32.

Example Find the quartiles of the sample 5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32, 33. Min = 5, Q1 = 12.5, Median = 19.5, Q3 = 27.5, Max = 33.
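
Example The median-of-halves rule used in the two examples above can be sketched in Python. This is a minimal illustration, not the calculator's actual code; the function name five_number_summary_ti83 is made up for this sketch, and it assumes the middle value is left out of both halves when n is odd.

```python
from statistics import median

def five_number_summary_ti83(data):
    """Five-number summary by the median-of-halves rule (hypothetical helper)."""
    xs = sorted(data)
    n = len(xs)
    half = n // 2
    lower = xs[:half]          # values listed before the median
    upper = xs[half + n % 2:]  # values listed after the median (skips the middle value when n is odd)
    return (xs[0], median(lower), median(xs), median(upper), xs[-1])

print(five_number_summary_ti83([5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32]))
# (5, 10, 19, 25, 32)
print(five_number_summary_ti83([5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32, 33]))
# (5, 12.5, 19.5, 27.5, 33)
```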

Percentiles – Textbook’s Method The pth percentile – A value that separates the lower p% of a sample or population from the upper (100 – p)%. p% or more of the values fall at or below the pth percentile, and (100 – p)% or more of the values fall at or above the pth percentile.

Percentiles – Textbook’s Method Find the 25th percentile of the following sample: 5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32.

Percentiles – Textbook’s Method
Value   % at or below   % at or above
5       9%              100%
8       18%             91%
10      27%             82%
15      36%             73%
17      45%             64%
19      55%             55%
20      64%             45%
24      73%             36%
25      82%             27%
30      91%             18%
32      100%            9%
The 25th percentile (Q1) is 10: 27% of the values are at or below it and 82% are at or above it. In the same way, Median = 19 and Q3 = 25, with Min = 5 and Max = 32.
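
Example The textbook's definition can be checked mechanically. The sketch below is one possible reading of it, not the textbook's own procedure: the hypothetical function textbook_percentile returns every sample value that satisfies both conditions, and it leaves open what to do when more than one value qualifies. Integer arithmetic is used so that borderline percentages compare exactly.

```python
def textbook_percentile(data, p):
    """Return all sample values that qualify as the pth percentile (hypothetical helper)."""
    xs = sorted(data)
    n = len(xs)
    hits = []
    for x in xs:
        at_or_below = sum(1 for v in xs if v <= x)
        at_or_above = sum(1 for v in xs if v >= x)
        # count/n >= p/100 is rewritten as 100*count >= p*n to avoid float rounding
        if 100 * at_or_below >= p * n and 100 * at_or_above >= (100 - p) * n:
            hits.append(x)
    return hits

sample = [5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32]
print(textbook_percentile(sample, 25))  # [10]
print(textbook_percentile(sample, 50))  # [19]
print(textbook_percentile(sample, 75))  # [25]
```

With an even sample size, two values can qualify. For 5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32, 33 and p = 50 the sketch returns [19, 20].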

Percentiles – Excel’s Formula To find the position, or rank, of the pth percentile in a sample of size n, compute the value $r = 1 + \frac{p}{100}(n - 1)$.

Excel’s Percentile Formula This gives the position (r = rank) of the pth percentile. Round r to the nearest whole number. The number in that position is the pth percentile.

Excel’s Percentile Formula Special case: If r is a “half-integer,” for example 10.5, then take the average of the numbers in the positions on either side of r (here, positions 10 and 11), just as we did for the median when n was even. Microsoft Excel will interpolate whenever r is not a whole number. Therefore, by rounding, our answers may differ from Excel’s.

Example Use Excel’s formula to find a 5-number summary of 5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32. FiveNumberSummary.xls
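
Example The rounding rule can be sketched in Python. The sketch assumes the rank formula r = 1 + (p/100)(n − 1), which is the position at which Excel's PERCENTILE function interpolates; the function name percentile_by_rounded_rank is invented here, and p is assumed to be a whole number. Exact fractions are used so that half-integers are detected reliably.

```python
from fractions import Fraction

def percentile_by_rounded_rank(data, p):
    """pth percentile via the rank formula with rounding (hypothetical helper)."""
    xs = sorted(data)
    n = len(xs)
    r = 1 + Fraction(p, 100) * (n - 1)   # position (rank) of the pth percentile
    if r.denominator == 2:               # half-integer such as 10.5
        lo = int(r - Fraction(1, 2))     # 1-based position just below r
        return (xs[lo - 1] + xs[lo]) / 2 # average the two neighbouring values
    nearest = int(r + Fraction(1, 2))    # otherwise round r to the nearest whole number
    return xs[nearest - 1]

sample = [5, 8, 10, 15, 17, 19, 20, 24, 25, 30, 32]
print([percentile_by_rounded_rank(sample, p) for p in (0, 25, 50, 75, 100)])
# [5, 12.5, 19, 24.5, 32]
```

Here Q1 and Q3 land on the half-integer ranks 3.5 and 8.5, so each is an average of two neighbouring values.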

The Principle Excel’s formula is based on the gaps between the numbers, not the numbers themselves.

Example Find the quartiles for the sample 5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240.

Excel’s Percentile Formula The formula may be reversed to find the percentile rank of a number, given its position, or rank, in the sample. With r the position and n the sample size, the formula is $p = 100 \cdot \frac{r - 1}{n - 1}$.

Example In the sample 22, 28, 31, 40, 42, 56, 78, 88, 97, what percentile rank is associated with 40?

Example For the sample 5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240, what is the percentile rank of 45?
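
Example The reversed calculation can be sketched the same way. The sketch assumes the reversed formula p = 100(r − 1)/(n − 1), that the value occurs exactly once in the sample, and that a result ending in .5 is rounded up; the function name percentile_rank is invented for illustration.

```python
from fractions import Fraction

def percentile_rank(data, value):
    """Percentile rank of a sample value, rounded to a whole number (hypothetical helper)."""
    xs = sorted(data)
    n = len(xs)
    r = xs.index(value) + 1              # 1-based position of the value
    p = Fraction(100 * (r - 1), n - 1)   # reversed rank formula
    return int(p + Fraction(1, 2))       # round half up

print(percentile_rank([22, 28, 31, 40, 42, 56, 78, 88, 97], 40))                 # 38 (37.5 before rounding)
print(percentile_rank([5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240], 45))     # 30
```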

The Interquartile Range The interquartile range (IQR) is the difference between Q3 and Q1. The IQR is a commonly used measure of spread, or variability. Like the median, it is not affected by extreme outliers.

Example The IQR of 22, 28, 31, 40, 42, 56, 78, 88, 97 is IQR = Q3 – Q1 = 78 – 31 = 47. Find the IQR for the sample 5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240.
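
Example A short Python check of the IQR and of its resistance to outliers. This sketch uses the standard library's statistics.quantiles with method="inclusive", which interpolates between positions rather than rounding; for these two samples the quartile ranks are whole numbers or half-integers, so it gives the same quartiles as the rounding rule. The helper name iqr and the outlier value 9700 are made up for illustration.

```python
from statistics import quantiles

def iqr(data):
    """Interquartile range Q3 - Q1 (hypothetical helper)."""
    q1, _median, q3 = quantiles(data, n=4, method="inclusive")
    return q3 - q1

sample = [22, 28, 31, 40, 42, 56, 78, 88, 97]
print(iqr(sample))                       # 47.0

# Replace the largest value with an extreme outlier: the IQR does not move,
# while the range (max - min) changes drastically.
with_outlier = sample[:-1] + [9700]
print(iqr(with_outlier))                 # 47.0
print(max(sample) - min(sample))         # 75
print(max(with_outlier) - min(with_outlier))  # 9678
```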

Two Homework Problems (1) For the % on-time-arrival data (p. 252), use the formula, with rounding, to find the 10th, 43rd, 69th, and 95th percentiles. (2) Use the formula, with rounding, to find the percentile ranks of the following % on-time arrivals: 76.0, 81.1, 85.8, 90.3.