Lecture 16 Sec. 5.3.1 – 5.3.3 Mon, Oct 2, 2006 Measuring Variation – The Five-Number Summary.



The Five-Number Summary
A five-number summary of a sample or population consists of five numbers that divide the sample or population into four equal parts. These numbers are called the quartiles.
0th Quartile = minimum.
1st Quartile = Q1.
2nd Quartile = median.
3rd Quartile = Q3.
4th Quartile = maximum.

Example
If the distribution were uniform from 0 to 10, what would be the five-number summary?

Example
The five-number summary is
Minimum = 0,
Q1 = 2.5,
Median = 5,
Q3 = 7.5,
Maximum = 10.
If the distribution is not uniform, or if we are dealing with a list of numbers, the answer will not be so clear.
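The arithmetic behind the uniform example can be sketched in a few lines. This is only an illustration: the helper name `uniform_quantile` is hypothetical, and it assumes a continuous uniform distribution, where the value below which a fraction q of the distribution lies is low + q(high – low).

```python
def uniform_quantile(q, low=0.0, high=10.0):
    """Value below which a fraction q of a uniform(low, high) distribution lies.

    Hypothetical helper for illustration; assumes 0 <= q <= 1.
    """
    return low + q * (high - low)


# Fractions 0, 1/4, 1/2, 3/4, 1 give the minimum, Q1, median, Q3, maximum.
five_numbers = [uniform_quantile(q) for q in (0.0, 0.25, 0.5, 0.75, 1.0)]
print(five_numbers)  # [0.0, 2.5, 5.0, 7.5, 10.0]
```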

Percentiles – Textbook’s Definition
The pth percentile – A value that separates the lower p% of a sample or population from the upper (100 – p)%.
p% or more of the values fall at or below the pth percentile, and
(100 – p)% or more of the values fall at or above the pth percentile.

Percentiles – Textbook’s Definition
Find the 25th percentile of the following sample:
22, 28, 31, 40, 42, 56, 78, 88, 97.

Percentiles – Excel’s Formula
To find the position, or rank, of the pth percentile, compute the value
r = 1 + (p/100)(n – 1),
where n is the number of values in the sample.

Excel’s Percentile Formula
This gives the position (r = rank) of the pth percentile.
Round r to the nearest whole number.
The number in that position is the pth percentile.

Excel’s Percentile Formula
Special case: If r is a “half-integer,” for example 10.5, then take the average of the numbers in the positions on either side of r (positions 10 and 11 in this example), just as we did for the median when n was even.
Microsoft Excel will interpolate whenever r is not a whole number.
Therefore, by rounding, our answers may differ from Excel’s.
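A minimal sketch of the rounding rule next to linear interpolation, using the rank r = 1 + (p/100)(n – 1). The function names are hypothetical, the data are assumed to be sorted already, and the half-integer case is read as averaging the two neighboring positions (10 and 11 when r = 10.5). The interpolated version is only an approximation of what the slides say Excel does, not Excel's actual code.

```python
import math


def percentile_rank(p, n):
    """Position r = 1 + (p/100)(n - 1) of the pth percentile among n sorted values."""
    return 1 + (p / 100) * (n - 1)


def percentile_rounded(sorted_data, p):
    """Lecture rule: round r to the nearest whole position, averaging at half-integers."""
    r = percentile_rank(p, len(sorted_data))
    lower = math.floor(r)
    frac = r - lower
    if math.isclose(frac, 0.5):
        # Half-integer rank: average the values on either side of r.
        return (sorted_data[lower - 1] + sorted_data[lower]) / 2
    position = lower if frac < 0.5 else lower + 1
    return sorted_data[position - 1]  # positions are 1-based, lists are 0-based


def percentile_interpolated(sorted_data, p):
    """Linear interpolation between the two values that surround position r."""
    r = percentile_rank(p, len(sorted_data))
    lower = math.floor(r)
    frac = r - lower
    if lower >= len(sorted_data):
        return sorted_data[-1]  # p = 100 lands exactly on the maximum
    left, right = sorted_data[lower - 1], sorted_data[lower]
    return left + frac * (right - left)


sample = [22, 28, 31, 40, 42, 56, 78, 88, 97]
print(percentile_rounded(sample, 25))                 # 31
print(percentile_rounded(sample, 10))                 # 28
print(round(percentile_interpolated(sample, 10), 1))  # 26.8
```

For p = 25 the rank is a whole number, so both approaches agree. For p = 10 the rank is 1.8, so rounding picks the 2nd value while interpolation lands between the 1st and 2nd.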

Example
Find the 25th percentile of
22, 28, 31, 40, 42, 56, 78, 88, 97.
p = 25 and n = 9.
Compute r = 1 + (25/100)(9 – 1) = 3.
The 25th percentile is the 3rd number, i.e., 31.

The Principle
Excel’s formula is based on the gaps between the numbers, not the numbers themselves. A sample of n numbers has n – 1 gaps, and the pth percentile lies p% of the way across those gaps, which is why the formula uses n – 1.
[Figure labels: Min, Q1, Med, Q3, Max]

Example
Find the quartiles for the sample
5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240.
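One way to check this example is with Python's standard library. Note that this swaps in a different technique from the rounding rule: `statistics.quantiles` with `method="inclusive"` interpolates at position 1 + (p/100)(n – 1). For this sample the quartile ranks are 3.5, 6, and 8.5, so interpolation and the half-integer averaging rule give the same values. The wrapper name `five_number_summary` is hypothetical.

```python
import statistics


def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum) using inclusive quartiles."""
    values = sorted(data)
    # n=4 asks for the three cut points that split the data into quarters.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return values[0], q1, median, q3, values[-1]


sample = [5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240]
print(five_number_summary(sample))  # (5, 37.5, 80.0, 157.5, 240)
```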

Excel’s Percentile Formula
The formula may be reversed to find the percentile percentage of a number, given its position, or rank, in the sample.
The formula is
p = 100(r – 1)/(n – 1).

Example
In the sample
22, 28, 31, 40, 42, 56, 78, 88, 97
what percentile percentage is associated with 40?
n = 9 and r = 4.
Compute p = 100(4 – 1)/(9 – 1) = 37.5.
Therefore, 40 is the 37.5th percentile.
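The reversed formula p = 100(r – 1)/(n – 1) can be sketched as follows. The function name is hypothetical, and the sketch assumes the data are sorted and that the value appears exactly once in the sample.

```python
def percentile_percentage(sorted_data, value):
    """Percentile percentage p = 100(r - 1)/(n - 1) for a value at rank r.

    Assumes sorted_data is in ascending order and contains value exactly once.
    """
    r = sorted_data.index(value) + 1  # 1-based rank of the value
    n = len(sorted_data)
    return 100 * (r - 1) / (n - 1)


sample = [22, 28, 31, 40, 42, 56, 78, 88, 97]
print(percentile_percentage(sample, 40))  # 37.5
```

By this formula the minimum is always the 0th percentile and the maximum the 100th, matching the 0th and 4th quartiles in the five-number summary.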

Example
For the sample
5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240,
what is the percentile percentage of 45?

The Interquartile Range
The interquartile range (IQR) is the difference between Q3 and Q1.
The IQR is a commonly used measure of spread, or variability.
Like the median, it is not affected by extreme outliers.

Example
The IQR of
22, 28, 31, 40, 42, 56, 78, 88, 97
is IQR = Q3 – Q1 = 78 – 31 = 47.
Find the IQR for the sample
5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240.
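A short sketch of why the IQR resists outliers, using the first sample above. It relies on `statistics.quantiles` with `method="inclusive"` (an interpolating method, which matches the rounding rule here because the quartile ranks 3 and 7 are whole numbers). The function name `iqr` and the replacement value 9700 are made up for illustration.

```python
import statistics


def iqr(data):
    """Interquartile range Q3 - Q1, using inclusive quartiles."""
    q1, _, q3 = statistics.quantiles(sorted(data), n=4, method="inclusive")
    return q3 - q1


sample = [22, 28, 31, 40, 42, 56, 78, 88, 97]
with_outlier = sample[:-1] + [9700]  # hypothetical: replace the maximum with an extreme value

print(iqr(sample))        # 47.0
print(iqr(with_outlier))  # 47.0

# The range (max - min), by contrast, changes drastically.
print(max(sample) - min(sample), max(with_outlier) - min(with_outlier))  # 75 9678
```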

TI-83 – Five-Number Summary
Follow the procedure used to find the mean and the median.
Scroll down the display to find the minimum, Q1, the median, Q3, and the maximum.

Example
Use the TI-83 to find Q1 and Q3 for the sample
5, 20, 30, 45, 60, 80, 100, 140, 175, 200, 240.

Homework (2 problems – 10 points each)
For the % on-time-arrival data (p. 252), use the formula, with rounding, to find
The 10th percentile.
The 43rd percentile.
The 69th percentile.
The 95th percentile.
Use the formula to find the percentile percentages, with rounding, of the following % on-time arrivals.
76.0, 81.1, 85.8, 90.3.