5-Number Summaries, Outliers, and Boxplots

Slides:



Advertisements
Similar presentations
Describing Distributions With Numbers
Advertisements

Lecture 17 Sec Wed, Feb 13, 2008 Boxplots.
Understanding and Comparing Distributions 30 min.
Unit 6B Measures of Variation.
Boxplots (Box and Whisker Plots). Comparing Data Using Boxplots Each section of the boxplot represents 25% of the data. The median (50%tile) is the line.
The Five Number Summary and Boxplots
Section 2.3 Measures of Center and Spread.  1) What were the main topics of 2.3?  2) What are the measures of center?  3) What is an IQR?  4) What.
Boxplots (Box and Whisker Plots). Boxplot and Modified Boxplot 25% of data in each section.
Created by: Tonya Jagoe. Measures of Central Tendency mean median mode.
Lesson 25 Finding measures of central tendency and dispersion.
 Boxplot  TI-83/84 Calculator  5 number summary  Do you have an outlier  Modified Boxplot.
Box and Whisker Plot Chapter 3.5. Box and Whisker Plot A Box-and-Whisker Plot or Box plot is a visual device that uses a 5-number summary to reveal the.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
Quantitative Data Continued
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Statistics 1: Statistical Measures
Chapter 1: Exploring Data
Describing Distributions Numerically
CHAPTER 2: Describing Distributions with Numbers
The Practice of Statistics, Fourth Edition.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Measures of Position.
CHAPTER 1 Exploring Data
DAY 3 Sections 1.2 and 1.3.
Please take out Sec HW It is worth 20 points (2 pts
Warmup What is the shape of the distribution? Will the mean be smaller or larger than the median (don’t calculate) What is the median? Calculate the.
Measure of Center And Boxplot’s.
Measure of Center And Boxplot’s.
CHAPTER 1 Exploring Data
Measuring Variation 2 Lecture 17 Sec Mon, Oct 3, 2005.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
pencil, red pen, highlighter, GP notebook, graphing calculator
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Warm Up 1) What is Standard Deviation? 2) Given that the mean of a set of data is 15 and the standard deviation is 3, how many standard deviations away.
Describing a Skewed Distribution Numerically
Define the following words in your own definition
Box & Whiskers Plots AQR.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Measures of Position Section 3.3.
Describing Distributions Numerically
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Comparing Statistical Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Box-and-Whisker Plots
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
pencil, red pen, highlighter, GP notebook, graphing calculator
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

5-Number Summaries, Outliers, and Boxplots Section 2.3 Day 2 5-Number Summaries, Outliers, and Boxplots

5-Number Summary AKA 5-point summary If you include the minimum and maximum values of the data set along with the median and quartiles, you get the 5-number summary. AKA 5-point summary

5-Number Summary

Find 5-number summary.

Find 5-number summary….don’t forget key! 3I2 represents 32 mph

Find 5-number summary….n=18 3I2 represents 32 mph

5-Number Summary Graphical display of a 5-number summary is a boxplot or box-and-whiskers plot

How do we construct a boxplot?

1. Plot the 5 points

1. Plot the 5 points 2. Draw box from Q1 to Q3

1. Plot the 5 points 2. Draw box from Q1 to Q3 3. Draw vertical line at median

1. Plot the 5 points 2. Draw box from Q1 to Q3 3. Draw vertical line at median 4. Extend whiskers to min and max values

1. Plot the 5 points 2. Draw box from Q1 to Q3 3. Draw vertical line at median 4. Extend whiskers to min and max values 5. Label graph (context)

Outliers What are outliers?

Outliers Recall outliers in a set of data are any values that differ significantly from the other values.

For this data, are there any outliers?

Formula for Outliers A value is an outlier if it lies more than 1.5 times the IQR from the nearest quartile.

Formula for Outliers A value is an outlier if it lies more than 1.5 times the IQR from the nearest quartile. Thus, a value is an outlier if it is < Q1 – 1.5(IQR) or > Q3 + 1.5(IQR)

For this data, are there any outliers?

IQR = Q3 – Q1 = 42 – 30 = 12 Lower end:Q1 – 1.5(IQR) = 30 – 1.5(12) = 12 Upper end:Q3 + 1.5(IQR) = 42 + 1.5(12) = 60

Modified Boxplot Modified boxplot is like a basic boxplot except the whiskers only go as far as the largest and smallest nonoutliers (sometimes called adjacent values). Any outliers appear as individual dots or other symbols.

Modified Boxplot Modified boxplot is like a basic boxplot except the whiskers only go as far as the largest and smallest nonoutliers (sometimes called adjacent values). Any outliers appear as individual dots or other symbols.

Boxplots Useful when plotting a single quantitative variable and you want to compare shapes, centers, and spreads of two or more distributions you don’t need to see individual values, even approximately you don’t need to see more than the 5-number summary but would like outliers to be clearly indicated

Graphing Calculator You can use graphing calculator to find 5-number summary and draw boxplot. Use data from Display 2.46 on page 61

Graphing Calculator You can use graphing calculator to find 5-number summary and draw boxplot. Use data from Display 2.46 on page 61 Press “STAT” Select 1:Edit Enter the data elements in list Note: no need to reorder data first

Graphing Calculator is Your Friend! Your calculator will compute the summary statistics for a set of data. After entering data in list: Press “STAT” Arrow right to “CALC” Select “1: 1-Var Stats” 1-Var Stats L1 Enter

1-Var Stats Display 2.46 on page 61

1-Var Stats

Draw Boxplot 2nd STAT PLOT 1: Plot 1 …on Type: select modified boxplot symbol Xlist: L1 Freq: 1 Mark: Graph

Draw Boxplot If you can not see the boxplot, press “Zoom” Select 9: ZoomStat

Standard Deviation Differences from the mean, x – x, are called deviations.

∑(x – x ) = 0 Standard Deviation Differences from the mean, x – x, are called deviations. Mean is balance point of distribution so the set of deviations from the mean will always sum to zero. ∑(x – x ) = 0

Standard Deviation Formula for standard deviation, s, is:

Standard Deviation Formula for standard deviation, s, is: Dividing by n - 1 gives a slightly larger value than dividing by n. This is useful because otherwise the standard deviation of the sample would tend to be smaller than the standard deviation of the population the sample came from.

Computing Standard Deviation

Computing Standard Deviation Use 1-Var Stats. Symbol for standard deviation is sx

Summary from Frequency Table

Summary from Frequency Table Page 68

Summary from Frequency Table Enter “values” in List 1 Enter “frequency” in List 2 “STAT”, “CALC”, “1: 1-Var Stats” 1-Var Stats L1, L2 Enter

Important Note When homework says to use the formulas to compute something, you may use your calculator

Questions?