Boxplots The boxplot is an informative way of displaying the distribution of a numerical variable.. It uses the five-figure summary: minimum, lower quartile,

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
C. D. Toliver AP Statistics
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Understanding and Comparing Distributions 30 min.
CHAPTER 1 Exploring Data
Measures of Dispersion
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Numerical Representation of Data Part 3 – Measure of Position
Statistics: Use Graphs to Show Data Box Plots.
Starter 1.Find the median of Find the median of Calculate the range of Calculate the mode.
The Five-Number Summary And Boxplots. Chapter 3 – Section 5 ●Learning objectives  Compute the five-number summary  Draw and interpret boxplots 1 2.
Box and Whisker Plots A Modern View of the Data. History Lesson In 1977, John Tukey published an efficient method for displaying a five-number data summary.
BOX PLOTS/QUARTILES. QUARTILES: 3 points in a set of data that separate the set into 4 equal parts. Lower Quartile: Q1 (The median for the lower half.
6-9 Data Distributions Objective Create and interpret box-and-whisker plots.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1 Further Maths Chapter 2 Summarising Numerical Data.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
Chapter 2 Section 5 Notes Coach Bridges
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Using Measures of Position (rather than value) to Describe Spread? 1.
Cumulative frequency Cumulative frequency graph
Box Plots March 20, th grade. What is a box plot? Box plots are used to represent data that is measured and divided into four equal parts. These.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
To graph and interpret Box-and-Whisker Plots 9/17/13 Do Now: Order the set of numbers from least to greatest, then find the median.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Box and Whisker Plots or Boxplots
CHAPTER 1 Exploring Data
a graphical presentation of the five-number summary of data
Describing Distributions Numerically
Get out your notes we previously took on Box and Whisker Plots.
Unit 2 Section 2.5.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
2.6: Boxplots CHS Statistics
Numerical Measures: Skewness and Location
Describing Distributions Numerically
BOX-and-WHISKER PLOT (Box Plot)
CHAPTER 1 Exploring Data
1.3 Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Measures of Central Tendency
Constructing Box Plots
Define the following words in your own definition
Box & Whiskers Plots AQR.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Statistics and Data (Algebraic)
12.4 Box-and-Whisker Plots
Box-and-Whisker Plots
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
. . Box and Whisker Measures of Variation Measures of Variation 8 12
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Box and Whisker Plots.
Box Plots CCSS 6.7.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
BOX-and-WHISKER PLOT (Box Plot)
Presentation transcript:

Boxplots The boxplot is an informative way of displaying the distribution of a numerical variable.. It uses the five-figure summary: minimum, lower quartile, median, upper quartile and maximum.

Boxplots The median The median is the midpoint of a distribution; that is, 50% of values are lower than or equal to the median and 50% are higher than or equal to the median.

Boxplots Unordered data: The median is located by listing all the data values in numerical order The median... and then finding the point that divides the distribution into two equal parts. For example:

Boxplots Just as the median is the point that divides a distribution in half, quartiles are the points that divide a distribution into quarters. We use the symbols Q 1, Q 2, and Q 3 to represent the quartiles. Of course, Q 2 is the median.

Boxplots Q 1 Q 3 a box is used to represent the middle 50% of scores whisker minimummaximum whisker m The median is shown by a vertical line drawn within the box. Lines (called whiskers) are extended out from the lower and upper ends of the box to the smallest and largest data values of data set respectively.

Boxplots For example, these are the marks (out of 20) obtained by a group of 15 students on an assignment: The median is 10 The maximum is 18The minimum is 2 Q 3 is 15Q 1 is 5

Boxplots...and the boxplot looks like this. minimumQ1Q1 mQ3Q3 maximum

Boxplots From the boxplot we can see: shape centre spread

Boxplots So we can say that the distribution of scores on the assignment for this group of student is symmetric, with about 50% of the students scoring 10 or more out of 20. The marks on the assignment were quite variable, with the middle 50% of scores ranging from 5 to 15.

Boxplots We can also construct a more complicated boxplot which also show the outliers (if any). But first of all, we need a definition of an outlier.

Boxplots Suppose we have a data set for which the IQR range is represented by the following box: Then any data value which falls outside of these lines is called an outlier. IQR 1.5  IQR Imaginary lines are drawn 1.5 interquartile ranges below and above the box.

Boxplots IQR 1.5  IQR We join the whiskers from the box to the largest data value which is not an outlier... …and use an asterisk or similar to display the outlier. *

Boxplots The times (in minutes) taken for a sample of 18 people to assemble a new pre-packed TV cabinet are: We will demonstrate a boxplot with outliers using the following data.

Boxplots maxQ3Q3 Q1Q1 min By putting the data in order we can soon find the five-figure summary m=32 IQR = Q 3  Q 1 = 36  28 =  IQR = 1.5  8 = 12

Boxplots The imaginary lines are drawn at: 28  12 = 16 and = 48 This means we have one outlier (14),and the boxplot looks like this: