Drawing and comparing Box and Whisker diagrams (Box plots)

Slides:



Advertisements
Similar presentations
“Teach A Level Maths” Statistics 1
Advertisements

Cumulative Frequency and Box Plots. Learning Objectives  To be able to draw a cumulative frequency curve and use it to estimate the median and interquartile.
Interpreting data … Drawing and comparing Box and Whisker diagrams (Box plots)
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Starter 1.Find the median of Find the median of Calculate the range of Calculate the mode.
Box Plot A plot showing the minimum, maximum, first quartile, median, and third quartile of a data set; the middle 50% of the data is indicated by a.
Cumulative Frequency Objectives: B Grade Construct and interpret a cumulative frequency diagram Use a cumulative frequency diagram to estimate the median.
Introduction to Data Analysis. Defining Terms  Population- the entire group of people or objects that you want information about  Sample- specific part.
Box and Whisker Plots A diagram that summarizes data by dividing it into four parts. It compares two sets of data.
Cumulative Frequency Diagrams & Box Plots
Drawing and comparing Box and Whisker diagrams (Box plots)
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Continued… Obj: draw Box-and-whisker plots representing a set of data Do now: use your calculator to find the mean for 85, 18, 87, 100, 27, 34, 93, 52,
1 Further Maths Chapter 2 Summarising Numerical Data.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
Box and Whisker Plots and the 5 number summary Mr. J.D. Miles Turner Middle School Atlanta Georgia
Chapter 2 Section 5 Notes Coach Bridges
Topic: Data Handling LO: To be able to understand and produce box plots. STARTER PREPARE FOR LEARNING AGREE LEARNING OBJECTIVES Calculate the median of.
Box and Whisker Plots. Introduction: Five-number Summary Minimum Value (smallest number) Lower Quartile (LQ) Median (middle number) Upper Quartile (UP)
Section 6.7 Box-and-Whisker Plots Objective: Students will be able to draw, read, and interpret a box-and- whisker plot.
Texas Algebra I Unit 3: Probability/Statistics Lesson 28: Box and Whiskers plots.
Vocabulary to know: *statistics *data *outlier *mean *median *mode * range.
Box Plots March 20, th grade. What is a box plot? Box plots are used to represent data that is measured and divided into four equal parts. These.
2-6 Box-and-Whisker Plots Indicator  D1 Read, create, and interpret box-and whisker plots Page
Statistics Year 9. Note 1: Statistical Displays.
5-Number Summary A 5-Number Summary is composed of the minimum, the lower quartile (Q1), the median (Q2), the upper quartile (Q3), and the maximum. These.
5,8,12,15,15,18,20,20,20,30,35,40, Drawing a Dot plot.
Cumulative Frequency and Box Plots
Box and Whisker Plots or Boxplots
Bell Ringer What does the word “average” mean in math?
Get out your notes we previously took on Box and Whisker Plots.
Box and Whisker Plots and the 5 number summary
Box and Whisker Plots and the 5 number summary
Cummulative Frequency Plots
Box Plots EQ: How do you analyze box plots?
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Displaying & Analyzing Data
DS5 CEC Interpreting Sets of Data
Cumulative Frequency Diagrams & Box Plots
Cumulative Frequency Diagrams & Box Plots
Box and Whisker Plots Algebra 2.
Cumulative Frequency and Box Plots
Representing Quantitative Data
Measure of Center And Boxplot’s.
Unit 7. Day 10..
What can you tell about these films from this box plot?
Describing Distributions Numerically
Measure of Center And Boxplot’s.
Representation of Data
Common Core Math I Unit 2: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
A CLASS OF 15 PUPILS DID A TEST WITH A POSSIBLE
Measures of Central Tendency
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Common Core Math I Unit 1: One-Variable Statistics Boxplots, Interquartile Range, and Outliers; Choosing Appropriate Measures.
Define the following words in your own definition
Organizing, Summarizing, &Describing Data UNIT SELF-TEST QUESTIONS
“Teach A Level Maths” Statistics 1
Box and Whisker Plots A diagram that summarizes data by dividing it into four parts. It compares two sets of data. Dittamo & Lewis 2014.
“Teach A Level Maths” Statistics 1
Box and Whisker Plots A.K.A Box Plots.
Cumulative Frequency Objectives:
Box and Whisker Plots.
Integrated Math 2 - Santowski
Box and Whisker Plots and the 5 number summary
5 Number Summaries.
Quiz.
Warm-Up Define mean, median, mode, and range in your own words. Be ready to discuss.
Box and Whisker Plots and the 5 number summary
Find the Mean of the following numbers.
Cumulative Frequency and Box Plots
Presentation transcript:

Drawing and comparing Box and Whisker diagrams (Box plots) Interpreting data … Drawing and comparing Box and Whisker diagrams (Box plots)

Learning objectives All: Calculate quartiles and draw box and whisker diagrams Most: Interpret box and whisker diagrams and use to compare datasets Some: Use all terminology and use IQR to find “outliers”

A list of data The weights (KG) of 15 children: 37, 42, 31, 35, 48, 29, 50, 36, 44, 28, 63, 35, 41, 52, 43 Difficult to UNDERSTAND what these children look like from the list …

The weights (KG) of 15 children: 28, 29, 31, 35, 35, 36, 37, 41, 42, 43, 44, 48, 50, 52, 63 Minimum = 28KG Maximum = 63KG Range = 35KG Mode = 35KG Median = 41KG Mean = 40.9KG Slightly easier to understand with these summary figures …

Stem and leaf … 28, 29, 31, 35, 35, 36, 37, 41, 42, 43, 44, 48, 50, 52, 63 2 8 9 3 1 5 5 6 7 4 1 2 3 4 8 5 0 2 6 3 Key: 2 9 means 29 ORDERED STEM & LEAF We now have the raw data AND a pictorial representation …

Stem and leaf = Bar chart?

Another useful summary A diagram to show: min (28KG), max (63KG), median (41KG) … Min Median Max

Median ½(n + 1)th piece of data (ordered) 28, 29, 31, 35, 35, 36, 37, 41, 42, 43, 44, 48, 50, 52, 63 15 items of data … n = 15 ½(n + 1) = ½(15 + 1) = 8th item

Lower Quartile ¼(n + 1)th piece of data (ordered) 28, 29, 31, 35, 35, 36, 37, 41, 42, 43, 44, 48, 50, 52, 63 15 items of data … n = 15 ¼(n + 1) = ¼(15 + 1) = 4th item

Upper Quartile ¾(n + 1)th piece of data (ordered) 28, 29, 31, 35, 35, 36, 37, 41, 42, 43, 44, 48, 50, 52, 63 15 items of data … n = 15 ¾(n + 1) = ¾(15 + 1) = 12th item

Add that to our box plot Median LQ UQ Max Min A diagram to show: min (28KG), lower quartile = 35KG max (63KG), upper quartile = 48KG median (41KG) … Median LQ UQ Min Max

Some terminology Q2 Q1 Q3 Q0 Q4 Median LQ UQ Min Max Alternative names for quartiles

Some terminology UQ – LQ = Interquartile Range (IQR) Max – Min = Range

Some terminology Positive skew: median closer to LQ than UQ Negative skew: median closer to UQ than LQ Symmetrical distribution

Interpreting the box plot Easily see lightest / heaviest and range The ‘box’ contains the middle 50% of people (the most ‘representative half’) The ‘whiskers’ show the lightest 25% and heaviest 25% of people (extremes)

Comparing groups Boys Girls “Lightest girl lighter than lightest boy” “Heaviest boy heavier than heaviest girl” “Most representative half of girls generally lighter than most representative half of boys”

Comparing groups Boys Girls “Lightest girl same as lightest boy” “Heaviest boy same as heaviest girl” “All of the most representative half of girls lighter than most representative half of boys” “Three quarters of girls lighter than three quarters of boys”

Come on guys, this is so SLOW! The Queue of DEATH! 25,000 people 25,000 people 25,000 people 25,000 people Source: Dr Pearl’s 1938 study of 100,000 non smokers

Woah there! I’m not ready yet! The Queue of DEATH! 25,000 people 25,000 people 25,000 people 25,000 people Source: Dr Pearl’s 1938 study of 100,000 smokers

Direct comparisons easy with box plots non-smokers smokers Direct comparisons easy with box plots

23 boys and 11 girls were given a maths test. Their scores are listed below: Boys: 7, 13, 15, 19, 35, 35, 37, 43, 44, 44, 45, 46, 47, 47, 49, 51, 52, 55, 55, 56, 78, 82, 91 Girls: 7, 18, 23, 47, 58, 63, 68, 72, 72, 75, 87 Use box plots to compare the differences between the boys and girls scores and comment on the differences. Which scores (if any) might be considered ‘outliers’ and why (/why not)?

symmetrical distribution 23 boys and 11 girls were given a maths test. Their scores are listed below: Boys: 7, 13, 15, 19, 35, 35, 37, 43, 44, 44, 45, 46, 47, 47, 49, 51, 52, 55, 55, 56, 78, 82, 91 Girls: 7, 18, 23, 47, 58, 63, 68, 72, 72, 75, 87 Boys Girls Min 7 7 LQ 35 23 Median 46 63 UQ 55 72 Max 91 87 IQR 20 49 Range 84 80 } 11 9 symmetrical distribution } 40 9 negative skew

Box plot of boys and girls maths scores 0 10 20 30 40 50 60 70 80 90 100 (Maths score out of 100)

Looking for ‘outliers’ When do we feel our ‘extreme’ data is just TOO extreme?

Outliers High Outliers > UQ + 1.5 x IQR Low Outliers < LQ – 1.5 x IQR Eg. For boys 1.5 x IQR = 1.5 x 20 = 30 Scores less than LQ – 30 (35 – 30 = 5) are outliers Scores more than UQ + 30 (55 + 30 = 85) are outliers The only outlier is the score of 91 … but that is not such an unreasonable score! This is just a guide! For girls outliers are < -50.5 or > 145.5 … so no outliers there!

Plenary What’s the point in Box plots? Give some advantages and disadvantages of Box plots over other methods of comparing data.