Ronald Hui Tak Sun Secondary School

Slides:



Advertisements
Similar presentations
Learn to find measures of variability. Box-and-whisker Plots.
Advertisements

Central Tendency Mean – the average value of a data set. Add all the items in a data set then divide by the number of items in the data set.
Ch 11 – Probability & Statistics
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
QBM117 Business Statistics
Grouped Data Calculation
Chapter 13 Section 5 - Slide 1 Copyright © 2009 Pearson Education, Inc. AND.
CONFIDENTIAL 1 Grade 8 Algebra1 Data Distributions.
S2A Chapter 7 More about Statistical Graphs Chung Tai Educational Press. All rights reserved. © Terminologies about Classes Lower class.
Slide 3-2 Copyright © 2008 Pearson Education, Inc. Chapter 3 Descriptive Measures.
WHAT IS AN INTEGER? Integers can be thought of as discrete, equally spaced points on an infinitely long number line. (Nonnegative integers (purple) and.
Objectives Vocabulary
Frequency Distribution
Basic Statistics  Statistics in Engineering  Collecting Engineering Data  Data Summary and Presentation  Probability Distributions - Discrete Probability.
WARM UP Find the mean, median, mode, and range 1. 5, 10, 19, 34, 16, , 22, 304, 425, 219, 304, 22, 975 When you are done the warm up put the calculator.
DATA ANALYSIS n Measures of Central Tendency F MEAN F MODE F MEDIAN.
INVESTIGATION 1.
Warm-Up Define mean, median, mode, and range in your own words. Be ready to discuss.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
WE HAVE BEEN LOOKING AT: Univariate data: Collecting sets of data and analysing it to see the results Ungrouped data set: a list of individual results.
GCSE Statistics More on Averages.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
Barnett/Ziegler/Byleen Finite Mathematics 11e1 Chapter 11 Review Important Terms, Symbols, Concepts Sect Graphing Data Bar graphs, broken-line graphs,
Summary Statistics: Measures of Location and Dispersion.
Business Statistics, 4e, by Ken Black. © 2003 John Wiley & Sons. 3-1 Business Statistics, 4e by Ken Black Chapter 3 Descriptive Statistics.
7.3 Measures of Central Tendency and Dispersion. Mean – the arithmetic average is the sum of all values in the data set divided by the number of values.
Box Plots March 20, th grade. What is a box plot? Box plots are used to represent data that is measured and divided into four equal parts. These.
Medical Statistics (full English class) Ji-Qian Fang School of Public Health Sun Yat-Sen University.
Central Tendency Mean – the average value of a data set. Add all the items in a data set then divide by the number of items in the data set.
Measures of Central Tendency PS Algebra I. Objectives Given a set of data, be able to find the following: 1) mean* 2) median* 3) mode* 4) range 5) first.
Statistics -Descriptive statistics 2013/09/30. Descriptive statistics Numerical measures of location, dispersion, shape, and association are also used.
Exploratory Data Analysis
Cumulative Frequency and Box Plots
Ronald Hui Tak Sun Secondary School
MATHEMATICS The Measure of Data Location
SUBTOPIC 8.3 : Measures of Location 8.4 : Measures of Dispersion
Add a 3rd column and do a running total of the frequency column
Range, Mean, Median, Mode Essential Question: How do we take a random sample, and what statistics can we find with the data? Standard: MM1D3.a.
Represent the solutions of the following inequalities graphically.
Medians.
Descriptive Statistics SL
Frequency Distribution Tables and Histogram
Single Variable Data Analysis
Mean, Median, Mode, and Range
Descriptive Statistics
(12) students were asked their SAT Math scores:
Box and Whisker Plots Algebra 2.
Cumulative Frequency and Box Plots
How to draw a cumulative frequency graph
Lesson 1: Measures of Center Mean and Median
Ronald Hui Tak Sun Secondary School
Draw a cumulative frequency table.
Algebra I Unit 1.
How to draw a cumulative frequency graph
Measures of central tendency
Median, Quartiles, Inter-Quartile Range and Box Plots.
Measures of Central Tendency
Whiteboardmaths.com © 2004 All rights reserved
Numerical Descriptive Statistics
Ronald Hui Tak Sun Secondary School
Bellwork 4/24 A students’ test scores on 6 tests were; 84, 76, 80, 64, 92, and 84. Find the average test score.
Comparing Statistical Data
Find median-table data
Ronald Hui Tak Sun Secondary School
Warm-Up Define mean, median, mode, and range in your own words. Be ready to discuss.
Section 12.3 Box-and-Whisker Plots
Analyze Data: IQR and Outliers
ALGEBRA STATISTICS.
Cumulative Frequency and Box Plots
Presentation transcript:

Ronald Hui Tak Sun Secondary School HKDSE Mathematics Ronald Hui Tak Sun Secondary School

Homework SHW6-C1 Sam L SHW6-R1 Walter (RD) Ronald HUI

Homework SHW7-B1 SHW7-R1 SHW7-P1 Sam L Kelvin, Sam L, Pako Sam L, Pako, Walter Ronald HUI

Homework SHW8-A1 SHW8-B1 SHW8-R1 SHW8-P1 Daniel, Sam L, Pako, Ken, Kelvin, Sam L, Marco S, Pako SHW8-R1 Today (18th Mar)! SHW8-P1 Ronald HUI

Range and Inter-quartile Range Title page: Font size 36, bold, theme color of the chapter (red for geometry, blue for algebra, green for statistics)

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Do you remember how to measure the central tendency of a set of data? We can use an average to measure.

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Good! The commonly used averages include the mean, the median and the mode. Try to find them for each set of numbers. We can use an average to measure.

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Mean of A Mean of B 6 9 7 5 3 1 + = 6 5 4 + = = 5 = 5 2 5 + = 2 5 + = Median of A Median of B = 5 = 5 Mode of A = 5 Mode of B = 5

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Mean of A = 5 Mean of B = 5 Median of A = 5 Median of B = 5 Mode of A = 5 Mode of B = 5 Although two sets of numbers have the same measures of central tendency, the numbers of each set spread out differently.

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Mean of A = 5 Mean of B = 5 Median of A = 5 Median of B = 5 Mode of A = 5 Mode of B = 5 1 2 3 4 5 6 1 7 8 9 Distribution of set A 1 2 3 4 5 6 1 7 8 9 Distribution of set B

Consider the two sets of numbers: Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} Mean of A = 5 Mean of B = 5 Median of A = 5 Median of B = 5 Mode of A = 5 Mode of B = 5 To measure how spread out or how dispersed the data are, we need some measures of dispersion. Range and inter-quartile range are two commonly used ones.

Range of Ungrouped Data For ungrouped data: range = largest datum – smallest datum In general, the greater the range, the greater is the dispersion of the set of data.

Range of Ungrouped Data For ungrouped data: range = largest datum – smallest datum Consider the data sets in the previous example. Set A = {1, 3, 5, 5, 7, 9} Set B = {4, 5, 5, 5, 5, 6} 1 9 4 6 Range of A = 9 – 1 9 1 = 8 Range of B = 6 – 4 6 4 = 2 Since the range of A > the range of B, the data in set A are more dispersed.

Follow-up question The following shows the prices of 6 pairs of trousers in 2 shops. Shop A: $250, $270, $350, $395, $420, $480 Shop B: $285, $310, $335, $385, $410, $450 (a) Find the range of prices of trousers in each shop. (b) Which shop has less dispersed prices? (a) Range of prices of trousers in shop A = $(480 – 250) = $230 Range of prices of trousers in shop B = $(450 – 285) = $165

Follow-up question The following shows the prices of 6 pairs of trousers in 2 shops. Shop A: $250, $270, $350, $395, $420, $480 Shop B: $285, $310, $335, $385, $410, $450 (a) Find the range of prices of trousers in each shop. (b) Which shop has less dispersed prices? (b) Since the range of prices for shop B < the range of prices for shop A, the prices for shop B are less dispersed.

Range of Grouped Data For grouped data: range = upper class boundary of the last class interval – lower class boundary of the first class interval Let us take a look in the following example.

The following table shows the ages of 50 employees. Age 20 – 24 25 – 29 30 – 34 35 – 39 Frequency 11 18 15 6 24 25 Upper class boundary of the last class interval = 39.5 Lower class boundary of the first class interval = 19.5 ∴ Range of the ages of the 50 employees = 39.5 – 19.5 = 20 25 24 25 24

Follow-up question The following table shows the weight distribution of pears in a fruit shop. Find the range of weights of the pears. Weight (g) 180  199 200  219 220  239 240  259 Number of pears 33 58 69 17 Range of weights of the pears = (259.5 – 179.5) g = 80 g highest class boundary = 259.5 g lowest class boundary = 179.5 g

The range of a data set is easy to find. However, if there is a extreme datum in the data set, the value of the range will be greatly affected. Right! In the following, we will introduce another measure of dispersion, the inter-quartile range, which is less affected by extreme values.

Inter-quartile Range of Ungrouped Data For ungrouped data: Median The median divides the set of data into two equal parts.

Inter-quartile Range of Ungrouped Data For ungrouped data: Median Q1 Lower quartile Q3 Upper quartile Q2 Middle quartile The quartiles divide the whole set of data into four equal parts.

Inter-quartile Range of Ungrouped Data For ungrouped data: Q1 Lower quartile Q1 Lower quartile Q2 Middle quartile Q2 Middle quartile Q3 Upper quartile Q3 Upper quartile Q1 (or the first quartile) is the middle value of the lower half. Q2 (or the second quartile) is the median of the data set. Q3 (or the third quartile) is the middle value of the upper half. The inter-quartile range (IQR) is defined as the difference between the upper quartile (Q3 ) and the lower quartile (Q1). Inter-quartile range (IQR) = Q3 – Q1

In general, the greater the inter-quartile range, the greater is the dispersion of the set of data.

A data set with an even number of data 9, 5, 1, 5, 3, 7 The procedures of finding the inter-quartile ranges of ungrouped data with an even number and an odd number of data are as follows. A data set with an even number of data 9, 5, 1, 5, 3, 7 Step 1 Arrange the data in ascending order. 1, 3, 5, 5, 7, 9 Step 2 Divide the whole data set into two equal halves. Lower half Upper half 1, 3, 5, 5, 7, 9 Q2

Find the median of each half of the data set. Lower half Upper half 1, 3, 5, 5, 7, 9 Q1 Q2 Q3 Step 3 Find the median of each half of the data set. Q1 = 3, Q3 = 7 Step 4 Find the inter-quartile range of the data set. Inter-quartile range = Q3  Q1 = 7  3 = 4

A data set with an odd number of data 8, 6, 2, 4, 7 8, 6, 2, 4, 7 Step 1 Arrange the data in ascending order. 2, 4, 6, 7, 8 Step 2 Divide the whole data set into two equal halves. Lower half Upper half 2, 4, 6, 7, 8 Q2

Find the median of each half of the data set. Lower half Upper half 2, 4, 6, 7, 8 Q1 Q2 Q3 Step 3 Find the median of each half of the data set. Step 4 Find the inter-quartile range of the data set. Inter-quartile range = Q3  Q1 = 7.5  3 = 4.5

Follow-up question The waiting times (in min) of customers for bank services along two queues in front of two ATM machines are recorded below. ATM A: 3.8, 6.5, 8.3, 15.3, 1.9, 5.8, 5.3, 13.9 ATM B: 6.7, 13.6, 4.7, 5.5, 9.1, 8.5, 7.6, 4.8 For each ATM, find the inter-quartile range of the waiting times of the customers. Based on the results in (a), which ATM’s waiting times have a greater dispersion? (a) Inter-quartile range of the waiting times for ATM A ◄ 1.9, 3.8, 5.3, 5.8, 6.5, 8.3, 13.9, 15.3 lower half upper half

Follow-up question The waiting times (in min) of customers for bank services along two queues in front of two ATM machines are recorded below. ATM A: 3.8, 6.5, 8.3, 15.3, 1.9, 5.8, 5.3, 13.9 ATM B: 6.7, 13.6, 4.7, 5.5, 9.1, 8.5, 7.6, 4.8 For each ATM, find the inter-quartile range of the waiting times of the customers. Based on the results in (a), which ATM’s waiting times have a greater dispersion? (a) (cont’d) Inter-quartile range of the waiting times for ATM B ◄ 4.7, 4.8, 5.5, 6.7, 7.6, 8.5, 9.1, 13.6 lower half upper half

Follow-up question The waiting times (in min) of customers for bank services along two queues in front of two ATM machines are recorded below. ATM A: 3.8, 6.5, 8.3, 15.3, 1.9, 5.8, 5.3, 13.9 ATM B: 6.7, 13.6, 4.7, 5.5, 9.1, 8.5, 7.6, 4.8 For each ATM, find the inter-quartile range of the waiting times of the customers. Based on the results in (a), which ATM’s waiting times have a greater dispersion? (b) ∵ IQR for ATM A > IQR for ATM B ∴ Based on the inter-quartile range, the waiting times for ATM A have a greater dispersion.

Inter-quartile Range of Grouped Data For grouped data, we also have: inter-quartile range = Q3 – Q1 Consider the cumulative frequency curve below: 25% of the total frequency = 200 25% = 50 From the graph, the lower quartile (Q1) is 30. Q1

Inter-quartile Range of Grouped Data For grouped data, we also have: inter-quartile range = Q3 – Q1 Consider the cumulative frequency curve below: 75% of the total frequency = 200 75% = 150 From the graph, the upper quartile (Q3) is 72. Q1 Q3

Inter-quartile Range of Grouped Data For grouped data, we also have: inter-quartile range = Q3 – Q1 Consider the cumulative frequency curve below: Q1 Q3 Q1 = 30 Q3 = 72 ∴ IQR = 72 – 30 = 42

Follow-up question The cumulative frequency polygon on the right shows the heights of 40 students. (a) Find the lower quartile and the upper quartile of the heights of the students. (b) Find the inter-quartile range of the heights of the students. Q1 Q3 (a) From the graph, lower quartile Q1 = 157 cm upper quartile Q3 = 175 cm

Follow-up question The cumulative frequency polygon on the right shows the heights of 40 students. (a) Find the lower quartile and the upper quartile of the heights of the students. (b) Find the inter-quartile range of the heights of the students. Q1 Q3 (b) Inter-quartile range = Q3 – Q1 = (175 – 157) cm = 18 cm