Download presentation
Presentation is loading. Please wait.
Published byDoreen Patterson Modified over 9 years ago
1
Statistical Reasoning for everyday life Intro to Probability and Statistics Mr. Spering – Room 113
2
4.2 Shapes of Distribution CLASS WORK: Worksheet REVIEW ACTIONS are REMEMBERED, WORDS can be FORGOTTEN! MAKE an EFFORT, NOT an EXCUSE
3
4.2 Shapes of Distribution Variation: Describes how widely data are spread out about the center of a distribution. ????How would you expect the variation to differ between the heights of NCAA Division 1A Men’s College Basketball Centers and the heights of all High School Boy Basketball Players???? NCAA Division 1A Centers less variation High School Boy Basketball Players more variation
4
4.3 Measures of Variation How do we investigate variation? Study all of the raw data… Range… Quartiles… Five-number summary (BOXPLOT or BOX-and-WHISKER) … Interquartile range… Semi-quartile range… Percentiles… MAD… Variance & Standard Deviation…
5
4.3 Measures of Variation RANGE: The range of a distribution is the difference between the highest and lowest data values.
6
4.3 Measures of Variation Find the range of the data. 4.1, 5.2, 5.6, 6.2, 7.2, 7.7, 7.7, 8.5, 9.3, 11.0 Range = 11.0 – 4.1 = 6.9
7
4.3 Measures of Variation Misleading range: Which Quiz Set has greater variation? Quiz Set 1: 1, 10, 10, 10, 10, 10, 10, 10, 10, 10 Quiz Set 2: 2, 3, 4, 5, 6, 7, 8, 9, 10, 8, 9, 10, 6, 5 ** Even though Set 1 has a greater range than Set 2 (9 > 8). Set 2 has a greater variation because Set 1 contains an outlier. Therefore, we use quartiles.**
8
4.3 Measures of Variation Quartiles: Quartiles divide the data into four quarters. Lower Quartile (1 st Quartile): is the median of the data values in the lower half of a data set. Exclude the middle value in the data set if the number of data points is odd. Middle Quartile (2 nd Quartile): is the overall median Upper Quartile (3 rd Quartile): is the median of the data values in the upper half of a data set. Exclude the middle value in the data set if the number of data points is odd.
9
4.3 Measures of Variation Find quartiles… Example 1 – Upper and lower quartiles Data 6, 47, 49, 15, 43, 41, 7, 39, 43, 41, 36 Ordered data6, 7, 15, 36, 39, 41, 41, 43, 43, 47, 49 Median (2 nd Quartile)41 Upper quartile (3 rd Quartile)43 Lower quartile (1 st Quartile)15
10
4.3 Measures of Variation Find quartiles… Example 2 – Range and quartiles A year ago, Angela began working at a computer store. Her supervisor asked her to keep a record of the number of sales she made each month. The following data set is a list of her sales for the last 12 months: 34, 47, 1, 15, 57, 24, 20, 11, 19, 50, 28, 37 Use Angela's sales records to find: a)the median b) the range c) the upper and lower quartiles
11
4.3 Measures of Variation Answers The values in ascending order are: 1, 11, 15, 19, 20, 24, 28, 34, 37, 47, 50, 57. a) Median = (6th + 7th observations) ÷ 2 = (24 + 28) ÷ 2 = 26 b) Range = difference between the highest and lowest values = 57 - 1 = 56
12
4.3 Measures of Variation c) Lower quartile = value of middle of first half of data Q1 = the median of 1, 11, 15, 19, 20, 24 = (3rd + 4th observations) ÷ 2 = (15 + 19) ÷ 2 = 17 d) Upper quartile = value of middle of second half of data Q3 = the median of 28, 34, 37, 47, 50, 57 = (3rd + 4th observations) ÷ 2 = (37 + 47) ÷ 2 = 42 These results can be summarized as follows:
13
4.3 Measures of Variation Five-number summary: Consists of the following… 1) Low Value 2) Q1 (lower quartile) 3) Q2 (median) 4) Q3 (upper quartile) 5) High Value #
14
4.3 Measures of Variation BOXPLOT or BOX-and- WHISKER: Box plots show variation along the number line. Steps for creating a box plot: 1. Draw a number line that spans the entire data set. 2. Above the number line, enclose the values from the lower to the upper quartile in a box. 3. Draw a line through the box at the value corresponding to the median. 4. Add “whiskers” extending to the low and high values. Vertical box plot showing “normal” distribution “ FORESHADOWING ”
15
4.3 Measures of Variation Example of 5 number summary and box plot. Lowest Value1 First Quartile (Q1)6.5 Median (Q2)12 Third Quartile (Q3)19.5 Highest Value24 So for the data set 1, 4, 9, 12, 12, 16, 23, 24 here is our box plot:
16
4.3 Measures of Variation Right-Skewed Left-Skewed Symmetric Q1Q2Q3Q1Q2Q3 Q1Q2Q3 Digest of BOXPLOTS and SKEWNESS
17
4.3 Measures of Variation Below is a Box-and-Whisker plot for the following data: 0 2 2 2 3 3 4 5 5 10 27 The data are right skewed, as the plot depicts 0 2 3 5 27 Min Q1 Q2 Q3 Max
18
4.3 Measures of Variation Interquartile range: i.e. If the five number summary is low: 3, high: 23, Q1: 4, Q2: 12, Q3: 19. Then the interquartile range is IQR: (Q3-Q1) = (19 – 4) =15. Interquartile range The interquartile range is another range used as a measure of the variation. The difference between upper and lower quartiles (Q 3 –Q 1 ), which is called the interquartile range, also indicates the dispersion of a data set. The inter- quartile range spans 50% of a data set, and eliminates the influence of outliers because, in effect, the highest and lowest quarters are removed. Interquartile range = upper quartile (Q 3 ) minus lower quartile (Q 1 )
19
4.3 Measures of Variation Next Time: Semi-quartile range… Percentiles… MAD… Variance & Standard Deviation… According to the box-n-whisker above what are the values for the 5 number summary: Low: 12 Q1: 22 Q2: 31 Q3: 45 High: 50
20
4.3 Measures of Variation Classwork: PPRACTICE MAKES PERMANENT PPg 174 # 2-6 even and # 25-27 (Letters a, b only)
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.