Presentation is loading. Please wait.

Presentation is loading. Please wait.

Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.

Similar presentations


Presentation on theme: "Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots."— Presentation transcript:

1 Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots

2 Section 1 Topic 32 Summarising metric data: Median, IQR & Box Plots Can we describe a distribution with just one or two numbers? What is the median, how is it calculated and what does it tell us? What is the interquartile range, how is it calculated and what does it tell us? What is a five number summary? What is a box plot and why is it useful?

3 Section 1 Topic 33 Will less than the whole picture do? Summary Statistics Measures of centre Median Mean Measures of spread Range Interquartile Range Standard Deviation

4 Section 1 Topic 34 Median 3514835148 Firstly numerically order the data set 1345813458 50% higher than or equal to median 50% lower than or equal to median Location of Median = (n+1)/2 = (5+1)/2 = 3 rd observation Notes p.97

5 For an odd number of data values the median will be one of the data values 1345813458 Median = 4 For an even number of data values the median may not coincide with an actual data value 34583458 Median = 4.5 Location of Median = (4+1)/2 = (5)/2 = 2.5 observation

6 Section 1 Topic 36 Limitations: Range Depends on only two extreme values. Data set 1 5 6789101112 Range = 12 - 5 = 7 Data set 2 5 12121212121212

7 Section 1 Topic 37 Interquartile range Quartiles are the points that divide a distribution into quarters Q1Q2Q3Q1Q2Q3 25%50%75% Median IQR = Q 3 - Q 1 The interquartile range (IQR) is defined to be the spread of the middle 50% of data values, so that Notes p.99

8 Section 1 Topic 38 Why is the IQR more useful that the range? IQR describes the middle 50% of observations. Upper 25% and lower 25% of observations are discarded. IQR generally not affected by outliers.

9 Section 1 Topic 39 Picturing quartiles with histogram Notes p.97

10 Section 1 Topic 310 Five number summary Minimum value, Q 1, Median, Q 3, Maximum value

11 Section 1 Topic 311 The Boxplot Graphical representation of five number summary Notes p.98

12 Section 1 Topic 312 Constructing a Boxplot Notes p.99

13 Section 1 Topic 313 *Exercise 4 Notes p.103

14 Section 1 Topic 314 Relating a boxplot to the shape of the distribution : Symmetric Notes p.104

15 Section 1 Topic 315 Positively skewed distributions

16 Section 1 Topic 316 Negatively skewed distributions

17 Section 1 Topic 317 Boxplot with outliers Possible outliers defined as any values outside of the interval (Q 1 -1.5 X IQR, Q 3 + 1.5 X IQR) We say possible, since the point may just be part of the tail of the distribution but we may not have enough data to be sure Notes p.101

18 Section 1 Topic 318 Boxplot with outliers Min Q 1 M Q 3 Max 38 63 70 75 76

19 Section 1 Topic 319 *Exercise 5 Notes p.107


Download ppt "Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots."

Similar presentations


Ads by Google