Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.

Slides:



Advertisements
Similar presentations
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Advertisements

Measures of Center and Variation
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
1.3: Describing Quantitative Data with Numbers
Table of Contents 1. Standard Deviation
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Measures of Dispersion How far the data is spread out.
1 Further Maths Chapter 2 Summarising Numerical Data.
Quantitative data. mean median mode range  average add all of the numbers and divide by the number of numbers you have  the middle number when the numbers.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
Chapter 5 Describing Distributions Numerically.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Box Plots March 20, th grade. What is a box plot? Box plots are used to represent data that is measured and divided into four equal parts. These.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
CHAPTER 1 Exploring Data
Chapter 16: Exploratory data analysis: numerical summaries
a graphical presentation of the five-number summary of data
Boxplots.
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Warmup What is the shape of the distribution? Will the mean be smaller or larger than the median (don’t calculate) What is the median? Calculate the.
Measures of Position Quartiles Interquartile Range
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Boxplots.
CHAPTER 1 Exploring Data
Boxplots.
Basic Practice of Statistics - 3rd Edition
Measures of Central Tendency
Define the following words in your own definition
Box & Whiskers Plots AQR.
Boxplots.
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Boxplots.
Essential Statistics Describing Distributions with Numbers
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
Describing Distributions with Numbers
CHAPTER 1 Exploring Data
Box and Whisker Plots and the 5 number summary
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Boxplots.
Number Summaries and Box Plots.
T2_MAN , n=92 min max mean median std range 25 quartile 50 quartile
Presentation transcript:

Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range

Sample Range R = largest obs. - smallest obs. or, equivalently R = x max - x min

Sample Variance

Sample Standard Deviation

it is the typical (standard) difference (deviation) of an observation from the mean think of it as the average distance a data point is from the mean, although this is not strictly true What is a standard deviation?

Sample Interquartile Range IQR = third quartile - first quartile or, equivalently IQR = Q 3 - Q 1

Measures of Variation - Some Comments Range is the simplest, but is very sensitive to outliers Variance units are the square of the original units Interquartile range is mainly used with skewed data (or data with outliers) We will use the standard deviation as a measure of variation often in this course

Boxplot - a 5 number summary smallest observation (min) Q 1 Q 2 (median) Q 3 largest observation (max)

Boxplot Example - Table 4, p. 30 smallest observation = 3.20 Q 1 = Q 2 (median) = Q 3 = largest observation =

Creating a Boxplot Create a scale covering the smallest to largest values Mark the location of the five numbers Draw a rectangle beginning at Q1 and ending at Q3 Draw a line in the box representing Q2, the median Draw lines from the ends of the box to the smallest and largest values Some software packages that create boxplots include an algorithm to detect outliers. They will plot points considered to be outliers individually.

Min = 3.20 Q 1 = Q 2 = Q 3 = Max = Boxplot Example

Boxplot Interpretation The box represents the middle 50% of the data, i.e., IQR = length of box The difference of the ends of the whiskers is the range (if there are no outliers) Outliers are marked by an * by most software packages. Boxplots are useful for comparing two or more samples –Compare center (median line) –Compare variation (length of box or whiskers)