LECTURE 12 Tuesday, 6 October STA291 Fall 2009. Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

CHAPTER 1 Exploring Data
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
Measures of Dispersion
Numerically Summarizing Data
LECTURE 7 THURSDAY, 11 FEBRUARY STA291 Fall 2008.
Descriptive Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
A Look at Means, Variances, Standard Deviations, and z-Scores
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
STA Lecture 111 STA 291 Lecture 11 Describing Quantitative Data – Measures of Central Location Examples of mean and median –Review of Chapter 5.
Numerical Descriptive Techniques
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
1.3: Describing Quantitative Data with Numbers
STA Lecture 131 STA 291 Lecture 13, Chap. 6 Describing Quantitative Data – Measures of Central Location – Measures of Variability (spread)
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Lecture 2 Dustin Lueker.  Center of the data ◦ Mean ◦ Median ◦ Mode  Dispersion of the data  Sometimes referred to as spread ◦ Variance, Standard deviation.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Summary Statistics: Measures of Location and Dispersion.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 6: Interpreting the Measures of Variability.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
Descriptive Statistics ( )
CHAPTER 1 Exploring Data
Numerical Descriptive Measures
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Descriptive Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
CHAPTER 1 Exploring Data
Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Presentation transcript:

LECTURE 12 Tuesday, 6 October STA291 Fall 2009

Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software SAS output (Murder Rate Data) Note the distance from the median to the maximum compared to the median to the minimum.

Interquartile Range 3 The Interquartile Range (IQR) is the difference between upper and lower quartile IQR = Q 3 – Q 1 IQR= Range of values that contains the middle 50% of the data IQR increases as variability increases

Box Plot (AKA Box-and-Whiskers Plot) A box plot is basically a graphical version of the five- number summary (unless there are outliers) It consists of a box that contains the central 50%of the distribution (from lower quartile to upper quartile), A line within the box that marks the median, And whiskers that extend to the maximum and minimum values, unless there are outliers 4

Outliers An observation is an outlier if it falls – more than 1.5 IQR above the upper quartile or – more than 1.5 IQR below the lower quartile Example: Murder Rate Data w/o DC – upper quartile Q3 = 10.3 – IQR = 6.4 – Q IQR = _______ – Any outliers? 5

Illustrating Boxplot with Murder Rate Data 6 (w/o DC—key: 20|3 = 20.3)

Measures of Variation 7 Mean and Median only describe a typical value, but not the spread of the data Two distributions may have the same mean, but different variability Statistics that describe variability are called measures of variation (or dispersion)

Sample Measures of Variation 8 Sample Range: Difference between maximum and minimum sample value Sample Variance: Sample Standard Deviation: Sample Interquartile Range: Difference between upper and lower quartile of the sample

Population Measures of Variation 9 Population Range: Difference between maximum and minimum population values Population Variance: Population Standard Deviation: Population Interquartile Range: Difference between upper and lower quartile of the population

Range 10 Range: Difference between the largest and smallest observation Very much affected by outliers (one misreported observation may lead to an outlier, and affect the range) The range does not always reveal different variation about the mean

Deviations 11 The deviation of the i th observation, x i, from the sample mean,, is, the difference between them The sum of all deviations is zero because the sample mean is the center of gravity of the data (remember the balance beam?) Therefore, people use either the sum of the absolute deviations or the sum of the squared deviations as a measure of variation

Sample Variance 12 The variance of n observations is the sum of the squared deviations, divided by n – 1.

Variance: Interpretation 13 The variance is about the average of the squared deviations “average squared distance from the mean” Unit: square of the unit for the original data Difficult to interpret Solution: Take the square root of the variance, and the unit is the same as for the original data

Sample standard deviation 14 The standard deviation s is the positive square root of the variance

Standard Deviation: Properties 15 s ≥ 0 always s = 0 only when all observations are the same If data is collected for the whole population instead of a sample, then n-1 is replaced by n s is sensitive to outliers

Standard Deviation Interpretation: Empirical Rule 16 If the histogram of the data is approximately symmetric and bell-shaped, then – About 68% of the data are within one standard deviation from the mean – About 95% of the data are within two standard deviations from the mean – About 99.7% of the data are within three standard deviations from the mean

Standard Deviation Interpretation: Empirical Rule 17

Sample Statistics, Population Parameters 18 Population mean and population standard deviation are denoted by the Greek letters μ (mu) and  (sigma) They are unknown constants that we would like to estimate Sample mean and sample standard deviation are denoted by and s They are random variables, because their values vary according to the random sample that has been selected

Attendance Survey Question On a your index card: – Please write down your name and section number – Today’s Question: