Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data

Slides:



Advertisements
Similar presentations
DESCRIBING DISTRIBUTION NUMERICALLY
Advertisements

Class Session #2 Numerically Summarizing Data
The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
Measures of Dispersion
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Measures of Dispersion or Measures of Variability
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Variability Ibrahim Altubasi, PT, PhD The University of Jordan.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Chapter 2 Describing Data with Numerical Measurements
Describing distributions with numbers
Recap All about measures of location Mean Median Mode
Chapter 2 Describing Data with Numerical Measurements General Objectives: Graphs are extremely useful for the visual description of a data set. However,
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Objectives 1.2 Describing distributions with numbers
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
Chapter 4 Variability. Variability In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In.
Descriptive statistics Describing data with numbers: measures of variability.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Chapter 4 Numerical Methods for Describing Data. Parameter - Fixed value about a population Typical unknown Suppose we want to know the MEAN length of.
Table of Contents 1. Standard Deviation
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Numerical Methods for Describing Data
STATISTICS “CALCULATING DESCRIPTIVE STATISTICS –Measures of Dispersion” 4.0 Measures of Dispersion.
INVESTIGATION 1.
Variability Pick up little assignments from Wed. class.
Numerical Measures of Variability
Review: Measures of Dispersion Objectives: Calculate and use measures of dispersion, such as range, mean deviation, variance, and standard deviation.
Summary Statistics and Mean Absolute Deviation MM1D3a. Compare summary statistics (mean, median, quartiles, and interquartile range) from one sample data.
1 Chapter 4 Numerical Methods for Describing Data.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Chapter 5 Describing Distributions Numerically.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Summary Statistics: Measures of Location and Dispersion.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
1 1 Slide © 2003 South-Western/Thomson Learning TM Chapter 3 Descriptive Statistics: Numerical Methods n Measures of Variability n Measures of Relative.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
© 2012 W.H. Freeman and Company Lecture 2 – Aug 29.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Measures of Central Tendency, Dispersion, IQR and Standard Deviation How do we describe data using statistical measures? M2 Unit 4: Day 1.
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
Chapter 1 Lesson 7 Variance and Standard Deviation.
Data Analysis Student Text :Chapter 7. Data Analysis MM2D1. Using sample data, students will make informal inferences about population means and standard.
Chapter 4 Lesson 4.1 Numerical Methods for Describing Data 4.1: Describing the Center of a Data Set.
Numerical Methods for Describing Data
Notes 13.2 Measures of Center & Spread
Numerical Methods for Describing Data
Measures of dispersion
(12) students were asked their SAT Math scores:
Numerical Descriptive Measures
Unit 4 Statistics Review
Summarizing Numerical Data Sets
Algebra I Unit 1.
LESSON 4: MEASURES OF VARIABILITY AND PROPORTION
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Numerical Descriptive Measures
Describing Distributions with Numbers
Presentation transcript:

Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data 4.2: Describing Variability in a Data Set

Why is the study of variability important? Does this can of soda contain exactly 12 ounces? There is variability in virtually everything Reporting only a measure of center doesn’t provide a complete picture of the distribution.

Notice that these three data sets all have the same mean and median (at 45), but they have very different amounts of variability.

Measures of Variability The simplest numeric measure of variability is range. Range = largest observation – smallest observation The first two data sets have a range of 50 (70-20) but the third data set has a much smaller range of 10.

Measures of Variability Remember the sample of 6 fish that we caught from the lake . . . They were the following lengths: 3”, 4”, 5”, 6”, 8”, 10” The mean length was 6 inches. We can calculate the deviations from the mean. Another measure of the variability in a data set uses the deviations from the mean (x – x).

YES Now find how each observation deviates from the mean. The mean is considered the balance point of the distribution because it “balances” the positive and negative deviations. This is the deviation from the mean. x (x - x) 3 4 5 6 8 10 Sum 3-6 -3 -2 Find the rest of the deviations from the mean -1 What is the sum of the deviations from the mean? 2 Will this sum always equal zero? 4 YES

Measures of Variability Population variance is denoted by s2. Remember the sample of 6 fish that we caught from the lake . . . They were the following lengths: 3”, 4”, 5”, 6”, 8”, 10” The mean length was 6 inches. We can calculate the deviations from the mean. What’s the sum of these deviations? Can we find an average deviation? The estimated average of the deviations squared is called the variance.

Remember the sample of 6 fish that we caught from the lake . . . Find the variance of the length of fish. First square the deviations x (x - x) (x - x)2 3 -3 4 -2 5 -1 6 8 2 10 Sum Finding the average of the deviations would always equal 0! 9 4 1 16 What is the sum of the deviations squared? Divide this by 5. 34 s2 = 6.5

Measures of Variability The square root of the variance is called standard deviation. Standard Deviation: the average difference from the mean s2 = 6.8 inches2 so s = 2.608 inches The fish in our sample deviate from the mean of 6 by an average of 2.608 inches.

Calculation of standard deviation of a sample The most commonly used measures of center and variability are the mean and standard deviation, respectively. Calculation of standard deviation of a sample Population standard deviation is denoted by s.

Measures of Variability Interquartile range (IQR) is the range of the middle half of the data. Lower quartile (Q1) is the median of the lower half of the data Upper quartile (Q3) is the median of the upper half of the data IQR = Q3 – Q1 What advantage does the interquartile range have over the standard deviation? The IQR is resistant to extreme values

Find the interquartile range for this set of data. The Chronicle of Higher Education (2009-2010 issue) published the accompanying data on the percentage of the population with a bachelor’s or higher degree in 2007 for each of the 50 states and the District of Columbia. 21 27 26 19 30 35 35 26 47 26 27 30 24 29 22 24 29 20 20 27 35 38 25 31 19 24 27 27 23 34 25 32 26 24 22 28 26 30 23 25 22 25 29 33 34 30 17 25 23 34 26 Find the interquartile range for this set of data.

First put the data in order & find the median. 21 27 26 19 30 35 35 26 47 26 27 30 24 29 22 24 29 20 20 27 35 38 25 31 19 24 27 27 23 34 25 32 26 24 22 28 26 30 23 25 22 25 29 33 34 30 17 25 23 34 26 17 19 19 20 20 21 22 22 22 23 23 23 24 24 24 24 25 25 25 25 25 26 26 26 26 26 26 27 27 27 27 27 28 29 29 29 30 30 30 30 31 32 33 34 34 34 35 35 35 38 47 24 26 30 First put the data in order & find the median. Find the lower quartile (Q1) by finding the median of the lower half. Find the upper quartile (Q3) by finding the median of the upper half. IQR = 30 – 24 = 6

Homework Pg.120: #4.19, 4.21, 4.22, 4.26