Describing Quantitative Data Numerically

Slides:



Advertisements
Similar presentations
NUMERICAL DESCRIPTIVE STATISTICS Measures of Variability.
Advertisements

Statistics for the Social Sciences
Measures of Central Tendency
Section 3.2 Measures of Variation Range Standard Deviation Variance.
Measures of Dispersion Week 3. What is dispersion? Dispersion is how the data is spread out, or dispersed from the mean. The smaller the dispersion values,
1.3: Describing Quantitative Data with Numbers
Section 3.2 Measures of Dispersion. Range = the difference between the largest value and smallest value range = (largest value) – (smallest value)
Describing Quantitative Data Numerically Symmetric Distributions Mean, Variance, and Standard Deviation.
Statistics Describing, Exploring and Comparing Data
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
AP Statistics HW: p #39, 43,44, 45 Lab due next class Obj: to use variance and standard deviation Do Now: 1.Make a modified box plot for the # of.
Today: Standard Deviations & Z-Scores Any questions from last time?
Measures of Central Tendency and Dispersion from Grouped Data.
Chapter 1: Exploring Data Lesson 7: Variance and Standard Deviation Mrs. Parziale.
Using Standard Deviation in AP Biology. Why would we use the standard deviation to analyze our lab result? In statistics and probability theory, standard.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
4.1 Measures of Center We are learning to…analyze how adding another piece of data can affect the measures of center and spread.
2.4 Measures of Variation Prob & Stats Mrs. O’Toole.
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Exploratory Data Analysis
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
CHAPTER 1 Exploring Data
One-Variable Statistics
Chapter 1: Exploring Data
Copyright © Cengage Learning. All rights reserved.
Using Standard Deviation in AP Biology
Describing Distributions Numerically
Objective: Given a data set, compute measures of center and spread.
Descriptive Statistics: Presenting and Describing Data
Do-Now-Day 2 Section 2.2 Find the mean, median, mode, and IQR from the following set of data values: 60, 64, 69, 73, 76, 122 Mean- Median- Mode- InterQuartile.
Section 3.2 Measures of Spread.
Standard Deviation.
Standard Deviation.
DAY 3 Sections 1.2 and 1.3.
Please take out Sec HW It is worth 20 points (2 pts
Histograms: Earthquake Magnitudes
Statistics Variability
Numerical Descriptive Measures
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Measuring location: percentiles
Means & Medians.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Describing a Skewed Distribution Numerically
Chapter 1: Exploring Data
Summary (Week 1) Categorical vs. Quantitative Variables
Measures of Center.
Summary (Week 1) Categorical vs. Quantitative Variables
Describing Distributions Numerically
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Standard Deviation.
14.3 Measures of Dispersion
Warm Up – 5/15 - Thursday Consider the following test scores: Answer the following in complete sentences: A) Who is the best student? B) How do you know?
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Warm up Honors Algebra 2 3/14/19
Numerical Descriptive Measures
Chapter 1: Exploring Data
Presentation transcript:

Describing Quantitative Data Numerically Symmetric Distributions Mean, Variance, and Standard Deviation

Symmetric Distributions Describing a “typical” value for a set of data when the distribution is at least approximately symmetric allows us to choose our measure of center: We can use either Mean Median

Finding the Mean of a Distribution The mean of a set of numbers is the arithmetic average. We find this value by adding together each value and then dividing by the number of values we added together The formula for the mean is:

Let’s see the Formula in Action Consider Babe Ruth’s HR data A check of a dotplot indicates that the distribution is approximately symmetric 54 59 35 41 46 25 47 60 49 34 22

So… the first step is to add all the values 54 + 59 + 35 + 41 + 46 + 25 + 47 + 60 + 54 + 46 + 49 + 46 + 41 + 34 + 22 = 659 Now we need to divide that sum by the number of values we added together.

So the mean of the data is 43. 9333 So the mean of the data is 43.9333. Now, if we wish to talk about the “typical” number of home runs for Babe Ruth (and we ALWAYS wish to talk about the context of our data!), we could say something like… On average, Babe Ruth hit approximately 44 home runs per season during the 15 seasons he played.

Remember that although the center is a very important part of our description, we also need to look at the spread of the distribution. When we use the mean as our measure of center, we use the standard deviation as our measure of spread. We can think of standard deviation as “an average distance of values from the mean” To calculate the standard deviation by hand, we’ll make a data table…

Calculating Standard Deviation

X X - X (X – X)2 54 43.9333 10.0667 101.3384 59 15.0667 227.0054 35 -8.9333 79.8038 41 -2.9333 8.6042 46 2.0667 4.2712 25 -18.9333 358.4698 47 3.0667 9.4046 60 16.0667 258.1388 49 5.0667 25.6714 34 -9.9333 98.6704 22 -21.9333 481.0696 SUM .0005 (essentially 0) 1770.9333

Creating the Data Table X - X 54 – 43.9333 = 10.0667 15.0667 -8.9333 -2.9333 2.0667 -18.9333 3.0667 16.0667 10.0667 5.0667 -9.9333 -21.9333 The first part of our formula indicates that we need to find the distance from the mean for each of our values (x – x)

Now that we know the individual distances for each value, we want to find an “average” of those distances. To find an average we have to add all the values together We find, though, that the sum of those values is always zero. Why? Because some of the values are above the mean (positive values) and some are below (negative). The positives and negatives cancel each other out. So what values can we use to find the “average” distance from the mean for a set of values?

One way to get rid of the negative values in these distances is to square each of the values. That’s exactly what our formula tells us to do. (x – x)2 Once we have these values, to find the average we must add them together (X – X)2 101.3384 227.0054 79.8038 8.6042 4.2712 358.4698 9.4046 258.1388 25.6714 98.6704 481.0696 SUM = 1770.9333

The final step in finding an average is to divide by the number of values we added together, but our formula is a little different here. Instead of dividing by the total number of values we added together, we divide by 1 less than the total. Why? We have taken a “sample” of the data instead of every piece of data in the population. Since another “sample” would produce a slightly different mean, it would also produce a slightly different standard deviation. Dividing by 1 less than the total number of values added together will give us a slightly larger spread to account for this sampling variation.

So, we divide the “sum of the squared deviations” by n-1 We have now calculated everything inside the square root sign This value is an important one—It is called the Variance --S2

Since the units of the variance are not the same as our original units, we have one more calculation we must make. The square root of the variance will restore the original units and give us the “average distance from the mean”—the standard deviation S = 11.2470

TI-Tips Mean, Variance, & Standard Deviation Find the MEAN Enter the data into a list 2nd STAT MATH 3:mean(list name) If you have used a frequency list, 3:mean(data list, freq list)

TI-Tips Find the Variance Enter the data in a list 2nd STAT MATH 8:variance(list name) If you have used a frequency list, 8:variance(data list, freq list)

TI-Tips Find Standard Deviation Enter the data in a list 2nd STAT Math 7:stdDev(list name) If you have used a frequency list, 7:stdDev(data list, freq list)

TI-Tips Find Mean and Std Dev. Enter the data in a list STAT Calc 1:1-Var Stats(list name) Enter If you have used a frequency list, 1:1-var stats(data list, freq list) Discuss Sx and sigma (x)

Additional Resources Practice of Statistics: Pg 30-34, 43-46