1.Assemble the following tools: Graphing calculator z-tables (modules 3 - 5)z-tables Paper and pencil Reference for calculator keystrokes 2.Complete the.

Slides:



Advertisements
Similar presentations
Measures of Central Tendency and Variation 11-5
Advertisements

Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
PPA 415 – Research Methods in Public Administration Lecture 4 – Measures of Dispersion.
Biostatistics Unit 2 Descriptive Biostatistics 1.
Learning Objectives In this chapter you will learn about the importance of variation how to measure variation range variance standard deviation.
Measures of Central Tendency
Describing Data: Numerical
Describing distributions with numbers
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Describing Data from One Variable
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
BIOSTAT - 2 The final averages for the last 200 students who took this course are Are you worried?
A Closer Look at the NEW High School Statistics Standards Focus on A.9 and AII.11 K-12 Mathematics Institutes Fall 2010 A Closer Look at the NEW High School.
Overview Summarizing Data – Central Tendency - revisited Summarizing Data – Central Tendency - revisited –Mean, Median, Mode Deviation scores Deviation.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Objectives Vocabulary
Table of Contents 1. Standard Deviation
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
By: Amani Albraikan 1. 2  Synonym for variability  Often called “spread” or “scatter”  Indicator of consistency among a data set  Indicates how close.
Describing distributions with numbers
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Skewness & Kurtosis: Reference
SECTION 12-3 Measures of Dispersion Slide
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Measures of Dispersion How far the data is spread out.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Descriptive Statistics: Presenting and Describing Data.
Central Tendency & Dispersion
Sociology 5811: Lecture 3: Measures of Central Tendency and Dispersion Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Lesson 25 Finding measures of central tendency and dispersion.
Applied Quantitative Analysis and Practices LECTURE#07 By Dr. Osman Sadiq Paracha.
Measures of Center & Spread. Measures of Center.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
MEASURES OF DISPERSION Standard Deviation and Variance.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Measures of Dispersion Measures of Variability
Describing Data: Summary Measures. Identifying the Scale of Measurement Before you analyze the data, identify the measurement scale for each variable.
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
Descriptive Statistics ( )
INTRODUCTION TO STATISTICS
One-Variable Statistics
Measures of Dispersion
Introduction to Summary Statistics
Introduction to Summary Statistics
Descriptive Statistics: Presenting and Describing Data
Introduction to Summary Statistics
Numerical Descriptive Measures
Introduction to Summary Statistics
Chapter 5: Describing Distributions Numerically
Introduction to Summary Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Summary Statistics
Presentation transcript:

1.Assemble the following tools: Graphing calculator z-tables (modules 3 - 5)z-tables Paper and pencil Reference for calculator keystrokes 2.Complete the diagnostic to assess your prior knowledge or at least review the diagnostic to help you frame the content within this module. Before beginning this module

1.Access solutions if you completed the diagnostic.solutions 2.If you did not score 100%, continue with this module 3.If you score 100%, you may proceed to the next module. Next Steps

Previously Learned Defined the mean in terms of a balance point Represented the mean graphically Recognized and interpreted sigma notation Entered data into a graphing calculator and calculated 1-variable statistics Recognized x and µ as symbolic representations of the mean

Outcomes Compare and contrast mean absolute deviation, variance, and standard deviation. Calculate and apply the three measures of dispersion. Understand the properties of three measures of dispersions. Interpret the mean absolute deviation and standard deviation. Determine when to use Mean Absolute Deviation and Standard Deviation

How do you describe a data set? Measures of Central Tendencies –Mean (μ, x) –Median –Mode Measures of Spread –Range only uses the minimum and maximum. Extremely sensitive to outliers. –Interquartile Range (IQR) Represents the middle 50%. It is relative to the median. –Mean Absolute Deviation –Variance –Standard Deviation These three new measures of variability are each relative to the mean.

Measures of Dispersion Measures of Central Tendency (Mean, Median, Mode) give a typical value for a data set. Measures of Dispersion(Variability or Spread) provide measures to describe the data set’s variation from the typical value. Are the data points clustered tightly around the mean or median, somewhat near the mean or median, or very spread out from the mean or median? What value is the average, which is the middle value, which occurs the most, or which is most typical?

Measures of Dispersion (Spread) Relative to the Mean

In Central Tendency Park, a student counted the number of players playing basketball each day over a two week period and gathered the following data Data Set#1 Example: Comparing Data Sets

In Dispersion Park, another student counted the number of players playing basketball over a two week period and gathered the following data Data Set #2 Example: Comparing Data Sets

How are the two data sets similar and how are they different? Data Set #1 Data Set #2 In this case, one measure does not tell us very much about the data sets.

Value Data Set #1 Data Set # Let’s look at the frequency of the values. Compare and contrast the data sets

Data Set # X X X X X X X X X X X X X X Data Set #2 What do you observe? X X X X X X X X X X X X X X How does the spread of the data compare between the two data sets? Where is the data relative to the mean?

One way to analyze a data set is to look at the deviations between each data value and the mean. We could then compare the deviations of one data set with the deviations of another comparable data set. How can we do this?

Deviation from the mean: Data Set #1 (sorted) Deviation is the observed value minus the mean. It can be positive or negative depending on whether the observed value is greater than or less than the mean.

Which data set has data values further from the mean or is more spread out? If we compare the sums of the deviations of the two data sets, we would find that both have a sum of 8. While the sum of the deviations are the same, the variation of data set is quite different. A B 3 3 Each data set has a mean equal to 3.

Data Set #1 (sorted) What if we found the average of all the deviations? = 0 The sum of the deviations below the mean is The sum of the deviations above the mean is This makes sense because we know mean is the balance point

What if we found the average of all the DISTANCES? Data Set #1 (sorted) Remember that distance is positive!

Compare the Mean Absolute Deviations Data Set #1 (sorted) How do you expect the mean absolute deviation of data set #1 to compare to data set #2 ? In other words, compare the average distance of the data values from the mean for each data set. Data Set #2 (sorted)

Calculate the Mean Absolute Deviation for Data Set #2 Value |x i - µ| Total Data Set #2 (sorted)

Data Set #1 Data Set #2 How does Mean Absolute Deviation help us differentiate between the data sets? X X X X X X X X X X X X X X X X X X X X X X X X X X X X Having these two measures tell us more about how different or similar the data sets are. The larger the value, the more spread out the data or the more variablity exists in the data.

Practice with a Small Data Set The ages (years) of 10 grooms at their first marriage are given below. Determine the mean absolute deviation. 24.3, 46.6, 41.6, 32.9, 26.8, 39.8, 21.5, 45.7, 33.9, 35.1 Value|x i - µ| Total69.4

Measures of Dispersion (Spread) Relative to the Mean The variance and standard deviation are the most common and useful measures of variability.

Variance is the average of the deviations squared. Data Set #1 (sorted) To do the formula substitution, notice there is one -35, two -25’s, three -15’s, etc. players 2 By squaring the deviations, all squares will be positive. Therefore, the sum will not be zero.

Calculate the Variance for Data Set #2 Value (x i - µ) Total Data Set #2 (sorted)

Data Set #1 How does variance help us differentiate between the data sets? X X X X X X X X X X X X X X Data Set #2 X X X X X X X X X X X X X X Variance can be very cumbersome to use especially with large data values and large spread. Squaring the deviation can result in large values. Just look at the value for data set #1. The unit for variance is the original data’s unit squared. Not always very meaningful when trying to interpret. Again, the larger the value for variance, the more variability in the data.

Practice with a Small Data Set The ages (years) of 10 grooms at their first marriage are given below. Calculate the variance. 24.3, 46.6, 41.6, 32.9, 26.8, 39.8, 21.5, 45.7, 33.9, 35.1 Value(x i - µ) Total

Measures of Dispersion (Spread) Relative to the Mean

Standard Deviation of a Population Data Set Is just the positive square root of the variance. So for data set # 1 Mean Absolute Deviation, Variance and Standard Deviation will always be POSITIVE!!!!!!

Calculate the Standard Deviation for Data Set #2 Value (x i -µ) Total Data Set #2 (sorted)

How do you do this on the calculator? For the TI, –enter the data into a list, STAT, EDIT –Select STAT, CALC, 1-Var Stats, and then list you entered the data into. For the Casio, –Enter the data into a list, Select MENU, STAT –Set up the calculator, Select F6 (Set) and verify the 1- Var List Xlist has the specified list, Exit. –Calculate the mean and standard deviation, select 1- Var Stats. σ x is the standard deviation of the population. Calculate variance by squaring the standard deviation. Take standard deviation out to 4 decimal places when calculating the variance. This way the answers calculated by the formula will be closer to those calculated from squaring the standard deviation.

Data Set #2 How does Standard Deviation help us differentiate between the data sets? X X X X X X X X X X X X X X Data Set # X X X X X X X X X X X X X X Just like mean absolute deviation, the more the data is spread out from the mean, the larger the value.

Practice with a Small Data Set The annual number of deaths from tornadoes in the United States from 1990 through 2000 is given. 53, 39, 39, 33, 69, 30, 25, 67, 130, 94, 40. Determine the standard deviation and variance.

How would you compare and contrast each pair of data sets? Problem 1 Data Set 1 Data Set 2 μ = 212 μ = 208 σ = 14 σ = 23 Problem 2 Data Set 1 Data Set 2 μ = 55 μ = 65 Mean.Abs.Dev. = 4.9 Mean.Abs.Dev. = 1.5

How would you compare the variability of the data of the two data sets? Problem 1 Data Set 1 Data Set 2 μ = 212 μ = 208 σ = 14 σ = 23 Problem 2 Data Set 1 Data Set 2 μ = 55 μ = 65 Mean.Abs.Dev. = 4.9 = 1.5 Problem 1-Answer Data Set 1 has a higher mean but it’s data is less spread out than Data Set 2 since its standard deviation is less than data set 2. Problem 2-Answer Data Set 1 has a lower mean but it’s data has more variability than Data Set 2 since its mean absolute deviation is more than Data Set 2.

Without calculating the standard deviation, order each figure’s standard deviation from smallest to largest A.A B.B. Answer: B, C, A C.C µ = 4.5, σ = 2.29

Describe the standard deviation you would expect for the situation given. You’re in charge of a carnival game, where the player will win a prize by being the first to shoot 3 ping pong balls through a hole. 1.What type of standard deviation for the accuracy of the ping pong pistols would you want to have? 2. What would the participants want? A larger standard deviation would indicate less pistol accuracy and less chance of getting the 3 ping pong balls in the hole. Thus, no prize. The participants would benefit from a small standard deviation, indicating a greater accuracy with the pistol.

Describe the standard deviation you would expect for the situation given. You are having laser eye surgery. Describe the relative size of the standard deviation you would want in the precision of the laser. You have been hired by a company that has employees with years of service ranging from 0 to 27 years. What type of standard deviation would you expect in the salaries of the employees? What would the data look like if the standard deviation is 0?

“Sample Standard Deviation” Bessel Adjustment Sample standard deviation formula subtracts one from the size. This results in dividing by a smaller number, therefore getting a larger value. Since it is a sample of the population, it is better to err with a larger standard deviation. You could see though, that as the sample size gets larger and larger, subtracting 1 has less and less effect. (example, dividing by 1000 versus 999) Teacher Note:

How do the 2 data sets compare? Data Set #1 Data Set #2 Screen shots from a TI calculator. (Remember on a Casio, the sample and population standard deviation are in reverse order.

Population vs. Sample Standard Deviation for Data Set #1 μ if data set is population Sample Standard Deviation Population Standard Deviation Throughout Algebra I and Algebra II, we will be dealing with population only!

By examining the formulas, which of these would be the least sensitive to outliers?

Mean absolute deviation is more resistant to outliers than variance or standard deviation. With variance and standard deviation, the deviation of the outlier is being squared, adding a significantly larger value to the sum. Let’s test the sensitivity of outliers, by adding a single value of 100 to data set #2. We would expect the mean to increase. How much did each of our measures of spread increase? Which had the least amount of change? Which had the most amount of change? Value |xi- µ |(x i - µ ) Total 120/

Symbol Review PopulationSample Number of Entries Nn Mean Deviation Variance Standard Deviation

Summary The sum of the deviations,, is equal to 0. Mean absolute deviation, variance, and standard deviation are all measures of variability (dispersion or spread) relative to the mean. Mean Absolute Deviation (MAD) –It’s more resistant to outliers than variance and standard deviation. –If the data has outliers, the MAD will be a better indicator of spread. –The greater the value, the more dispersed the data will be from the mean. –It’s always positive.

Summary Variance –It’s the square of Standard Deviation. –Since the units of variance are units 2, this measure is used less frequently than standard deviation. –It’s always positive. Standard Deviation –It’s the square root of variance. –It’s more sensitive to outliers than Mean Absolute Deviation. –The units are the same as the data. –The greater its value, the more dispersed or spread out the data will be from the mean. –The closer it is to 0, the more clustered the data will be about the mean.

Revisit your diagnostic problem set that you completed prior to starting this module or take it for the first time. Assess your knowledge and understanding of the targeted outcome skills. Go to Solutions.Solutions Go to final slide for a quick survey.

Please click on the picture to take a quick 4 question survey. Your input is important to us.