Review – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation.

Slides:



Advertisements
Similar presentations
Chapter 2: Modeling Distributions of Data
Advertisements

HS 67 - Intro Health Stat The Normal Distributions
Topic 3 The Normal Distribution. From Histogram to Density Curve 2 We used histogram in Topic 2 to describe the overall pattern (shape, center, and spread)
Homework for 2.1 Day 1: 41, 43, 45, 47, 49, 51. 1) To use the rule to estimate the percent of observations from a Normal Distribution that.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
CHAPTER 3: The Normal Distributions Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
3-3 Measures of Variation. Definition The range of a set of data values is the difference between the maximum data value and the minimum data value. Range.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 4: The Normal Distribution and Z-Scores.
12.3 – Measures of Dispersion
Chapter 5 DESCRIBING DATA WITH Z-SCORES AND THE NORMAL CURVE.
Chapter 2: The Normal Distribution
Density Curves and Normal Distributions
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Density Curve A density curve is the graph of a continuous probability distribution. It must satisfy the following properties: 1. The total area.
BPT 2423 – STATISTICAL PROCESS CONTROL.  Frequency Distribution  Normal Distribution / Probability  Areas Under The Normal Curve  Application of Normal.
Chapter 13, Part 1 STA 200 Summer I At this point… we have a couple of methods for graphing a data set (histogram, stem-and-leaf plot) we have a.
Normal Curves and Sampling Distributions
Probabilistic and Statistical Techniques
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
1 Normal Random Variables In the class of continuous random variables, we are primarily interested in NORMAL random variables. In the class of continuous.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.
Review Measures of central tendency
CHAPTER 3: The Normal Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Density Curves Can be created by smoothing histograms ALWAYS on or above the horizontal axis Has an area of exactly one underneath it Describes the proportion.
Essential Statistics Chapter 31 The Normal Distributions.
Slide 1 © 2002 McGraw-Hill Australia, PPTs t/a Introductory Mathematics & Statistics for Business 4e by John S. Croucher 1 n Learning Objectives –Identify.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
Think about this…. If Jenny gets an 86% on her first statistics test, should she be satisfied or disappointed? Could the scores of the other students in.
+ Chapter 2: Modeling Distributions of Data Section 2.1 Describing Location in a Distribution The Practice of Statistics, 4 th edition - For AP* STARNES,
AP Statistics Chapter 2 Notes. Measures of Relative Standing Percentiles The percent of data that lies at or below a particular value. e.g. standardized.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Section 3-3 Measures of Variation. WAITING TIMES AT DIFFERENT BANKS Jefferson Valley Bank (single waiting line) Bank of Providence.
NORMAL DISTRIBUTION Chapter 3. DENSITY CURVES Example: here is a histogram of vocabulary scores of 947 seventh graders. BPS - 5TH ED. CHAPTER 3 2 The.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
+ Chapter 2: Modeling Distributions of Data Lesson 1: Describing Location in a Distribution.
B AD 6243: Applied Univariate Statistics Data Distributions and Sampling Professor Laku Chidambaram Price College of Business University of Oklahoma.
Chapter 2 Modeling Distributions of Data Objectives SWBAT: 1)Find and interpret the percentile of an individual value within a distribution of data. 2)Find.
Descriptive Statistics Review – Chapter 14. Data  Data – collection of numerical information  Frequency distribution – set of data with frequencies.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
1 Chapter 2: The Normal Distribution 2.1Density Curves and the Normal Distributions 2.2Standard Normal Calculations.
+ Progress Reports Homework Test Corrections Signed 1.
Interpreting Histograms
Thinking Mathematically Statistics: 12.4 The Normal Distribution.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Chapter 7: Sampling Distributions Section 7.2 Sample Proportions.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.Copyright © 2010 Pearson Education Section 3-3 Measures of Variation.
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
January 24,  So far, we have used histograms to represent the overall shape of a distribution. Now smooth curves can be used:
THE NORMAL DISTRIBUTION
Copyright ©2011 Brooks/Cole, Cengage Learning Continuous Random Variables Class 36 1.
Descriptive Statistics Measures of Variation
Modeling Distributions of Data
Interpreting Center & Variability.
Section 3.3 Measures of Variation.
Stat 2411 Statistical Methods
ANATOMY OF THE EMPIRICAL RULE
Lecture Slides Elementary Statistics Twelfth Edition
Normal Probability Distributions
Lecture Slides Elementary Statistics Twelfth Edition
Chapter 6: Normal Distributions
Z-Scores The Normal Distribution
Stat 2411 Statistical Methods Chapter 4. Measure of Variation.
Presentation transcript:

Review – Using Standard Deviation Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. Work out which data points are within a) one standard deviation from the mean i.e. 59, 70, 73, 75, 81, 84, 86 b) two standard deviations from the mean i.e. 59, 70, 73, 75, 81, 84, 86 c) three standard deviations from the mean i.e. 35, 59, 70, 73, 75, 81, 84, 86 1

Idea! The example suggests that there may be a general rule which allows us to estimate the fraction of data points which are within a given number of standard deviations of the mean. 2

Distribution curves If the number of data points n is small, one uses a small number of class intervals and obtains a typical histogram 3

Histogram (small n) 4

Distribution curves If the number of data points n is lager, one can use more subintervals in producing a histogram for the data set 5

Histogram (Medium n) 6

Distribution curves If the number of data points n is very large, one can use a corresponding large number of class intervals and obtain a histogram which cn be seen to approach a curve in the limit as n increases. In this case, it is convenient to choose the scale on the vertical axis so that the area of each vertical bar corresponds to the fraction of data poins in the corresponding subinterval. NOTE: The total area under the graph of the histogram will therefore be 1. 7

Histogram (large n) 8

In the limit as the size of the population increases, one obtains a smooth curve. This curve was called a probability density function in Math 112 One obtains different curves corresponding to different population means and variances. 9

Normal Distributions Shape of this curve is determined by µ and σ –µ it’s centered, σ is how far it’s spread out.

Interpreting the Standard Deviation Chebyshev’s Theorem The proportion (or fraction) of any data set lying within K standard deviations of the mean is always at least 1-1/K 2, where K is any positive number greater than 1. For K=2 we obtain, at least 3/4 (75 %) of all scores will fall within 2 standard deviations of the mean, i.e. 75% of the data will fall between 11

Interpreting the Standard Deviation Chebyshev’s Theorem The proportion (or fraction) of any data set lying within K standard deviations of the mean is always at least 1-1/K 2, where K is any positive number greater than 1. For K=3 we obtain, at least 8/9 (89 %) of all scores will fall within 3 standard deviations of the mean, i.e. 89% of the data will fall between 12

Exercise 1 Data collected daily at an intersection giving the number of cars passing through. Mean = 375 with standard deviation 25 Estimate the fraction of days that more than 425 cars used the intersection. 13

Exercise 1 Data collected daily at an intersection giving the number of cars passing through. Mean = 375 with standard deviation 25 Estimate the fraction of days that more than 425 cars used the intersection. ANS: At least 75% lie in the interval (325, 425). Therefore, at most 25% lie outside this interval. 14

Exercise 1 NOTE: Assuming the data is symmetric and therefore evenly distributed about the mean, we can conclude that the 25% which lie outside the interval (325,425) are evenly distributed into 12.5% lying above 425 and 12.5% lying below , 425). Therefore, on roughly 12.5% of the days there will be more than 425 cars using the intersection. 15

If the data is known to have a histogram which is symmetric about the mean and “bell shaped”, one can improve upon Chebyshev’s Rule 16

This Data is Symmetric, Bell Shaped (or Normal Data) 17 Relative Frequency

This Data is Symmetric, Bell Shaped (or Normal Data) 18 Relative Frequency

This Data is Symmetric, Bell Shaped (or Normal Data) 19 Relative Frequency

The Empirical Rule The Empirical Rule states that for bell shaped (normal) data: 68% of all data points are within 1 standard deviations of the mean 95% of all data points are within 2 standard deviations of the mean 99.7% of all data points are within 3 standard deviations of the mean 20

The Empirical Rule The Empirical Rule states that for bell shaped (normal) data, approximately: 68% of all data points are within 1 standard deviations of the mean 95% of all data points are within 2 standard deviations of the mean 99.7% of all data points are within 3 standard deviations of the mean 21

Exercise 2 Data collected daily at an intersection giving the number of cars passing through. Mean = 375 with standard deviation 25 Estimate the fraction of days that more than 425 cars used the intersection assuming the data is bell-shaped. 22

Exercise 2 Data collected daily at an intersection giving the number of cars passing through. Mean = 375 with standard deviation 25 Estimate the fraction of days that more than 425 cars used the intersection assuming the data is bell-shaped. ANS: 2.5% (Check this!) 23

Z-Score To calculate the number of standard deviations a particular point is away from the standard deviation we use the following formula. 24

Z-Score To calculate the number of standard deviations a particular point is away from the standard deviation we use the following formula. The number we calculate is called the z-score of the measurement x. 25

Example – Z-score Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. a) Find the z-score of the data point 35. b) Find the z-score of the data point

Example – Z-score Here are eight test scores from a previous Stats 201 class: 35, 59, 70, 73, 75, 81, 84, 86. The mean and standard deviation are 70.4 and 16.7, respectively. a)Find the z-score of the data point 35. z = b) Find the z-score of the data point 73. z =

Interpreting Z-scores The further away the z-score is from zero the more exceptional the original score. Values of z less than -2 or greater than +2 can be considered exceptional or unusual (“a suspected outlier”). Values of z less than -3 or greater than +3 are often exceptional or unusual (“a highly suspected outlier”). 28

29 Example: Aptitude tests Before being accepted into a manufacturing job, one must complete two aptitude tests. Your score on the tests will decide whether you will be in management or whether you will work on the factory floor. One test is a manual dexterity test, the other is a statistics test. The manual dexterity test (out of 10) has a mean of 6 and a standard deviation of 1. The statistics test (out of 50) has a mean of 25 with a standard deviation of 3. Your score is 7/10 on the manual dexterity test, and a 34/50 on the statistics test. In which test were you exceptional?

Example: Aptitude tests 30 The problem with comparing the two test scores stems from the fact that the tests are on two different scales. If we are going to do meaningful comparisons, then we must somehow, standardize the scores.

Answer Calculate the z-score for the two tests. –Z-score of Man. Dex. = (7-6)/1 = 1 –Z-score of Stats. = (34-25)/3 = 3 Your score on the stats test was exceptionally high (3 standard deviations above the mean. 31