Presentation is loading. Please wait.

Presentation is loading. Please wait.

Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.

Similar presentations


Presentation on theme: "Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2."— Presentation transcript:

1 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2

2 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 2 Some key words: sample vs. population categorical (ordinal?) vs. quantitative explanatory vs. response outlier mean vs. median standard deviation vs. range vs. IQR range histogram vs. boxplot shape?

3 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 3 Symmetric: mean = median Skewed Left: mean < median Skewed Right: mean > median

4 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 4

5 5 2.7Bell-Shaped Distributions of Numbers Many measurements follow a predictable pattern: Most individuals are clumped around the center The greater the distance a value is from the center, the fewer individuals have that value. Variables that follow such a pattern are said to be “bell-shaped”. A special case is called a normal distribution or normal curve.

6 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 6 Example 2.11 Bell-Shaped British Women’s Heights Data: representative sample of 199 married British couples. Below shows a histogram of the wives’ heights with a normal curve superimposed. The mean height = 1602 millimeters.

7 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 7 Describing Spread with Standard Deviation Standard deviation measures variability by summarizing how far individual data values are from the mean. Think of the standard deviation as roughly the average distance values fall from the mean.

8 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 8 Formula for the (sample) standard deviation: The value of s 2 is called the (sample) variance. An equivalent formula, easier to compute, is: Calculating the Standard Deviation

9 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 9 Data sets usually represent a sample from a larger population. If the data set includes measurements for an entire population, the notations for the mean and standard deviation are different, and the formula for the standard deviation is also slightly different. A population mean is represented by the symbol  (“mu”), and the population standard deviation is Population Standard Deviation

10 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 10 Interpreting the Standard Deviation for Bell-Shaped Curves: The Empirical Rule For any bell-shaped curve, approximately 68% of the values fall within 1 standard deviation of the mean in either direction 95% of the values fall within 2 standard deviations of the mean in either direction 99.7% of the values fall within 3 standard deviations of the mean in either direction

11 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 11 The Empirical Rule, the Standard Deviation, and the Range Empirical Rule => the range from the minimum to the maximum data values equals about 4 to 6 standard deviations for data with an approximate bell shape. You can get a rough idea of the value of the standard deviation by dividing the range by 6.

12 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 12 Example 2.11 Women’s Heights (cont) Mean height for the 199 British women is 1602 mm and standard deviation is 62.4 mm. 68% of the 199 heights would fall in the range 1602  62.4, or 1539.6 to 1664.4 mm 95% of the heights would fall in the interval 1602  2(62.4), or 1477.2 to 1726.8 mm 99.7% of the heights would fall in the interval 1602  3(62.4), or 1414.8 to 1789.2 mm

13 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 13 Example 2.11 Women’s Heights (cont) Summary of the actual results: Note: The minimum height = 1410 mm and the maximum height = 1760 mm, for a range of 1760 – 1410 = 350 mm. So an estimate of the standard deviation is:

14 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 14 Standardized z-Scores Standardized score or z-score: Example: Mean resting pulse rate for adult men is 70 beats per minute (bpm), standard deviation is 8 bpm. The standardized score for a resting pulse rate of 80: A pulse rate of 80 is 1.25 standard deviations above the mean pulse rate for adult men.

15 Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. 15 The Empirical Rule Restated For bell-shaped data, About 68% of the values have z-scores between –1 and +1. About 95% of the values have z-scores between –2 and +2. About 99.7% of the values have z-scores between –3 and +3.


Download ppt "Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2."

Similar presentations


Ads by Google