1 Normal Probability Distributions. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores 60 70 80 90 In.

Slides:



Advertisements
Similar presentations
Very simple to create with each dot representing a data value. Best for non continuous data but can be made for and quantitative data 2004 US Womens Soccer.
Advertisements

1 The Normal Distribution and Microsoft Excel. 2 Say a company has developed a new tire for cars. In testing the tire it has been determined the distribution.
The Normal distributions BPS chapter 3 © 2006 W.H. Freeman and Company.
HS 67 - Intro Health Stat The Normal Distributions
Stat350, Lecture#4 :Density curves and normal distribution Try to draw a smooth curve overlaying the histogram. The curve is a mathematical model for the.
Chapter 2: The Normal Distributions
AP Statistics Section 2.1 B
DENSITY CURVES and NORMAL DISTRIBUTIONS. The histogram displays the Grade equivalent vocabulary scores for 7 th graders on the Iowa Test of Basic Skills.
1 The Normal Probability Distribution. 2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores
Chapter 6 Introduction to Continuous Probability Distributions
2-5 : Normal Distribution
1 Examples. 2 Say a variable has mean 36,500 and standard deviation What is the probability of getting the value 37,700 or less? Using the z table.
1 The Normal Probability Distribution The use of tables.
The Normal Distribution
CHAPTER 3: The Normal Distributions Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Chapter 2: Density Curves and Normal Distributions
1 The Sample Mean rule Recall we learned a variable could have a normal distribution? This was useful because then we could say approximately.
BPS - 5th Ed. Chapter 31 The Normal Distributions.
Chapter 2: The Normal Distribution
What We Know So Far… Data plots or graphs
3.3 Density Curves and Normal Distributions
C HAPTER 2: T HE N ORMAL D ISTRIBUTIONS. D ENSITY C URVES 2 A density curve describes the overall pattern of a distribution. Has an area of exactly 1.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
Chapter 2: The Normal Distribution Section 1: Density Curves and the Normal Distribution.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
CHAPTER 3: The Normal Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Essential Statistics Chapter 31 The Normal Distributions.
Density Curves and the Normal Distribution.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
CHAPTER 3: The Normal Distributions
2.1 Density Curves and the Normal Distribution.  Differentiate between a density curve and a histogram  Understand where mean and median lie on curves.
Density Curves Section 2.1. Strategy to explore data on a single variable Plot the data (histogram or stemplot) CUSS Calculate numerical summary to describe.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
BPS - 5th Ed. Chapter 31 The Normal Distributions.
Essential Statistics Chapter 31 The Normal Distributions.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
NORMAL DISTRIBUTION Chapter 3. DENSITY CURVES Example: here is a histogram of vocabulary scores of 947 seventh graders. BPS - 5TH ED. CHAPTER 3 2 The.
Chapter 2: Modeling Distributions of Data
Ch 2 The Normal Distribution 2.1 Density Curves and the Normal Distribution 2.2 Standard Normal Calculations.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Density Curves & Normal Distributions Textbook Section 2.2.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
The Normal Model Chapter 6 Density Curves and Normal Distributions.
The Normal Distributions.  1. Always plot your data ◦ Usually a histogram or stemplot  2. Look for the overall pattern ◦ Shape, center, spread, deviations.
Normal Distributions Overview. 2 Introduction So far we two types of tools for describing distributions…graphical and numerical. We also have a strategy.
Chapter 2 The Normal Distributions. Section 2.1 Density curves and the normal distributions.
2.2 Normal Distributions
Chapter 4: The Normal Distribution
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 6 The Normal Curve.
CHAPTER 3: The Normal Distributions
Density Curves and Normal Distribution
CHAPTER 2 Modeling Distributions of Data
2.1 Density Curve and the Normal Distributions
CHAPTER 3: The Normal Distributions
CHAPTER 2 Modeling Distributions of Data
2.1 Density Curves and the Normal Distributions
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 3: The Normal Distributions
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 3: The Normal Distributions
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

1 Normal Probability Distributions

2 Review relative frequency histogram 1/10 2/10 4/10 2/10 1/10 Values of a variable, say test scores In this example 10 people took a test. The height of each bar is the relative frequency or percentage of those in that range of scores. What % of people had test scores between 70 and 80? 40% What % of people had scores less than 70? 30% If you add up all the fractions what do you get? 1

3 My example on the previous slide is about test scores. Test score is a quantitative variable and the authors suggest that to describe the distribution 1) Plot the data and/or make some sort of graph, 2) Look for the overall pattern – shape, center, and spread, and 3) Calculate a measure of center and spread. The authors suggest that the overall pattern of a large number of observations can be described by a smooth curve called a density curve. A density curve is a mathematical model for a distribution. We use the density curve when we have a real world data pattern that is reasonably enough like the model.

4 A density curve has the following properties: 1) Is always on or above the horizontal axis, and 2) Has area exactly 1 underneath it. A density curve describes the overall pattern of a distribution. The area under the curve and above any range of values is the proportion of all observations that fall in that range. Back on slide 2 here we do not have a density curve because it is not smooth. But if we smoothed it out it would have the properties we have here. Imagining what we have on slide 2 is smooth, what proportion of values are between 60 and 90? The answer is.8. Note that here that would mean 8 of the 10 people who took the test had a score between 60 and 90.

5 The Normal Distributions - Basic idea u The normal distribution is a tool we use to try to convey the same information as we get from a relative frequency histogram. u The normal distribution has been used a lot in statistics and we will use it later, so we will look at some details about it. u But, first let’s look at circles - yes I mean circles!

6 circles and density u Imagine you are at the intersection by Dairy Queen. Now imagine a large circle is placed on the earth such that the center of the circle is at the intersection plus enough houses have been included so 1,000,000 people live in the circle. u New York City has a similar intersection and circle, except the circle is smaller (WHY?).

7 circles and density u The New York circle is smaller because you travel a shorter distance from the center to get an equal density of people in New York. u The smaller circle in a sense has a smaller standard deviation(actually it has a smaller radius) - the distance is less spread out. u One thing similar about the two circles is that you can divide them both up into quarters. Let’s do this on the next screen with two circles

8 circles and density A a 25% of the area is in A on the large circle and 25% of the area of the small circle is in part a. How can they both be 25%? It is 25 % of its own total. There are as many different normal distributions as there are circles. BUT, normal distributions are divided up, not into quarters, but in another way.

9 The Normal Distributions - normal dist. and density u Normal distributions can roughly be drawn by modifying a circle. flip this part out to left flip this part out to right like this

10 The Normal Distributions - normal dist. and density u Let’s label parts of the normal distributions. number line for the variable- like test score This is the center of the distribution. It really is the mean value. This point is where the bottom part of the circle flipped. Let’s call it the inflection point. There is one on the other side as well. This point on the number line is directly below the inflection point. It turns out that the point on the number line is one standard deviation away from the center.

11 On the previous screen we see a graph of a normal distribution. Let’s consider an example to highlight some points. Say a company has developed a new tire for cars. In testing the tire it has been determined that the mean tire mileage is 36,500 miles and the standard deviation is 5000 miles. Along the horizontal axis we measure tire mileage. The normal distribution rises above the axis. Note the highest point of the curve occurs above the mean - in our tire example we would be at 36,500. On the curve we have two inflection points, and these occur 1 standard deviation away from the mean. So, mileages 31,500 and 41,500 are 1 standard deviation for the mean and the inflection points occur above them.

12 The Normal Distributions - notation  In general, now, we will talk about a variable having a normal distribution. We will say variable X is normally distributed with mean mu and standard deviation sigma.  More simply, we say X is N( mu, sigma ). u Don’t let the N(---) part fool you, it means N(mean value listed first, then standard deviation value listed).

13 The Normal Distributions - example with graphical thinking u Say we have a variable X is N(3, 1) X is measured on the line 3 3 is the mean 2 4 Use the dots as your guide to draw the normal dist. Why is this dot, and the one across, above #’s 2 and 4?

14 The Normal Distributions - another example with graphical thinking u Say we have a variable X is N(3, 2) X is measured on the line 3 3 is the mean 2 4 Use the dots as your guide to draw the normal dist. Why is this dot, and the one across, above #’s 1 and 5? 1 5

15 The Normal Distributions - compare the two examples u here is what the two examples look like, one on top of the other X is N(3, 1) X is N(3, 2)

16 The Normal Distributions - compare the two examples u Note on the previous screen how the X is N(3, 2) had its inflection points wider than on the X is N(3, 1). u Remember how we labeled the quarters of the different circles A and a. We said there was 25% of the circle in both A and a, but based on its own total. u Normal dist.’s have a similar rule. 68% of the area under the curve is between the two inflection points. There is more.

17 The Normal Distributions rule u On any normal distribution the inflection points will be 1 standard deviation on either side of the mean. 68% of the area under the curve will be within this one standard dev. u By moving out 2 standard deviations on either side of the mean you have about 95% of the area under the curve. u By moving out 3 stand. dev.’s you have 99.7 % of the area under the curve.

18 The Normal Distributions rule u What is the meaning of the phrase, “1 standard dev. on either side of the mean?” u The answer may best seen by an example. X is (10, 2.5) means X is normal with mean 10 and standard deviation 2.5. Thus 7.5 is 1 stand. dev. on the low side of the mean and 12.5 is 1 stand. dev. on the high side of the mean. Thus being anywhere from 7.5 to 12.5 is within 1 standard deviation of the mean.

19 Note about normal distribution: 1. There are many normal distributions, each characterized by a mean value and a standard deviation. 2. The high point of the curve is above the mean and for a normal distribution the mean = median = mode. 3. Depending on the variable, the mean can be negative, zero, or positive. 4. The normal curve is symmetric. This means each side is a mirror image of itself. 5. Larger standard deviations result in a flatter, wider distribution. 6. Probabilities for the variable are found from areas under the curve - the 68, 95, 99.7 rule is an example of this.

20 miles 26,500 31,500 36,500 41,500 46, z Note the concept of a z score for a value for a variable. Using the tire example from a few slides back: z = ( a value minus the mean)/standard deviation. So the value 26,500 has a z = (26, ,500)/5000 = -2. This means 26,500 is 2 standard deviations below the mean. You can check the other values.

21 So, from the previous slide we see that approximately 68% of all the tires will last between 31,500 and 41,500 miles. Approximately 95% of the tires will last between 26,500 and 46,500 miles. Approximately 99.7% of the tires will last between 21,500 and 51,500 miles. This approximation rule is useful to make a quick set of calculations that are roughly correct about a distribution that is normal. Sometimes we want to make more accurate statements about a normal distribution. We turn to that next.

22 The standard normal distribution Remember how we said there are many different circles and many different normal distribution? Sure you do. The z value translates any normally distributed variable into what is called the standard normal variable. Technically the picture I have on the previous screen is misleading because the z’s are a different scale than the miles, but don’t worry. In the book there is a table with z values and areas under the curve. Let’s see how to use the table. Here is one place where I want you to be extra careful when you calculate z. Round z to 2 decimal places. In general the z value can be thought of as a.bc. To use the table we break up a.bc into a.b and.0c For example the number 2.13 is broken up into 2.1 and.03

23 Using the standard normal table The z = 2.13 means we should go down the table to 2.1 and then over to.03. The number in the table is This means the probability of getting a value less than z = 2.13 is In the tire example if we look at the mean value 36,500, we see the z = (36, ,500)/5000 = 0.00 and in the table we see the value Thus, there is a.5 probability the tire mileage will be less than 36,500. So the table has the area under the curve to the left of the value of interest. We may want other z’s and other areas. What do we do?

24 Say we want the area to the right of a z that is greater than 0? The table has the area to the left. Whatever the z is, go into the table and get the area and then take 1 minus the area in the table. a b The z here would be negative. Say we want area b. Area a is in the table and b is 1 minus area a. Area b would be found in a similar way to what is above.

25 Back in the old days when I had to walk to school uphill both ways in three feet of snow, the standard normal table was all we had to calculate probabilities for a normal distribution. Now we have Microsoft Excel to make the calculations. This is not a class in Excel – but I show you if you like Excel. The NORMSDIST function assumes we have a z value and we want to find the area to the left of the z - the area to the left is the cumulative probability. The function has the form =NORMSDIST(z), where z is the value we have. z can be negative in Excel. The NORMDIST function allows us to just work with the variable without getting the z and we can still have the cumulative probability. The function has the form =NORMDIST(value, mean, standard deviation, TRUE). This is an innovation of Excel over the old days.

26 Sometimes we may have an area and want to know the z. The function NORMSINV asks us to give an area to the left of a value and the function will give us the z value. The form of the function is =NORMSINV(cumulative probability). The function NORMINV does the same, except not in z value form. It just give the value in the same form as the variable. The form of the function is =NORMINV(cumulative prob, mean, standard deviation)