The Normal Distribution
Normal and Skewed Distributions
Normal Distribution Definition: A continuous, symmetric, bell-shaped distribution of a variable. The shape and position of the curve depend on 2 variables. ( The mean and the standard deviation) The larger the deviation the more dispersed, or spread out, the distribution is. The area under the curve is more important than the frequencies themselves. When pictured the y-axis is usually omitted. Z-value: The number of standard deviations that a particular X value is away from the mean.
Other Properties of Normal Distribution 1. The normal distribution curve is unimodal. 2. Theoretically, no matter how far it extends, it never touches the x-axis. It just gets increasingly closer. 3. The total area, under the curve, is equal to 1.00 or 100%. 4. The area under the curve follows the Empirical Rule. One deviation about 68% of the curves area. Two deviations about 95% of curves area. Finally 3 deviations about 99.7% of curves area.
Standard Normal Distribution Differs slightly from normal distribution in the fact that its’ mean value is 0 and the standard deviation value is 1.
Finding Areas Under the Curve There are 7 basic types of problems. 1. Between 0 and any z-value. 2. In any tail of the curve. 3. Between two z-values on the same side of the mean. 4. Between z-values on opposite sides of the mean. 5. Less than any z-value to the right of the mean. 6. Greater than any z-value to the left of the mean 7. In any two tails of the curve.
How to Solve for the Area Step 1: Draw the picture of the curve. Step 2: Shade the desired area. Step 3: Depending on the area shaded, you may either do nothing, add, or even subtract appropriate z-values from Table E.
Examples #1: Find the area to the left of z = 1.25 #2: Find the area between z = 0 and z = #3: Find the area between z =.85 and z = 1.7 #4: Find the area between z = -.54 and z = 2 #5: Find the area to the left of z = and to the right of z = 3.04
Find the Z May need to find the specific z-value for a given area under the normal distribution. Work backwards: Step 1: Find the area in Table E. Step 2: Read the correct z-value in the left column and in the top row and add two values together. Example 1: Find the z-value that corresponds to the area Example 2: Find the z-value that corresponds to the area
Other Applications The standard normal distribution curve can be used to solve a wide variety of problems. Including probabilities and finding specific data values for given percentages. Only requirement is that the variable be normally or approximately distributed. To solve problems we need to transform the original variable into a standard normal distribution variable and then use Table E to solve the problems. To determine z-value we use the same formula from Chapter 3. z = ( value – mean ) ÷ sd.
Examples #1 The avg. hourly wage of Detroit car workers is $ If standard deviation is $1.80 find the probabilities for a randomly selected worker. Assume variable is normally distributed. P(The worker earns more than $15.60) P(The worker earns less than $9.00) #2 The avg. age of CEOs is 62 years. If standard deviation is 3 years find the probability for randomly selected CEOs will fall in the following ranges. Assume variable is normally distributed. Between 64 and 70 years old. Between 57 and 62 years old.
Steps to finding specific data values
Examples The scores on a test has a mean of 90 and standard deviation of 7. If a teacher wishes to select the top 30% of students who took the test find the cutoff score. Assume variable is normally distributed. A pet shop owner decides to sell snakes that appeal to the middle 50% of customers. The mean for the snakes is $37.90 and the standard deviation is $5.60. Find the minimum and maximum prices of the snakes the owner should sell. Assume variable is normally distributed.
Distribution of Sample Means Along with knowing how data values vary about a mean for a population, statisticians are also interested in knowing about the distribution of the means of samples taken. Sampling distribution of sample means: A distribution obtained by using the means computed from random samples of a specific size taken from a population. If randomly selected, the sample means, for most part are somewhat different from the population mean. These differences are caused by sampling error. Sampling Error: The difference between the sample measure and the corresponding population measure due to the fact sample is not a perfect representation of the population
Properties of the Distribution of Sample Means If all possible samples of a specific size are selected from a population, the distribution of the sample means has 2 important properties. #1: The mean of the sample means will be the same as the population mean. #2: The standard deviation of the sample means will be smaller than the standard deviation of the population. It will be equal to the population deviation divided by the square root of the sample size. Standard deviation of the sample means is called the standard error of the mean.
Central Limit Theorem
Examples The avg. age of teachers is 46 years, with a standard deviation of 6 years. If a school employs 65 teachers, find the probability that the average age of 7 randomly selected teachers is greater than 50 years old. Assume the variable is normally distributed. The mean weight of 13 year old males is 108 pounds and the standard deviation is 11. If a sample of 35 males is selected find the probability that the mean of the sample will be less than 104 pounds. Assume the variable is normally distributed.
The Correction Factor
The Correction Factor Formulas
Examples #1: A study of 100 renters showed that the average rent of their apartments was $1,200 and the standard deviation was $150. If 28 renters are selected, find the probability the average rent of their apartments was greater than $1,450. #2: The average price of a new suit at Macy’s is $275 and a standard deviation of $15. If 15 suits are sold from a lot of 40, find the probability that the average price will be less than $235?
The Normal Approximation to the Binomial Distribution The normal distribution is used to solve problems that involve the binomial distribution due to the fact that when n is rather large the calculations get to difficult to do by hand. When p is close to 0.5 and as n increases the binomial distribution becomes similar to the normal distribution. When p is close to 0 or 1 and n is smaller the normal approximation becomes inaccurate. Due to the fact above the normal approximation should only be used when n ● p and n ● q are both greater than 5.
Formulas for Mean and Standard Deviation of Binomial Distribution
Steps for Normal Approximation to the Binomial Distribution Step 1: Check to see whether normal approximation can be used. Step 2: Find the mean and standard deviation. Step 3: Write the problem in probability notation. Step 4: Rewrite the problem using the continuity correction factor, and show corresponding area under the curve. Step 5: Find the corresponding z-values. Step 6: Find solution. Correction factor: Employed when a continuous distribution is used to approximate a discrete distribution. Works like lower and upper boundaries.
Examples Of all 5 to 6 year-old children, in NJ, 87% are enrolled in school. If a sample of 300 such children are randomly selected, find the probability that at least 247 will be enrolled in school. A survey found that 17% of teenage drivers text while driving. If 400 drivers are selected at random, find the probability that exactly 65 say they text while driving.
Examples Use the normal approximation to the binomial to find the probabilities for specific value/s of X. #1: n = 25, p =.65, X = 12 #2: n = 80, p =.4, X = 34 #3: n = 30, p =.5 X ≤ 8