Normal Distribution Recall how we describe a distribution of data:

Slides:



Advertisements
Similar presentations
(Day 1).  So far, we have used histograms to represent the overall shape of a distribution. Now smooth curves can be used:
Advertisements

HS 67 - Intro Health Stat The Normal Distributions
Topic 3 The Normal Distribution. From Histogram to Density Curve 2 We used histogram in Topic 2 to describe the overall pattern (shape, center, and spread)
AP Statistics Section 2.1 B
CHAPTER 3: The Normal Distributions Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
Chapter 2: Density Curves and Normal Distributions
The Normal Distributions
Chapter 2: The Normal Distribution
Normal Distribution Recall how we describe a distribution of quantitative (continuous) data: –plot the data (stemplot or histogram) –look for the overall.
What We Know So Far… Data plots or graphs
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
3.3 Density Curves and Normal Distributions
Looking at Data - Distributions Density Curves and Normal Distributions IPS Chapter 1.3 © 2009 W.H. Freeman and Company.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. 1 PROBABILITIES FOR CONTINUOUS RANDOM VARIABLES THE NORMAL DISTRIBUTION CHAPTER 8_B.
The distribution of heights of adult American men is approximately normal with mean 69 inches and standard deviation 2.5 inches. Use the rule.
Stat 1510: Statistical Thinking and Concepts 1 Density Curves and Normal Distribution.
Chapter 2.2 STANDARD NORMAL DISTRIBUTIONS. Normal Distributions Last class we looked at a particular type of density curve called a Normal distribution.
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Do NOT glue (we’ll do that later)— simply.
NOTES The Normal Distribution. In earlier courses, you have explored data in the following ways: By plotting data (histogram, stemplot, bar graph, etc.)
CHAPTER 3: The Normal Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
+ Chapter 2: Modeling Distributions of Data Section 2.1 Describing Location in a Distribution The Practice of Statistics, 4 th edition - For AP* STARNES,
Chapter 6 The Normal Curve. A Density Curve is a curve that: *is always on or above the horizontal axis *has an area of exactly 1 underneath it *describes.
Density Curves and the Normal Distribution.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
CHAPTER 3: The Normal Distributions
2.1 Density Curves and the Normal Distribution.  Differentiate between a density curve and a histogram  Understand where mean and median lie on curves.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 6 Probability Distributions Section 6.2 Probabilities for Bell-Shaped Distributions.
Density Curves Section 2.1. Strategy to explore data on a single variable Plot the data (histogram or stemplot) CUSS Calculate numerical summary to describe.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 2 Modeling Distributions of Data 2.2 Density.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Section 2.1 Density Curves. Get out a coin and flip it 5 times. Count how many heads you get. Get out a coin and flip it 5 times. Count how many heads.
Ch 2 The Normal Distribution 2.1 Density Curves and the Normal Distribution 2.2 Standard Normal Calculations.
Chapter 3 The Normal Distributions. Chapter outline 1. Density curves 2. Normal distributions 3. The rule 4. The standard normal distribution.
Density Curves & Normal Distributions Textbook Section 2.2.
The Normal Distributions.  1. Always plot your data ◦ Usually a histogram or stemplot  2. Look for the overall pattern ◦ Shape, center, spread, deviations.
Chapter 2 The Normal Distributions. Section 2.1 Density curves and the normal distributions.
2.2 Normal Distributions
Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Do NOT glue (we’ll do that later)—simply type the data into List 1
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 6 The Normal Curve.
Good Afternoon! Agenda: Knight’s Charge-please wait for direction
CHAPTER 3: The Normal Distributions
Density Curves and Normal Distribution
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
2.1 Density Curve and the Normal Distributions
Chapter 2: Modeling Distributions of Data
Chapter 2 Data Analysis Section 2.2
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
CHAPTER 3: The Normal Distributions
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
CHAPTER 3: The Normal Distributions
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

Normal Distribution Recall how we describe a distribution of data: plot the data (stemplot or histogram) look for the overall pattern (shape, peaks, gaps) and departures from it (possible outliers) calculate appropriate numerical measures of center and spread (5-number summary and/or mean & s.d.) then we may ask "can the distribution be described by a specific model?" (one of the more common models for symmetric, single-peaked distributions is the normal distribution having a certain mean and standard deviation) can we imagine a density curve fitting fairly closely over the histogram of the data? a density curve is a curve that is always on or above the horizontal axis (>= 0) and whose total area under the curve is 1

An important property of a density curve is that areas under the curve correspond to relative frequencies - see figures below. rel. freq=287/947=.303 area = .293 Note the relative frequency of vocabulary scores <= 6 is roughly equal to the area under the density curve <= 6.

We can describe the shape, center and spread of a density curve in the same way we describe data… e.g., the median of a density curve is the “equal-areas” point - the point on the horizontal axis that divides the area under the density curve into two equal (.5 each) parts. The mean of the density curve is the balance point - the point on the horizontal axis where the curve would balance if it were made of a solid material.

For a normal density curve we see the characteristic “bell-shaped”, symmetric curve with single peak (at the mean value ) and spread out according to the standard deviation () x=seq(-6,6,.01) plot(x,dnorm(x),col="red") lines(x,dnorm(x,sd=2.5),col="black") dnorm(x) gives the value of the standard normal density curve at the point x - change the mean & sd as arguments…

The 68-95-99. 7 Rule describes the relationship between  and  The 68-95-99.7 Rule describes the relationship between  and . See Figure below -

If X is N(,), then Z is N(0,1), where Z=(X-)/. How many different normal curves are there? Ans: One for every combination of values of  and …but they all are alike except for their  and . So we take advantage of this and consider a process called standardization to reduce all normals to one we call the Standard Normal Distribution. Denote a normal distribution with mean  and standard deviation  by N(,). Let X correspond to the variable whose distribution is N(,). We may standardize any value of X by subtracting  and dividing by  - this re-writes any normal into a variable called Z whose values represent the number of standard deviations X is away from its mean. The standardized value is sometimes called a z-score. If X is N(,), then Z is N(0,1), where Z=(X-)/. We can find areas under Z from the Standard Normal Table (Table A in the Book), and these areas equal the corresponding areas under X. See the next example . . .

Do readings & practice before going on - now jump to the last slide! Consider this example: Let X=height (inches) of a young woman aged 18-24 years. Then X is ~N(64.5", 2.5"). What proportion of these women's heights are between 62" and 67"? What proportion are above 67"? Below 72"? What proportions of these women's heights are between 61" and 66"? NOTE: This cannot be solved by the 68-95-99.7 rule… What proportion are below 64.5"? Below 68"? What proportion are between 58" and 60"? Etc., etc., etc. …. What height represents the 90th percentile of this aged woman? All problems of this type are solvable by sketching the picture, standardizing, and doing appropriate arithmetic to get the final answer…the last question above is what I call a "backwards problem", since you're solving for an X value while knowing an area… Do readings & practice before going on - now jump to the last slide!

We’ve seen examples of data that seem to fit the normal model, and examples of data that don’t seem to fit … Because normality is an important property of data for specific types of analyses we’ll do later, it is important to be able to decide whether a dataset is normal or not. A histogram is one way but a better graphical method is through the normal quantile plot … We'll use R to draw a normal quantile plot it will allow us to assess the normality of our data in the following sense: if the data points fall along the straight line (and within the bands on the plot) then the data can be treated as normal. Systematic deviations from the line indicate non-normal distributions - outliers often appear as points far away from the pattern of the points... the y-intercept of the line corresponds to the mean of the normal distribution and the slope of the line corresponds to the standard deviation of the normal distribution

Normal quantile plot of co2 data Notice the systematic failure of the points to fall on the line, especially at the low end where the data is “piled up”. Also, note the outliers at the high end… Conclusion: Not normal

Normal quantile plot Hrs_Completed in our Stt Class data Notice that the data points follow the line fairly well in the middle, but the high hours are too high and the low hours are too low for what would be expected of a normal distribution. Conclusion: Not normal

Normal quantile plot of IQ data These IQ scores follow the line fairly well, except for the lowest ones, which are lower than what we would expect. Conclusion: Normal; mean ~ 110, sd ~ 10

Some on-line readings to help with the normal distribution: http://www.stat.psu.edu/~resources/ClassNotes/ljs_08/index.htm (select "View Lecture Notes") http://cnx.org/content/m16979/latest/ (start with Chapter 6, The Normal Distribution, and work through the Homework section. http://www.stat.ucla.edu/textbook/ (check out the readings here in Chapter V - there are also several good examples of how to do these normal computations) http://www-unix.oit.umass.edu/~biep540w/index.html (check out the link to the fifth chapter…) There are a total of 19 Homework problems given at the cnx.org site (#2) above. Make sure you can work all those problems. We'll have a quiz on the normal distribution soon…