Chapter 6 Continuous Distributions The Gaussian (Normal) Distribution.

Slides:



Advertisements
Similar presentations
Theoretical Probability Distributions We have talked about the idea of frequency distributions as a way to see what is happening with our data. We have.
Advertisements

Note 7 of 5E Statistics with Economics and Business Applications Chapter 5 The Normal and Other Continuous Probability Distributions Normal Probability.
Continuous Probability Distributions.  Experiments can lead to continuous responses i.e. values that do not have to be whole numbers. For example: height.
Continuous Random Variables & The Normal Probability Distribution
Chapter 6 Continuous Random Variables and Probability Distributions
Chapter 6 The Normal Distribution and Other Continuous Distributions
Ch. 6 The Normal Distribution
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Chapter 5 Continuous Random Variables and Probability Distributions
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Chapter 6 ~ Normal Probability Distributions
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
Chapter 6: Normal Probability Distributions
Chapter 4 Continuous Random Variables and Probability Distributions
Chapter 7: The Normal Probability Distribution
Chapter 6 The Normal Probability Distribution
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
8.5 Normal Distributions We have seen that the histogram for a binomial distribution with n = 20 trials and p = 0.50 was shaped like a bell if we join.
Chapter 6: Probability Distributions
Section 7.1 The STANDARD NORMAL CURVE
PROBABILITY & STATISTICAL INFERENCE LECTURE 3 MSc in Computing (Data Analytics)
Overview 6.1 Discrete Random Variables
1 Normal Random Variables In the class of continuous random variables, we are primarily interested in NORMAL random variables. In the class of continuous.
Theory of Probability Statistics for Business and Economics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 5-1 Introduction to Statistics Chapter 6 Continuous Probability Distributions.
INTEGRALS Areas and Distances INTEGRALS In this section, we will learn that: We get the same special type of limit in trying to find the area under.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
6-2: STANDARD NORMAL AND UNIFORM DISTRIBUTIONS. IMPORTANT CHANGE Last chapter, we dealt with discrete probability distributions. This chapter we will.
Continuous Random Variables Continuous Random Variables Chapter 6.
Chapter 7 Continuous Distributions. Continuous random variables Are numerical variables whose values fall within a range or interval Are measurements.
1 Chapter 5 Continuous Random Variables. 2 Table of Contents 5.1 Continuous Probability Distributions 5.2 The Uniform Distribution 5.3 The Normal Distribution.
The Normal Probability Distribution
Applied Quantitative Analysis and Practices LECTURE#11 By Dr. Osman Sadiq Paracha.
Chapter 6 Normal Probability Distribution Lecture 1 Sections: 6.1 – 6.2.
CHAPTER 3: The Normal Distributions
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Random Variables Presentation 6.. Random Variables A random variable assigns a number (or symbol) to each outcome of a random circumstance. A random variable.
Continuous Probability Distributions Statistics for Management and Economics Chapter 8.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter The Normal Probability Distribution 7.
Continuous Probability Distributions. A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
6-2: STANDARD NORMAL AND UNIFORM DISTRIBUTIONS. IMPORTANT CHANGE Last chapter, we dealt with discrete probability distributions. This chapter we will.
CY1B2 Statistics1 (ii) Poisson distribution The Poisson distribution resembles the binomial distribution if the probability of an accident is very small.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Continuous Random Variables Chapter 6.
Basic Business Statistics
Chapter 5 Continuous Distributions The Gaussian (Normal) Distribution.
Chapter 6 The Normal Distribution.  The Normal Distribution  The Standard Normal Distribution  Applications of Normal Distributions  Sampling Distributions.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 6-1 The Normal Distribution.
1 ES Chapter 3 ~ Normal Probability Distributions.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 6-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
Chap 6-1 Chapter 6 The Normal Distribution Statistics for Managers.
THE NORMAL DISTRIBUTION
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics, A First Course 4 th.
Theoretical distributions: the Normal distribution.
Yandell – Econ 216 Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Continuous Distributions
Continuous Random Variables
Properties of the Normal Distribution
The Normal Probability Distribution
CONTINUOUS RANDOM VARIABLES AND THE NORMAL DISTRIBUTION
THE NORMAL DISTRIBUTION
Statistics for Managers Using Microsoft® Excel 5th Edition
Chapter 5 Continuous Random Variables and Probability Distributions
Normal Probability Distribution Lecture 1 Sections: 6.1 – 6.2
Presentation transcript:

Chapter 6 Continuous Distributions The Gaussian (Normal) Distribution

When Discrete Distributions Aren’t Enough Discrete distributions are used in situations involving counts. (Others are possible but this is the vast majority.) What happens when you want to measure things? –Height –Weight –Miles per Gallon These aren’t counts. (Why not?) Measurements involve rounding and precision. When any level of precision is theoretically possible, we call this a “continuous” variable. The values come from the set of Real Numbers, ie, the number line.

Real Numbers   -∞ … … ∞ The Real Numbers include all possible values between the pictured integers. That includes rational numbers like ½, 1/3, 237/573, etc. It also includes irrational numbers like π and √2. Real numbers have an infinite string of decimal places. There are “uncountably many” real numbers between any two specified real numbers.

Intervals An interval is a “piece” of the number line, or a subset of the Real numbers. There are no “gaps.” For any two numbers in it, all real numbers between them are included. Therefore an interval is described by its endpoints—with a few special considerations. The endpoints may or may not be included. Round brackets are used to exclude the endpoints, square brackets to include them. Write in order. Ex.: [0,1], (9, 100), [3,6), (0,7]. If an interval goes on to infinity, the ∞ or -∞ symbol is used with a round bracket, since infinity is not a number. Ex.: [0,∞), (-∞,-10).

Definition of Continuous R.V. A continuous random variable takes on values in some Real Interval.   -∞ … … ∞ Suppose a r.v. X takes values in [0,1]. How many different values are there? Suppose you assign some tiny probability to each Real Number in [0,1]. What is the total probability? Suppose you divide [0,1] up into 10 subintervals. Can you assign probabilities to these so the total is 1?

Definition of Continuous R.V. This illustrates the problem with assigning probabilities to individual numbers, and the contrasting ease of assigning probability to intervals. Summary: –Any continuous distribution has infinitely many values. –No single point has a positive probability. –Said another way: Every individual value of a continuous random variable has probability zero, and as such is an impossible event. –Intervals can be assigned positive probability.

The Paradox Obviously, a r.v., X, must take on some value, and if it does, that value is not impossible (it has P>0). We never actually “mean” a single value. Measurements are given with a certain precision. Example: temperature is continuous, but measured to the nearest degree, “70” really means the interval [69.5,70.5). Intervals can have positive probability, and we can make them as small as we like. The fact that a continuous r.v. cannot take a single value agrees nicely with the fact that it is impossible to measure anything to the exact real number value. Instead, we divide up our scale using equal-width subintervals based on the precision of the measuring device. These subintervals have positive probability.

Continuous Probabilities Probabilities for a continuous random variable, X, are given by a probability function, P. P(X=k)=0 for any k. We might find positive probabilities for expressions like –P(X>k),Note: the interval is (k,∞) –P(X<k), orNote: the interval is (-∞,k) –P(a<X<b).Note: the interval is (a,b) A formula that gives probabilities for X would need to give probabilities for intervals, rather than single values.

Has anything prepared us for this? Tables of probability for discrete r.v.’s? Not if only individual values were given. Ungrouped histograms? No, same. Grouped histograms? Let’s see…. Each bar represents a frequency for an interval, even though this is a discrete example.

What about relative histograms? Look at the histogram for the number of three’s showing in a two-dice toss. Notice it shows the probabilities for 3 discrete values. Replace the discrete values with intervals, [0,1), [1,2), and [2,3). Then this histogram looks like it belongs to a continuous distribution with values in [0,3).

Making the Leap Change the horizontal axis to show that the bars belong to each interval. Each bar is 1 unit wide and its height represents the probability for that interval. Each bar is a rectangle, whose area is 1 x height. Since the heights add up to 1, the total area of the shaded region is 1. Make the transition to the continuous case: Instead of representing probability by height, use area.

What did we leap over? This has been more of an analogy than an explanation. Many details that require calculus are glossed over. The problem: can’t represent probabilities by height at a point, because points all have probability zero. Solution: switch to areas, where the bottom boundary (on the x axis) represents an interval for which we want to determine probability. The area of the graph above that interval represents its probability. In calculus, these areas are called “definite integrals.” You don’t really need to know that, but you may come across the following symbol, which means “the integral from a to b.”

Uniform Distribution A uniform distribution is defined for an interval outside of which there is no positive probability. (This is to prevent the area from being infinite.) Inside that interval, it has the same probabilities for any sub-interval of a given size (they are “always the same”). A uniform distribution on the interval [0,3] is shown here. Note that the height is 1/3, because 3 x 1/3 = 1. However, we should not say that 1/3 is the probability of anything in particular.

Uniform Examples Let X be a uniform r.v. on the interval [1,5]. Find P(X>3), P(X<5), P(2<X<3), and P(0<X<3). Solution: The width of the distribution is 4, so the height of the graph is ¼ between 1 and 5. The area for any interval will be ¼ x the width of the interval. –P(X>3)=(5-3)/4=1/2 –P(X<5)=(5-1)/4=1 –P(2<X<3)=(3-2)/4=1/4 –Careful! P(0<X<3)=P(1<X<3)=(3-1)/4=1/2

Uniform Examples Let X be a uniform r.v. on the interval [1,5]. Find P(X>3), P(X<5), P(2<X<3), and P(0<X<3). Solution: The width of the distribution is 4, so the height of the graph is ¼ between 1 and 5. The area for any interval will be ¼ x the width of the interval. –P(X>3)=(5-3)/4=1/2 –P(X<5)=(5-1)/4=1 –P(2<X<3)=(3-2)/4=1/4 –Careful! P(0<X<3)=P(1<X<3)=(3-1)/4=1/2

More Uniform Examples Let X be a uniform r.v. on the interval [0,8]. –Find P(X>3), P(X<5), and P(2<X<3 or 7<X<8). –Find the median and the 90 th percentile. Solution: The width of the distribution is 8, so the height of the graph is 1/8. –P(X>3)=(8-3)/8=5/8. –P(X<5)=(5-0)/8=5/8. –P(2<X<3 or 7<X<8)=P(2<X<3)+P(7<X<8)=1/4. –The median must have half the probability above it and half below. Therefore the median is 4. –P 90 is a number such that 90% of the probability is below it, so we have (P 90 -0)/8=.9, so P 90 =7.2.

Probability Density Function We have been dealing with the uniform distribution in terms of graphs. Before moving on, we need to put these ideas into the form of mathematical notation. We were focusing on the areas of portions of a graph like the one below. But how do we define the region we want the area for? –The bottom boundary is the x axis –The sides are vertical lines going through the x values we want –The top of the region is a special “curve” (straight lines are curves too). This curve is defined by a function, called the probability density function, or pdf. For our graph, it is:

Normal Probability Distributions The normal probability distribution (Gaussian Distribution) is the most important distribution in all of statistics. Many continuous random variables have normal or approximately normal distributions. A normal distribution is defined by its pdf.

The Normal pdf The parameters are μ and σ. The mean of the distribution is μ. The standard deviation is σ. The median and mode are also μ. There is a normal distribution for every combination of values of μ and σ

Basic Shape Here we see the basic shape of a normal distribution. The blue band is an example of an “area under the curve” that we might want to calculate. This particular distribution has μ=110 and σ=10. The “x” axis represents values of the r.v. X.

Effect of Changing μ Changing μ just causes a horizontal shift, centering the graph in a different place.

Effect of Changing σ Changing σ causes the graph to stretch out or squeeze together around the mean.

What does this mean? The normal pdf is a complicated formula. It is not easy to calculate probabilities from it, even if you know calculus. So, we use tables (or computers). We can’t have a table for every possible normal distribution. We have one table for the “standard” normal distribution, which has μ =0 and σ=1. This r.v. is called Z. It is easy to convert proba- bility statements from other normal distributions to Z.

Table 3, Appendix B entries: The table contains the area under the standard normal curve between 0 and a specific value of z.

Example: Find the area under the standard normal curve between z = 0 and z = A portion of Table 3:

Example: Find the area under the normal curve to the right of Z = 1.45; P(Z > 1.45). Area asked for

Example: Find the area to the left of Z = 1.45; P(Z < 1.45).

Example: Find the area between Z =  1.26 and the mean (Z = 0). Area asked for Area from table

Example: Find the area to the left of .98; P(Z < .98). Area asked for Area from table Same as area asked for

Applications of Normal Distributions Apply the techniques learned for the Z distribution to all normal distributions. Start with a probability question in terms of x-values. Convert, or transform, the question into an equivalent probability statement involving z-values.

Standardization Suppose X is a normal r.v. with mean  and standard deviation . The r.v. has a standard normal distribution.

Example: A bottling machine is adjusted to fill bottles with a mean of 32.0 oz of soda and standard deviation of Assume the amount of fill is normally distributed and a bottle is selected at random. 1.Find the probability the bottle contains between 32 oz and oz. 2. Find the probability the bottle contains more than oz.

Other Normal Applications Find a cutoff point: a value of X such that there is a certain probability in a specified interval defined by x. Example: The waiting time X at a certain bank is approximately normally distributed with a mean of 3.7 minutes and a standard deviation of 1.4 minutes. The bank would like to claim that 95% of all customers are waited on by a teller within c minutes. Find the value of c that makes this statement true.

Solution:

Notation: If X is a normal random variable with mean  and standard deviation , this is often denoted: X ~ N( ,   ). Example: Suppose X is a normal random variable with  = 35 and  = 6. A convenient notation to identify this random variable is: x ~ N(35, 36). z(  ) and z   are commonly used notations for the z- score (point on the z axis) such that there is  of the area (probability) to the right of z(  ) or z  .

Illustrations: z(0.10) represents the value of Z such that the area to the right under the standard normal curve is 0.10 z(0.80) represents the value of Z such that the area to the right under the standard normal curve is 0.80

Example: Find the numerical value of z(0.10). Use Table 3: look for an area as close as possible to z(0.10) = (area information from notation) Table shows this area (0.4000)

Note: The values of Z that will be used regularly come from one of the following situations: 1.The z-score such that there is a specified area in one tail of the normal distribution. 2.The z-scores that bound a specified middle proportion of the normal distribution.

Example: Find the z-scores that bound the middle 0.99 of the normal distribution. Use Table 3: