Continuous Distributions

Slides:



Advertisements
Similar presentations
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Advertisements

Note 7 of 5E Statistics with Economics and Business Applications Chapter 5 The Normal and Other Continuous Probability Distributions Normal Probability.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Continuous random variable  Normal distribution Overview Figure 5-1 Formula 5-1 LAPTOP3: f(x) = 
Probability Densities
Chapter 6 Continuous Random Variables and Probability Distributions
Definitions Uniform Distribution is a probability distribution in which the continuous random variable values are spread evenly over the range of possibilities;
Chapter 6 The Normal Distribution and Other Continuous Distributions
3-1 Introduction Experiment Random Random experiment.
Chapter 5 Continuous Random Variables and Probability Distributions
Some Continuous Probability Distributions
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 4 Continuous Random Variables and Probability Distributions.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution Business Statistics: A First Course 5 th.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Chapter 4 Continuous Random Variables and Probability Distributions
Chapter 6 The Normal Probability Distribution
JMB Chapter 6 Lecture 3 EGR 252 Spring 2011 Slide 1 Continuous Probability Distributions Many continuous probability distributions, including: Uniform.
JMB Ch6 Lecture 3 revised 2 EGR 252 Fall 2011 Slide 1 Continuous Probability Distributions Many continuous probability distributions, including: Uniform.
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
Chapter 6: Probability Distributions
Continuous Probability Distributions  Continuous Random Variable  A random variable whose space (set of possible values) is an entire interval of numbers.
Ch5 Continuous Random Variables
Theory of Probability Statistics for Business and Economics.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Chapter 6. Continuous Random Variables Reminder: Continuous random variable.
JMB Ch6 Lecture2 Review EGR 252 Spring 2011 Slide 1 Continuous Probability Distributions Many continuous probability distributions, including: Uniform.
Introduction to Probability and Statistics Thirteenth Edition
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Continuous Random Variables.
Continuous probability distributions
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Normal distributions The most important continuous probability distribution in the entire filed of statistics is the normal distributions. All normal distributions.
Continuous Random Variables Continuous random variables can assume the infinitely many values corresponding to real numbers. Examples: lengths, masses.
MATB344 Applied Statistics Chapter 6 The Normal Probability Distribution.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Continuous Random Variables Chapter 6.
Basic Business Statistics
Introduction to Probability and Statistics Thirteenth Edition Chapter 6 The Normal Probability Distribution.
1 7.5 CONTINUOUS RANDOM VARIABLES Continuous data occur when the variable of interest can take on anyone of an infinite number of values over some interval.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 6-1 Chapter 6 The Normal Distribution and Other Continuous Distributions Basic Business.
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 6 Normal Probability Distributions 6-1 Overview 6-2 The Standard Normal Distribution 6-3 Applications.
President UniversityErwin SitompulPBST 9/1 Lecture 9 Probability and Statistics Dr.-Ing. Erwin Sitompul President University
Theoretical distributions: the Normal distribution.
Chapter 6 Continuous Random Variables Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
Chapter 6 The Normal Distribution and Other Continuous Distributions
Chapter 4 Applied Statistics and Probability for Engineers
Normal Probability Distributions
MATB344 Applied Statistics
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
4-1 Continuous Random Variables 4-2 Probability Distributions and Probability Density Functions Figure 4-1 Density function of a loading on a long,
Engineering Probability and Statistics - SE-205 -Chap 4
Normal Probability Distributions
Chapter 6. Continuous Random Variables
ENGR 201: Statistics for Engineers
Uniform and Normal Distributions
Keller: Stats for Mgmt & Econ, 7th Ed
Continuous Probability Distributions
Probability Theory and Specific Distributions (Moore Ch5 and Guan Ch6)
Continuous Probability Distributions Part 2
Continuous distributions
Continuous Probability Distributions Part 2
Continuous Probability Distributions Part 2
Introduction to Probability Distributions
Lecture 12: Normal Distribution
Introduction to Probability Distributions
Continuous Probability Distributions
Continuous Probability Distributions Part 2
Presentation transcript:

Continuous Distributions

Normal Distribution

Normal Distribution n(x; m, s) = Curve is completely specified by two parameters: mean m and standard deviation s. The area under the curve must stay = 1, so the larger s, the lower the curve. maximum occurs at m curve is symmetric around m inflections occur at m±s curve approaches horizontal axis asymptotically Reasonable approximation for many real-life experimental outcomes. heights, weights measurement errors reaction times Often good approximation to binomial and hypergeometric distributions (if p is not close to 0 or 1 and n is large  np and nq > 5). Good approximation to Poisson if m > 30.

Normal Probability

Normal Probability

Normal Distribution Probability that X is between a and b is given by In practice this is never calculated. Normal curve areas are tabulated, Table A3. In order to have ONE table for all m and s, transform X to Z: or x = sz + m, going back Z is a random variable with m = 0 and s2 = 1. This transforms the integral to: standard normal distribution -4 -2 2 4 Z point Standard Normal (z) curve Shaded Area = F (z) cumulative probability, P(Z  z) = (z)

Standard Normal Table P(Z1.25) = (1.25) = 0.894 -4 -2 2 4 P(0.25Z1.5) = P(Z1.5) – P(Z0.25) = (1.5) - (0.25) = 0.933 – 0.599 = 0.334 A standard normal table tabulates the probability of a random variable, Z, being less than or equal to some stated value These values have been calculated using the probability density function for the standard normal given as shown on slide #5 The probability is found from the chart by finding the intersection of the row corresponding to the first two digits of the stated value (shown in the leftmost column) and the column corresponding to the final digit of the stated value (shown in the topmost row). If the probability of a random variable being between two given values is of interest, the probability relating to the smaller value is subtracted from the probability related to the larger value. 0.25 1.5

Standard Normal Table P(Z1.25) = 1 - P(Z1.25) = 1 - 0.894 = 0.106 -4 -2 2 4 1.25 Since Symmetric, P(Z -1.0) = 1 – P(Z1.0) = 1 – 0.841 = 0.159 Since the area under any probability density function, including the standard normal, is equal to one; AND since the normal is a symmetric distribution, the probability of a random variable being greater than a stated value is equal to 1 minus the probability of the random variable being less than a stated value. Also since the standard normal probability density function is a symmetric distribution (and since most standard normal tables do not tabulate negative values), the probability of a random variable being less than a negative stated value is equal to the probability of the random variable being greater than the positive root of the negative stated value. -4 -1.0 -2 2 4 1.0

Normal Distribution Example 6.13 Walpole Average motor life m = 10 years with s = 2 years, lives follow normal distribution. Motors that fail within the warrantee period are replaced free. If you want to replace only 3% of the motors, how long should the warrantee be? Go to table and find 3%. This corresponds to z = -1.88 x = sz + m = 2(-1.88) + 10 = 10 - 3.76 = 6.24 years  7 year warrantee

Z Notation Often we want to know at what point the tail area to the right is equal to a, this is denoted by za P(Z  z) = 0.025 = 1 – P(Z  z) So P(Z  z) = 0.975 and z0.025 = 1.96 Z0.025 = 1.96 Shaded area = P(ZZ0.025) = 0.025 Interpretation: The probability of a random variable N(0,1) having a value greater than 1.96 is 2.5%

Normal Distribution 5th percentile = -1.645 95th percentile = 1.645 25th percentile = -0.675 75th percentile = 0.675 50th percentile = 0.00 Sometimes questions are asked in terms of percentiles. The percentiles are the points at which p% of the distribution is less than the point.

Normal Probability Plot Example: 10 independent measurement errors The Normal Probability Plot suggests that the standard normal distribution is a reasonable probability model for measurement error The aspect of interest in a probability plot is to take note of whether or not the data follow a somewhat straight line. If the data follow a somewhat straight line against the distribution being checked, the distributional parameters can be estimated by the straight line. In the case of a normal distribution, the intercept is an estimate of the mean and the slope is an estimate of the variance.

Normal Approximation to Binomial Normal with m = np and s2 = np(1-p) is a good approximation for the binomial, even if n is small if p is close to ½ This approximation is useful when n is large. (Note that binomial table A1 only goes to n = 20). The normal approximation to the binomial, if np>=5 and nq>=5

Normal Approximation to Binomial Example, Problem 6.36 Walpole Airlines sell more tickets than they have seats for a flight because 2% of customers who buy tickets do not show up for the flight. For a flight with 197 seats, 200 tickets were sold. What is the probability that the airline overbooked this flight? The procedure for getting the answer once you have set up the problem is simple, but this example illustrates why it’s important to set up the problem carefully. This problem provides the number of no-shows: P(no show) = 0.02 Should you use n = 200 or n = 197? Use n = 200 because they sold 200 tickets, and the no-show percentage pertains to the number of people who buy tickets (not the number of seats). One needs to figure out what is being asked: P(x > 3) or P (x < 3) ? The flight will be overbooked if there are fewer than 3 no-shows, so it’s the latter. I.e., x = 0, 1, or 2.

Normal Approximation to Binomial Example, Problem 6.36 Walpole Airlines sell more tickets than they have seats for a flight because 2% of customers who buy tickets do not show up for the flight. For a flight with 197 seats, 200 tickets were sold. What is the probability that the airline overbooked this flight? P(no show) = 0.02, n = 200 m = np = 4 s2 = npq = 0.02*200*0.98 = 3.92  s = 1.98 z = (x - m) / s = (2-4)/1.98 = -1.01 P(z < -1.01) = 0.1587, or a nearly 16% chance of an overbooked flight. This was done without the continuity correction. Was this a good idea?

Normal Approximation to Binomial = F3-F2 = NORMSDIST(E2) = (A2-4)/1.98 = BINOMDIST(A2,200,0.02,FALSE)

=(A3+0.5-4)/1.98

Normal Approximation to Binomial Example, Problem 6.36 Walpole Airlines sell more tickets than they have seats for a flight because 2% of customers who buy tickets do not show up for the flight. For a flight with 197 seats, 200 tickets were sold. What is the probability that the airline overbooked this flight? P(no show) = 0.02, n = 200 m = np = 4 s2 = npq = 0.02*200*0.98 = 3.92  s = 1.98 With the continuity correction: z = (x - m) / s = (2 + 0.5 - 4)/1.98 = -0.758 P(z < -0.76) = 0.2236, or a 22% chance of an overbooked flight.  When np or n(1-p) is small (< ~15), use the continuity correction. 

Normal Approximation to Binomial How likely is this number of fours? n = 47; x = 14; p = 0.205 # Heads # Times Binomial Binomdist Expected   0.000977 0.045898 1 0.009766 0.458984 2 5 0.043945 2.06543 3 0.117188 5.507813 4 14 0.205078 9.638672 7 0.246094 11.56641 6 10 8 9 Total 47 m = np = 47*0.20508 = 9.639 s2 = npq = 9.639*0.7949 = 7.662 s = 2.768 z = (x - 9.639)/2.768

Normal Approximation to Binomial P (13.5 < x < 14.5) = ? z1 = (14.5 - 9.639)/2.768 = 4.861/2.768 = 1.756 z2 = (13.5 - 9.639)/2.768 = 3.861/2.768 = 1.395 P (1.39 < z < 1.76) = 0.9608 – 0.9177 = 0.0431 This compares with P = 0.0408 using the binomial.

Chi-Squared Distribution

Chi Squared If X has a normal distribution with mean m and variance s2, then the variable Y = X2 has a chi-squared distribution with v = 1 degree of freedom. Corollary 7.1 If X1, X2, ... Xn are independent, random variables having normal distributions with means m1, m2, ... mn and variances s12, s22, ...sn2, , then the random variable Y = 1. also has a chi-squared distribution 2. with v = n degrees of freedom

chi squared distributions f(x;v) = A chi squared random variable is a continuous random variable that results from the squares or sum of squares of independent standard normal random variables. The chi squared distribution is specified by the single parameter v, where v is called the degrees of freedom. chi squared distributions Images thanks to Ewa Paszek , http://cnx.org/content/m13129/latest/

Chi Squared Distribution The chi-squared distribution is one of the most widely used distributions in statistical tests. The mean and variance of the chi-squared distribution are m = v and s2 = 2v. Values of are tabulated. is the c2 value above which there is an area of a. Small values of a indicate unlikely events.

unlikely If n = 5, v = 4 degrees of freedom

Gamma & Exponential Distributions

Gamma Distribution f(x) = a, b, x > 0 where G() = m = ab s2 = ab2 Analogous to the negative binomial. The time/space required for a specified number of Poisson events to occur a = # events b = 1/(the average number of events per unit time/area) = the mean time between events nth event? Bernoulli process negative binomial Poisson event gamma Study example 6.18

Gamma Distribution Gamma Poisson f(x) =  compare  BUT x = time a = # of events b = the mean time to/between events x = number of occurrences lt = average number of occurrences

Gamma Distribution f(x;a,b) = a, b, x > 0 Values are tabulated (Table A24), but as the incomplete gamma function, which is the cumulative function. So, integrate f(x;a,b) To convert to the incomplete gamma function, set x = by (y = x/b). This scales everything so that the function now only depends on a. Note that we still have x in the integral. If we want to know a survival time to x’, then we convert to x = x’/b and integrate y to that. Study example 6.19. Note that x is usually a time.

Exponential Distribution m = b s2 = b2 Gamma with a = 1. f(x;b) = b, x > 0 The time required for the first Poisson event to occur. Note that b = 1/l = mean time between events (failures) Example 6.17 in Walpole Component failure is described by exponential distribution with mean time to failure b = 5. What is the probability that at least 2 of these are still functioning after 8 years? First, find P(X > 8), using the exponential distribution. P(X > 8) = Then find the probability that 2 of the 5 are functioning, for which you go back to the binomial. P(X > 2) =

Exponential Distribution The exponential assumes that the system has no memory. P(X > t) = P(X > t0 + t | X > t0) In other words, it does not take into account wear. For that, you need the gamma or the Weibull. These two are often curve-fitting distributions, with the basic physics not considered. Note the wording in example 6.20 -- it is known from previous data that the number of complaints can be described by a gamma distribution with a = 2 and b = 4. Use the exponential for things like telephone calls, car arrivals, etc. where this assumption holds, or for failures that are due to random events and not wear.

Gamma and Exponential Distributions Time (space) until a specified number of Poisson events occur. Time (space) until the first of Poisson event occurs (or time between events)

Other Distributions

also called a stretched exponential Other Distributions Chi-Squared We will return to this later. Lognormal If the random variable Y has a normal distribution, and if Y = ln(X), then X has a lognormal distribution. Find statistics by converting to/from the normal distribution. Weibull Also used to describe times to failure. F(x) = also called a stretched exponential