1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint.

Slides:



Advertisements
Similar presentations
Let X 1, X 2,..., X n be a set of independent random variables having a common distribution, and let E[ X i ] = . then, with probability 1 Strong law.
Advertisements

Distributions of sampling statistics Chapter 6 Sample mean & sample variance.
Joint Probability Distributions and Random Samples
Sampling Distributions
Sampling Distributions (§ )
THE CENTRAL LIMIT THEOREM The “World is Normal” Theorem.
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide
Chapter 18 Sampling Distribution Models
Chapter 7 Introduction to Sampling Distributions
© 2010 Pearson Prentice Hall. All rights reserved Sampling Distributions and the Central Limit Theorem.
Sampling Distributions
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Chapter 7: Variation in repeated samples – Sampling distributions
Continuous Random Variables and Probability Distributions
7/2/2015 IENG 486 Statistical Quality & Process Control 1 IENG Lecture 05 Interpreting Variation Using Distributions.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Chapter Sampling Distributions 8 © 2010 Pearson Prentice Hall. All rights reserved.
Sampling Theory Determining the distribution of Sample statistics.
The Lognormal Distribution
Lecture II-2: Probability Review
Continuous Probability Distributions A continuous random variable can assume any value in an interval on the real line or in a collection of intervals.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Standard error of estimate & Confidence interval.
Sampling Distributions  A statistic is random in value … it changes from sample to sample.  The probability distribution of a statistic is called a sampling.
Copyright © 2014, 2013, 2010 and 2007 Pearson Education, Inc. Chapter Sampling Distributions 8.
Jointly Distributed Random Variables
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 8 Continuous.
AP Statistics Chapter 9 Notes.
Topic 4 - Continuous distributions
Probability theory 2 Tron Anders Moger September 13th 2006.
Copyright ©2011 Nelson Education Limited The Normal Probability Distribution CHAPTER 6.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
AP Statistics 9.3 Sample Means.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Lecture 15: Statistics and Their Distributions, Central Limit Theorem
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Objectives 1.Describe the distribution of the sample mean: samples from normal populations 2.Describe.
Discrete distribution word problems –Probabilities: specific values, >, =, … –Means, variances Computing normal probabilities and “inverse” values: –Pr(X
1 Topic 3 - Discrete distributions Basics of discrete distributions Mean and variance of a discrete distribution Binomial distribution Poisson distribution.
Statistics 300: Elementary Statistics Section 6-5.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Maz Jamilah Masnan Institute of Engineering Mathematics Semester I 2015/ Sampling Distribution of Mean and Proportion EQT271 ENGINEERING STATISTICS.
Chapter 8 Sampling Variability and Sampling Distributions.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Topic 5 - Joint distributions and the CLT
Review of Probability. Important Topics 1 Random Variables and Probability Distributions 2 Expected Values, Mean, and Variance 3 Two Random Variables.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
SESSION 37 & 38 Last Update 5 th May 2011 Continuous Probability Distributions.
Continuous Random Variables and Probability Distributions
1 Sampling distributions The probability distribution of a statistic is called a sampling distribution. : the sampling distribution of the mean.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
Chapter 8 Sampling Variability and Sampling Distributions.
Green Belt – SIX SIGMA OPERATIONAL Central Limit Theorem.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
WARM UP: Penny Sampling 1.) Take a look at the graphs that you made yesterday. What are some intuitive takeaways just from looking at the graphs?
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
Sampling and Sampling Distributions
Continuous Probability Distributions
Chapter 7 Sampling and Sampling Distributions
Chapter 6: Sampling Distributions
Elementary Statistics
Chapter 7: Sampling Distributions
Combining Random Variables
Determining the distribution of Sample statistics
Sampling Distribution Models
MATH 2311 Section 4.4.
Sampling Distribution of a Sample Proportion
Sampling Distributions
Sampling Distributions (§ )
Presentation transcript:

1 Topic 5 - Joint distributions and the CLT Joint distributions –Calculation of probabilities, mean and variance –Expectations of functions based on joint distributions Central Limit Theorem –Sampling distributions Of the mean Of totals

22 Often times, we are interested in more than one random variable at a time. For example, what is the probability that a car will have at least one engine problem and at least one blowout during the same week? X = # of engine problems in a week Y = # of blowouts in a week P ( X ≥ 1, Y ≥ 1) is what we are looking for To understand these sorts of probabilities, we need to develop joint distributions.

33 Discrete distributions A discrete joint probability mass function is given by f ( x, y ) = P( X = x, Y = y ) where

44 Return to the car example Consider the following joint pmf for X and Y P ( X ≥ 1, Y ≥ 1) = P( X ≥ 1) = E( X + Y ) = X\Y /21/161/32 11/161/32 2

55 Joint to marginals The probability mass functions for X and Y individually (called marginals) are given by Returning to the car example: f X ( x ) = f Y ( y ) = E ( X ) = E ( Y ) =

66 Continuous distributions A joint probability density function for two continuous random variables, ( X, Y ), has the following four properties:

77 Continuous example Consider the following joint pdf: Show condition 2 (total volume is 1) holds on your own. Show P(0 < X < 1, ¼ < Y < ½) = 23/512

88 Joint to marginals The marginal pdfs for X and Y can be found by For the previous example, find f X ( x ) and f Y ( y ).

99 Independence of X and Y The random variables X and Y are independent if – f ( x, y ) = f X ( x ) f Y ( y ) for all pairs ( x, y ). For the discrete clunker car example, are X and Y independent? For the continuous example, are X and Y independent?

10 Sampling distributions We assume that each data value we collect represents a random selection from a common population distribution. The collection of these independent random variables is called a random sample from the distribution. A statistic is a function of these random variables that is used to estimate some characteristic of the population distribution. The distribution of a statistic is called a sampling distribution. The sampling distribution is a key component to making inferences about the population.

Statistics used to infer parameters We take samples and calculate statistics to make inferences about the population parameters. SamplePopulation Mean Std. Dev. Variance Proportion 11

12 StatCrunch example StatCrunch subscriptions are sold for 6 months ($5) or 12 months ($8). From past data, I can tell you that roughly 80% of subscriptions are $5 and 20% are $8. Let X represent the amount in $ of a purchase. E( X ) = Var( X ) =

13 StatCrunch example continued Now consider the amounts of a random sample of two purchases, X 1, X 2. A natural statistic of interest is X 1 + X 2, the total amount of the purchases. Outcomes X 1 + X 2 Probability 5,5 5,8 8,5 8,8 X 1 + X 2 Probability

14 StatCrunch example continued E( X 1 + X 2 ) = E([ X 1 + X 2 ] 2 ) = Var( X 1 + X 2 ) =

15 StatCrunch example continued If I have n purchases in a day, what is –my expected earnings? –the variance of my earnings? –the shape of my earnings distribution for large n ? Let’s experiment by simulating 10,000 days with 100 purchases per day using StatCrunch.

16 Simulation instructions Data > Simulate data > Binomial Specify Rows to be 10000, Columns to be 1, n to be 100 and p to be.2. This will give you a new column called Binomial1 To compute the total for each day, go to Data > Transform data and enter the expression, 8*Binomial1+5*(100-Binomial1). This will add a new column to the data table. Make a histogram and set the bin width to 1 for best results. For the new sum column, do a histogram and a QQ plot. Both should verify normality! StatCrunch

17 Should result in a dataset like this

18 Central Limit Theorem We have just illustrated one of the most important theorems in statistics. As the sample size, n, becomes large the distribution of the sum of a random sample from a distribution with mean  and variance  2 converges to a Normal distribution with mean n  and variance n  2. A sample size of at least 30 is typically required to use the CLT ( arguable in the general statistics community). The amazing part of this theorem is that it is true regardless of the form of the underlying distribution.

19 Airplane example Suppose the weight of an airline passenger has a mean of 150 lbs. and a standard deviation of 25 lbs. What is the probability the combined weight of 100 passengers will exceed the maximum allowable weight of 15,500 lbs? How many passengers should be allowed on the plane if we want this probability to be at most 0.01?

What are the probabilities at n = 99? The mean is The variance is The standard deviation is 20

21 The distribution of the sample means For constant c, E( cY ) = c E( Y ) and Var( cY ) = c 2 Var( Y ) The CLT says that for large samples, is approximately normal with a mean of  and a variance of  2 / n. So, the variance of the sample mean decreases with n.

What are the probabilities we get a sample average at some level? If the parent population is assumed with a mean of 150 lbs. and a standard deviation of 25 lbs., what’s the probability we get a sample average below 141 with a sample size of 30? Talking about the sampling distribution, the mean is 150 and the standard deviation is 22

Sampling distribution applet In StatCrunch, go to the “Applets” tab and click on “sampling distributions”. It will demonstrate how any parent distribution will converge to normal with larger, repeated samples. The closer the parent is to symmetrical, the quicker the sampling distribution will converge. The additional file for Topic 5 has discussion and examples on both sampling distributions and joint probability distributions. There are also additional examples of double integration. 23