Lecture 10. Random Sampling and Sampling Distributions David R. Merrell 90-786 Intermediate Empirical Methods for Public Policy and Management.

Slides:



Advertisements
Similar presentations
Probability Distribution
Advertisements

Chapter 6 Continuous Random Variables and Probability Distributions
Discrete Uniform Distribution
Chapter 5 Some Important Discrete Probability Distributions
Chapter 5 Discrete Random Variables and Probability Distributions
Note 6 of 5E Statistics with Economics and Business Applications Chapter 4 Useful Discrete Probability Distributions Binomial, Poisson and Hypergeometric.
ฟังก์ชั่นการแจกแจงความน่าจะเป็น แบบไม่ต่อเนื่อง Discrete Probability Distributions.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Statistics.
Modeling Process Quality
MATH408: Probability & Statistics Summer 1999 WEEK 4 Dr. Srinivas R. Chakravarthy Professor of Mathematics and Statistics Kettering University (GMI Engineering.
Lecture 8. Discrete Probability Distributions David R. Merrell Intermediate Empirical Methods for Public Policy and Management.
Chapter 4 Probability Distributions
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Overview Parameters and Statistics Probabilities The Binomial Probability Test.
T T07-01 Sample Size Effect – Normal Distribution Purpose Allows the analyst to analyze the effect that sample size has on a sampling distribution.
A random variable that has the following pmf is said to be a binomial random variable with parameters n, p The Binomial random variable.
Discrete and Continuous Distributions G. V. Narayanan.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Chapter 21 Random Variables Discrete: Bernoulli, Binomial, Geometric, Poisson Continuous: Uniform, Exponential, Gamma, Normal Expectation & Variance, Joint.
Problem A newly married couple plans to have four children and would like to have three girls and a boy. What are the chances (probability) their desire.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 4 and 5 Probability and Discrete Random Variables.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Discrete Random Variables Chapter 4.
Chap 5-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 5 Discrete Probability Distributions Business Statistics: A First.
1 If we can reduce our desire, then all worries that bother us will disappear.
Statistics for Engineer Week II and Week III: Random Variables and Probability Distribution.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
5.5 Distributions for Counts  Binomial Distributions for Sample Counts  Finding Binomial Probabilities  Binomial Mean and Standard Deviation  Binomial.
Binomial Distributions Calculating the Probability of Success.
Slide 1 Copyright © 2004 Pearson Education, Inc..
Guide to Using Minitab 14 For Basic Statistical Applications To Accompany Business Statistics: A Decision Making Approach, 8th Ed. Chapter 5: Introduction.
Introduction Discrete random variables take on only a finite or countable number of values. Three discrete probability distributions serve as models for.
Introduction to Probability and Statistics Thirteenth Edition Chapter 5 Several Useful Discrete Distributions.
MTH3003 PJJ SEM I 2015/2016.  ASSIGNMENT :25% Assignment 1 (10%) Assignment 2 (15%)  Mid exam :30% Part A (Objective) Part B (Subjective)  Final Exam:
PROBABILITY DISTRIBUTIONS
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Basic Business Statistics.
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
ENGR 610 Applied Statistics Fall Week 3 Marshall University CITE Jack Smith.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Continuous Random Variables.
BINOMIALDISTRIBUTION AND ITS APPLICATION. Binomial Distribution  The binomial probability density function –f(x) = n C x p x q n-x for x=0,1,2,3…,n for.
Biostatistics Class 3 Discrete Probability Distributions 2/8/2000.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 5 Discrete Random Variables.
Introduction to Probability and Statistics Thirteenth Edition Chapter 5 Several Useful Discrete Distributions.
AP Statistics Chapter 8 Notes. The Binomial Setting If you roll a die 20 times, how many times will you roll a 4? Will you always roll a 4 that many times?
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Since everything is a reflection of our minds, everything can be changed by our minds.
Lecture 9. Continuous Probability Distributions David R. Merrell Intermediate Empirical Methods for Public Policy and Management.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Module 5: Discrete Distributions
Chapter 31Introduction to Statistical Quality Control, 7th Edition by Douglas C. Montgomery. Copyright (c) 2012 John Wiley & Sons, Inc.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 5 Discrete Random Variables.
Construction Engineering 221 Probability and Statistics Binomial Distributions.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 5-1 Chapter 5 Some Important Discrete Probability Distributions Business Statistics,
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Discrete Random Variables
Discrete Random Variables
Engineering Probability and Statistics - SE-205 -Chap 4
Chapter Six Normal Curves and Sampling Probability Distributions
Discrete Random Variables
ENGR 201: Statistics for Engineers
Using the Tables for the standard normal distribution
Guide to Using Minitab For Basic Statistical Applications
Lecture 11: Binomial and Poisson Distributions
Introduction to Probability and Statistics
Elementary Statistics
Day 46 Agenda: DG minutes.
Introduction to Sampling Distributions
1/2555 สมศักดิ์ ศิวดำรงพงศ์
Presentation transcript:

Lecture 10. Random Sampling and Sampling Distributions David R. Merrell Intermediate Empirical Methods for Public Policy and Management

Agenda Normal Approximation to Binomial Poisson Process Random sampling Sampling statistics and sampling distributions Expected values and standard errors of sample sums and sample means

Binomial Random Variable Binomial random variable X is the number of “successes” in n trials, where Probability of success remains the same from trial to trial Trials are independent

Binomial Probability Distribution Discrete distribution with: P(X=x) = ( n!/(x!(n-x)!)) p x q n-x n is number of trials x is number of successes in n trials (x = 0, 1, 2,..., n) p is the probability of success on a single trial q is the probability of failure on a single trial

Properties of the Binomial RV Mean:  = np Variance:   = npq Standard Deviation: 

Binomial(n = 10, p =.4) xP(X=x)

Approximation to Binomial Distribution Use normal distribution when: n is large np > 10 n(1 - p) > 10 Parameters of the approximating normal distribution are the mean and standard deviation from the binomial distribution

Approximation of Binomial Distribution n = 80, p =.4

How Good is the Approximation? Normal with mean = and standard deviation = x P( X <= x) x P( X <= x) Binomial with n = 80 and p = x P( X <= x) P(X < 29)

Application 1 The Chicago Equal Employment Commission believes that the Chicago Transit Authority (CTA) discriminates against Republicans. The records show that 37.5% of the individuals listed as passing the CTA exam were Republicans; the remainder were Democrats (no one registers as an independent in Illinois). CTA hired 30 people last year, 25 of them were Democrats. What is the probability that this situation could exist if CTA did not discriminate?

Application 1 (cont.) Success: a Republican is hired The probability of success, p = The number of trials, n = 30 The number of successes, x = 5 P(x  5) = ???

Application 1 (cont.) Mean:  = np = 30*.375 = Variance:   = npq = 30*.375*.625 = 7.03 Standard Deviation:  = 2.65 Normal with mean = and standard deviation = 2.65 x P( X <= x)

Poisson Process time homogeneity independence no clumping rate xxx 0 time Assumptions

Poisson Process Earthquakes strike randomly over time with a rate of = 4 per year. Model time of earthquake strike as a Poisson process Count: How many earthquakes will strike in the next six months? Duration: How long will it take before the next earthquake hits?

Count: Poisson Distribution What is the probability that 3 earthquakes will strike during the next six months?

Poisson Distribution Count in time period t

Minitab Probability Calculation Click: Calc > Probability Distributions > Poisson Enter: For mean 2, input constant 3 Output: Probability Density Function Poisson with mu = x P( X = x)

Duration: Exponential Distribution Time between occurrences in a Poisson process Continuous probability distribution Mean =1/ t

Exponential Probability Problem What is the probability that 9 months will pass with no earthquake?  t = 1/12, t = 1/3 1/ t = 3

Minitab Probability Calculation Click: Calc > Probability Distributions > Exponential Enter: For mean 3, input constant 9 Output: Cumulative Distribution Function Exponential with mean = x P( X <= x)

Exponential Probability Density Function MTB > set c1 DATA > 0:12000 DATA > end Let c1 = c1/1000 Click: Calc > Probability distributions > Exponential > Probability density > Input column Enter: Input column c1 > Optional storage c2 Click: OK > Graph > Plot Enter: Y c2 > X c1 Click: Display > Connect > OK

Exponential Probability Density Function

Sampling Population - entire set of objects that we are interested in studying Sample - a chosen subset of a population

Some Samples Are... random -- each item in the population has an equal chance of being selected to be part of the sample representative -- has the same characteristics as the population under study, a microcosm of the population

Population Parameters and Sample Statistics Population Parameter Numerical descriptor of a population Values usually uncertain e.g., population mean (  ), population standard deviation (  ) Sample Statistics Numerical descriptor of a sample Calculated from observations in the sample e.g., sample mean, sample standard deviation S

What is a sampling distribution? Sample statistics are random variables Sample statistics have probability distributions “Sampling distribution” is the probability distribution of a sample statistic

MTB > Retrieve 'C:\MTBWIN\DATA\RESTRNT.MTW'. Retrieving worksheet from file: C:\MTBWIN\DATA\RESTRNT.MTW Worksheet was saved on 5/31/1994 MTB > info Information on the Worksheet Column Name Count Missing C1 ID C2 OUTLOOK C3 SALES C4 NEWCAP C5 VALUE C6 COSTGOOD C7 WAGES C8 ADS C9 TYPEFOOD C10 SEATS C11 OWNER C12 FT.EMPL C13 PT.EMPL C14 SIZE

MTB > desc 'sales' Descriptive Statistics Variable N N* Mean Median TrMean StDev SEMean SALES Variable Min Max Q1 Q3 SALES MTB > boxp 'sales' * NOTE * N missing = 25

MTB > hist 'sales' * NOTE * N missing = 25

MTB > let c15 = loge('sales') J *** Values out of bounds during operation at J Missing returned 1 times MTB > let c15 = loge('sales' + 1) MTB > name c15 'logsales' MTB > desc 'logsales' Descriptive Statistics Variable N N* Mean Median TrMean StDev SEMean logsales Variable Min Max Q1 Q3 logsales MTB > boxp 'logsales' * NOTE * N missing = 25

Four Samples of Size 50 From Restaurant “Logsales” Data--Histograms

MTB > Desc c16-c19 Descriptive Statistics Variable N N* Mean Median TrMean StDev SEMean C C C C Variable Min Max Q1 Q3 C C C C Random Samples from Restaurant “Logsales” Data--Summary

Next Time... Central Limit Theorem--”Sample averages are approximately normally distributed”