Case Study - Relative Risk and Odds Ratio John Snow’s Cholera Investigations.

Slides:



Advertisements
Similar presentations
POINT ESTIMATION AND INTERVAL ESTIMATION
Advertisements

Section 7.4 Approximating the Binomial Distribution Using the Normal Distribution HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008.
Sampling Distributions (§ )
Test 2 Stock Option Pricing
1 A heart fills with loving kindness is a likeable person indeed.
Evaluating Hypotheses
Confidence Interval for Relative Risk
Estimating a Population Proportion
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Estimation and Confidence Intervals Chapter 9.
Population Proportion The fraction of values in a population which have a specific attribute p = Population proportion X = Number of items having the attribute.
Chapter 7 The Normal Probability Distribution 7.5 Sampling Distributions; The Central Limit Theorem.
5-3 Inference on the Means of Two Populations, Variances Unknown
Ka-fu Wong © 2004 ECON1003: Analysis of Economic Data Lesson6-1 Lesson 6: Sampling Methods and the Central Limit Theorem.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Standard error of estimate & Confidence interval.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
8.1 Inference for a Single Proportion
AP Statistics Chapter 9 Notes.
Binomial distribution Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Normal Distribution as an Approximation to the Binomial Distribution Section 5-6.
The Binomial Distribution. Binomial Experiment.
AP Statistics Section 8.1: The Binomial Distribution.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
Copyright ©2011 Nelson Education Limited The Normal Probability Distribution CHAPTER 6.
Mid-Term Review Final Review Statistical for Business (1)(2)
1 Chapter 5 Sampling Distributions. 2 The Distribution of a Sample Statistic Examples  Take random sample of students and compute average GPA in sample.
FINAL REVIEW. The amount of coffee (in ounces) filled in a jar by a machine has a normal distribution with a mean amount of 16 ounces and a standard deviation.
Modular 11 Ch 7.1 to 7.2 Part I. Ch 7.1 Uniform and Normal Distribution Recall: Discrete random variable probability distribution For a continued random.
Estimating a Population Proportion
 A probability function is a function which assigns probabilities to the values of a random variable.  Individual probability values may be denoted by.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
1 SMU EMIS 7364 NTU TO-570-N Inferences About Process Quality Updated: 2/3/04 Statistical Quality Control Dr. Jerrell T. Stracener, SAE Fellow.
Bernoulli Trials Two Possible Outcomes –Success, with probability p –Failure, with probability q = 1  p Trials are independent.
1October In Chapter 17: 17.1 Data 17.2 Risk Difference 17.3 Hypothesis Test 17.4 Risk Ratio 17.5 Systematic Sources of Error 17.6 Power and Sample.
The Practice of Statistics Third Edition Chapter 8: The Binomial and Geometric Distributions 8.1 The Binomial Distribution Copyright © 2008 by W. H. Freeman.
Chapter 4. Discrete Random Variables A random variable is a way of recording a quantitative variable of a random experiment. A variable which can take.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Statistics Section 5-6 Normal as Approximation to Binomial.
June 16. In Chapter 8: 8.1 Concepts 8.2 Sampling Behavior of a Mean 8.3 Sampling Behavior of a Count and Proportion.
The binomial applied: absolute and relative risks, chi-square.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Slide Slide 1 Section 6-5 The Central Limit Theorem.
1 Since everything is a reflection of our minds, everything can be changed by our minds.
Chapter 4 Probability: Studying Randomness. Randomness and Probability Random: Process where the outcome in a particular trial is not known in advance,
The Binomial Distribution
Statistics 3502/6304 Prof. Eric A. Suess Chapter 4.
Confidence Intervals Chapter 6. § 6.3 Confidence Intervals for Population Proportions.
The final exam solutions. Part I, #1, Central limit theorem Let X1,X2, …, Xn be a sequence of i.i.d. random variables each having mean μ and variance.
Exam 2: Rules Section 2.1 Bring a cheat sheet. One page 2 sides. Bring a calculator. Bring your book to use the tables in the back.
Review Normal Distributions –Draw a picture. –Convert to standard normal (if necessary) –Use the binomial tables to look up the value. –In the case of.
Chapter 5 Sampling Distributions. Introduction Distribution of a Sample Statistic: The probability distribution of a sample statistic obtained from a.
Confidence Intervals for a Population Proportion Excel.
Section 6.4 Inferences for Variances. Chi-square probability densities.
1 Probability and Statistics Confidence Intervals.
Module 25: Confidence Intervals and Hypothesis Tests for Variances for One Sample This module discusses confidence intervals and hypothesis tests.
Example A population has a mean of 200 and a standard deviation of 50. A random sample of size 100 will be taken and the sample mean x̄ will be used to.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Chap 5-1 Chapter 5 Discrete Random Variables and Probability Distributions Statistics for Business and Economics 6 th Edition.
Statistics for Business and Economics Module 1:Probability Theory and Statistical Inference Spring 2010 Lecture 8: Tests of significance and confidence.
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
O Make Copies of: o Areas under the Normal Curve o Appendix B.1, page 784 o (Student’s t distribution) o Appendix B.2, page 785 o Binomial Probability.
1. 2 At the end of the lesson, students will be able to (c)Understand the Binomial distribution B(n,p) (d) find the mean and variance of Binomial distribution.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Estimation and Confidence Intervals Chapter 9.
Ch3.5 Hypergeometric Distribution
CHAPTER 6 Random Variables
The binomial applied: absolute and relative risks, chi-square
SA3202 Statistical Methods for Social Sciences
Binomial Probability Distribution
Section 3: Estimating p in a binomial distribution
8.1 The Binomial Distribution
Presentation transcript:

Case Study - Relative Risk and Odds Ratio John Snow’s Cholera Investigations

Population Information 2 Water Providers: Southwark & Vauxhall (S&V) and Lambeth (L) –S&V: Population: # Cholera Deaths: 3706 –L: Poulation: # Choleta Deaths: 411

Sampling Distribution of RR & OR Goal: Obtain Empirical Sampling Distributions of sample RR and OR and observe coverage rate of 95% Confidence Intervals Process: Take independent random samples of size n SV and n L from the 2 populations and observe X SV and X L deaths in sample. These X SV and X L are approximately distributed as Binomial random variables (approximate due to sampling from finite, but very large, populations)

Binomial Distribution for Sample Counts Binomial “Experiment” –Consists of n trials or observations –Trials/observations are independent of one another –Each trial/observation can end in one of two possible outcomes often labelled “Success” and “Failure” –The probability of success, p, is constant across trials/observations –Random variable, X, is the number of successes observed in the n trials/observations. Binomial Distributions: Family of distributions for X, indexed by Success probability (p) and number of trials/observations (n). Notation: X~B(n,p)

Binomial Distributions and Sampling Problem when sampling from a finite sample: the sequence of probabilities of Success is altered after observing earlier individuals. When the population is much larger than the sample (say at least 20 times as large), the effect is minimal and we say X is approximately binomial Obtaining probabilities: Table C gives probabilities for various n and p. Note that for p > 0.5, use 1-p and you are obtaining P(X=n-k)

Simulating Binomial RVs Select n and p Obtain n random numbers distributed uniformly between 0 and 1 (any software package should have built-in random number generator): U 1,…,U n Let X be the number of U i values that  p X~B(n,p) Finite population adjustments can be made by “correcting” p after each draw EXCEL has built in Function: –Tools --> Data Analysis --> Random Number Generation –--> Binomial --> Fill in p and n

Simulation Example Simulate by taking samples of n SV =n L =5000 individuals from each population of customers Generate X SV ~B(5000, ) and X L ~B(5000, ) Compute sample relative risk, ln(RR), odds ratio, ln(OR), and estimated std. errors of ln(RR) and ln(OR) Obtain 95% CIs for RR, OR (based on ln(RR),ln(OR) Repeat for a large number of samples (1000 samples) Obtain the empirical distribution of each statistic Obtain an indicator of whether the 95% CI for RR contains the population RR (5.78) and whether the 95% CI for OR contains the population OR (5.85)

Computations

Note that the distribution of Relative Risks is not normal

Note that distribution of ln(RR) is approximately normal