Sampling and Power Slides by Jishnu Das.

Slides:



Advertisements
Similar presentations
Designing an impact evaluation: Randomization, statistical power, and some more fun…
Advertisements

Anthony Greene1 Simple Hypothesis Testing Detecting Statistical Differences In The Simplest Case:  and  are both known I The Logic of Hypothesis Testing:
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Inference about a Mean Part II
Sampling Designs and Techniques
Chapter 9 Hypothesis Testing.
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
AM Recitation 2/10/11.
Chapter 8 Introduction to Hypothesis Testing
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
The Probability of a Type II Error and the Power of the Test
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo MIT and Poverty Action Lab.
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Sampling for an Effectiveness Study or “How to reject your most hated hypothesis” Mead Over, Center for Global Development and Sergio Bautista, INSP Male.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
EMIS 7300 SYSTEMS ANALYSIS METHODS FALL 2005 Dr. John Lipp Copyright © Dr. John Lipp.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Framework of Preferred Evaluation Methodologies for TAACCCT Impact/Outcomes Analysis Random Assignment (Experimental Design) preferred – High proportion.
- We have samples for each of two conditions. We provide an answer for “Are the two sample means significantly different from each other, or could both.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
DSCI 346 Yamasaki Lecture 1 Hypothesis Tests for Single Population DSCI 346 Lecture 1 (22 pages)1.
Methods of Presenting and Interpreting Information Class 9.
Agenda n Probability n Sampling error n Hypothesis Testing n Significance level.
Chapter 8 Introducing Inferential Statistics.
Chapter 8: Inferences Based on a Single Sample: Tests of Hypotheses
Statistical Inference
More on Inference.
Samples, Surveys, and Results UNIT SELF-TEST QUESTIONS
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Statistical Core Didactic
Chapter 4. Inference about Process Quality
Inference and Tests of Hypotheses
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Chapters 20, 21 Hypothesis Testing-- Determining if a Result is Different from Expected.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Sample Size Estimation
Statistical inference: distribution, hypothesis testing
CHAPTER 9 Testing a Claim
4 Sampling.
Chapter 8: Inference for Proportions
Power, Sample Size, & Effect Size:
Chapter 9 Hypothesis Testing.
More on Inference.
Chapter 7 Sampling Distributions
Chapter 9 Hypothesis Testing.
Sampling Lecture 10.
St. Edward’s University
CHAPTER 9 Testing a Claim
Slides by JOHN LOUCKS St. Edward’s University.
Sampling.
What are their purposes? What kinds?
CHAPTER 10 Comparing Two Populations or Groups
Testing Hypotheses I Lesson 9.
Introduction to the design (and analysis) of experiments
DESIGN OF EXPERIMENTS by R. C. Baker
Sample Sizes for IE Power Calculations.
CHAPTER 9 Testing a Claim
Section 11.1: Significance Tests: Basics
Type I and Type II Errors
Statistical Power.
Rest of lecture 4 (Chapter 5: pg ) Statistical Inferences
CHAPTER 9 Testing a Claim
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Presentation transcript:

Sampling and Power Slides by Jishnu Das

Sample Selection in Evaluation Population based representative surveys: Sample representative of whole population Good for learning about the population Not always most efficient for impact evaluation Sampling for Impact evaluation Balance between treatment and control groups Power  statistical inference for groups of interest Concentrate sample strategically Survey budget as major consideration In practice, sample size is often set by budget Concentrate sample on key populations to increase power

Purposive Sampling: Risk: We will systematically bias our sample, so results don’t generalize to the rest of the population or other sub-groups Trade off between power within population of interest and population representation Results are internally valid, but not generalizable.

Type I and type II errors Type I error: Reject the null hypothesis when it is true Significance level  probability of rejecting the null when it is true (Type I error) Type II error: Accept (fail to reject) the null hypothesis when it is false Power  probability of rejecting the null when an alternative null is true (1-probability of Type II) We want to minimize both types of errors Increase sample size

Survey - Sampling Population: all cases of interest Sampling frame: list of all potential cases Sample: cases selected for analysis Sampling method: technique for selecting cases from sampling frame Sampling fraction: proportion of cases from population selected for sample (n/N)

Sampling Frame Simple Sampling – almost never practical unless universe of interest is geographically concentrated Cluster Sampling – randomly choose clusters and then randomly choose units within the cluster. Effective sample size is less than actual number of observations. This is the design, or cluster, effect The design effect implies that, for a given sized sample, the variance increases [1 + (E-1)] where E is the number of elements in each cluster and  is the intra-class correlation, a measure of how much the observations with in a cluster resemble each other.

Using Power Calculations to Estimate Sample Sizes What is the size sample needed to be able to find a difference in means at a given statistical significance. Need idea of what difference is a plausible expectation for the intervention. Fixing the confidence level, we observe two things when increasing sample size: the rejection region gets larger and the power increases

In Practice - I Many sample patterns possible especially when one can vary cluster numbers and cluster sizes May use simulations in Stata or similar package. They easily account for complicated designs Panel and dif-in-dif calculations need to be based on ability to find significance of changes, not difference in levels. Requires an estimate of correlation over rounds Sample needed to find difference between alternative treatments is different than that needed to compare to control

In Practice - II Number of clusters improves precision and is important especially in randomized designs. Not strictly necessary that treatment and control are equal in size or number of clusters but analysis is complicated if probability of selection differs. Importance of transparency in randomization process Many medical journal require registering trials prior to analysis (to avoid reporting only ‘favorable’ results).

An Example Does Information improve child performance in schools? (Pakistan) Randomized Design Interested in villages where there are private schooling options What Villages should we work in? Stratification: North, Central, South Random Sample: Villages chosen randomly from list of all villages with a private school

In Practice: An Example How many villages should we choose? Depends on: How many children in every village How big do we think the treatment effect will be What the overall variability in the outcome variable will be

In Practice: An Example Simulation Tables Table 1 assumes very high variability in test-scores. X,Y: X is for intervention with small effect size; Y for larger effect size N: Significant < 1% of simulations S: Significant < 10% of simulations A: Significant > 99% of simulations

In Practice: An Example Simulation Tables Table 1 assumes lower variability in test-scores. X,Y: X is for intervention with small effect size; Y for larger effect size N: Significant < 1% of simulations S: Significant < 10% of simulations A: Significant > 99% of simulations

When do we really worry about this? IF Very small samples at unit of treatment! Suppose treatment in 20 schools and control in 20 schools But there are 400 children in every school This is still a small sample Interested in sub-groups (blocks) Sample size requirements increase exponentially Using Regression Discontinuity Designs