IMPACT EVALUATION WORKSHOP ISTANBUL, TURKEY MAY 11-14.

Slides:



Advertisements
Similar presentations
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo J-PAL.
Advertisements

Designing an impact evaluation: Randomization, statistical power, and some more fun…
AADAPT Workshop South Asia Goa, December 17-21, 2009 Kristen Himelein 1.
Choosing the level of randomization
Dr. G. Johnson, Sampling Demystified: Sample Size and Errors Research Methods for Public Administrators Dr. Gail Johnson.
Planning Sample Size for Randomized Evaluations Jed Friedman, World Bank SIEF Regional Impact Evaluation Workshop Beijing, China July 2009 Adapted from.
Educational Research: Sampling a Population
CHAPTER 7 Sampling Distributions
Planning for Research: Selecting Participants
Who are the participants? Creating a Quality Sample 47:269: Research Methods I Dr. Leonard March 22, 2010.
1 Confidence Interval for the Population Proportion.
Evaluating Hypotheses
Chapter 2 – Tools of Positive Analysis
Agenda: Block Watch: Random Assignment, Outcomes, and indicators Issues in Impact and Random Assignment: Youth Transition Demonstration –Who is randomized?
Practical Sampling for Impact Evaluations Cyrus Samii, Columbia University.
The t Tests Independent Samples.
SAMPLING AND STATISTICAL POWER Erich Battistin Kinnon Scott Erich Battistin Kinnon Scott University of Padua DECRG, World Bank University of Padua DECRG,
Quantitative Research
Cover Letters for Survey Research Studies
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 7 Sampling Distributions 7.1 What Is A Sampling.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Hypothesis Testing in Linear Regression Analysis
Determining Sample Size
1 1 Practical Sampling for Impact Evaluation (aka shedding light on voodoo) Laura Chioda (LAC Chief Economist office & Adopted by DIME) Reducing Fragility,
Statistical Methods, part 1 Module 2: Latent Class Analysis of Survey Error Models for measurement errors Dan Hedlin Stockholm University November 2012.
Measuring Impact: Experiments
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo MIT and Poverty Action Lab.
Fundamentals of Data Analysis Lecture 9 Management of data sets and improving the precision of measurement.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
AADAPT Workshop Latin America Brasilia, November 16-20, 2009 Laura Chioda World Bank 1.
 The range of a data set is the difference between the maximum and minimum data entries in the set. The find the range, the data must be quantitative.
P RACTICAL S AMPLING FOR I MPACT E VALUATIONS Marie-Hélène Cloutier 1.
Variables, sampling, and sample size. Overview  Variables  Types of variables  Sampling  Types of samples  Why specific sampling methods are used.
Beyond surveys: the research frontier moves to the use of administrative data to evaluate R&D grants Oliver Herrmann Ministry of Business, Innovation.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Sampling for an Effectiveness Study or “How to reject your most hated hypothesis” Mead Over, Center for Global Development and Sergio Bautista, INSP Male.
Why Use Randomized Evaluation? Isabel Beltran, World Bank.
Medical Statistics as a science
Applying impact evaluation tools A hypothetical fertilizer project.
Smith/Davis (c) 2005 Prentice Hall Chapter Fourteen Designing and Conducting Experiments with Multiple Independent Variables PowerPoint Presentation created.
Benjamin Olken MIT Sampling and Sample Size. Course Overview 1.What is evaluation? 2.Measuring impacts (outcomes, indicators) 3.Why randomize? 4.How to.
CHOOSING THE LEVEL OF RANDOMIZATION. Unit of Randomization: Individual?
Statistical Questioning Lesson After completing this lesson, you will be able to say: I can recognize and write a statistical question. I can recognize.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
Higher National Certificate in Engineering Unit 36 –Lesson 4 – Parameters used to Describe the Normal Distribution.
STATISTICAL DATA GATHERING: Sampling a Population.
Efsa LEARNING PROGRAMME Module 4 - Session 4.5a Sampling.
Surveys, Experiments, and Simulations Unit 3 Part 1 Surveys.
1 Aspects of Sampling for Household Surveys Kathleen Beegle Workshop 17, Session 1c Designing and Implementing Household Surveys March 31, 2009.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 7 Sampling Distributions 7.1 What Is A Sampling.
Population vs Sample Population = The full set of cases Sample = A portion of population The need to sample: More practical Budget constraint Time constraint.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Inferential Statistics Psych 231: Research Methods in Psychology.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
Development Impact Evaluation Initiative Innovations in Investment climate reforms Paris, Nov 14, 2012 in collaboration with the Investment Climate Global.
CHAPTER 4 Designing Studies
Quasi Experimental Methods I
PCB 3043L - General Ecology Data Analysis.
Understanding Results
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
CHAPTER 7 Sampling Distributions
SAMPLING AND STATISTICAL POWER
Psych 231: Research Methods in Psychology
(-4)*(-7)= Agenda Bell Ringer Bell Ringer
CHAPTER 7 Sampling Distributions
Advanced Algebra Unit 1 Vocabulary
Sample Sizes for IE Power Calculations.
Title Team Members.
Presentation transcript:

IMPACT EVALUATION WORKSHOP ISTANBUL, TURKEY MAY 11-14

PRACTICAL SAMPLING Aidan Coville, DIME 13 May 2015 Istanbul IE Workshop

This presentation covers two questions: Why is sample size important? – Approx time: 20 mins (interactive) How big should my sample be? – Approx time: a lifetime of pain and anguish 3 **Slides draw from the many people that have done this presentation before (including Laura Chioda)

Q1: Why is sample size important? 4

interactive example Go to: pollev.com/dime1 5

Why is sample size important? Imagine you had to sample letters to “estimate” what the sentence says: 6

Why is sample size important? Imagine you had to sample letters to “estimate” what the sentence says: 7

Why is it important for IE? We want to know the true impact But we need to estimate this impact from a sample Estimation means we can sometimes make mistakes Making mistakes can be costly… 8

Q2: How big should my sample be? 9

The answer is… = 42

The End Questions?

A better question… What influences the sample size I need? – Size of impact – Variation in outcome – Level of clustering – Take up 12

What influences the sample size I need? Size of impactVariation in outcome Level of clusteringTake up

What influences the sample size I need? Size of impactVariation in outcome Level of clusteringTake up

Who is taller? Size of impact Big impacts are easy to identifySmall impacts are more difficult Need more precision/accuracy Larger sample needed

Minimum detectable effect We need a sample size able to detect the smallest effect size of importance. To guide this decision we need to ask: “What is the smallest effect size that, if it were any smaller, the intervention would not be worth the effort?” 16

Mo money mo power

Need to be realistic

What influences the sample size I need? Size of impactVariation in outcome Level of clusteringTake up

 How does the variance of the outcome affect our ability to detect an impact? Which group is bigger?

 How does the variance of the outcome affect our ability to detect an impact? Now… which group is bigger?

Imagine that an investment promotion program leads to an increase in firm investment (impact) from SZC to SZC on average  Case A: businesses are all very similar and the distribution of investment amoutns is very concentrated  Case B: businesses are very different and distribution of investments are spread out (distributions overlap more) Which instance requires a more precise measuring device? 22 And this doesn’t just apply to farm animals

In sum:  More underlying variance (heterogeneity)  more difficult to detect difference  need larger sample size Tricky: How do we know about heterogeneity before we decide our sample size and collect our data?  Ideal: pre-existing data … but often non-existent  Can use pre-existing data from a similar population  Example: enterprise surveys, labor force surveys  Common sense Variation in outcomes (summary)

What influences the sample size I need? Size of impactVariation in outcome Level of clusteringTake up

Clustering (1/4) Sample size required increases, the higher the level of intervention assignment – Business level – Business group level – Village/port/… – Province? Even if unit of analysis is the firm, if level of randomization is at province (cluster) level, we run into challenges quickly…

Clustering (2/4) What is the added value of more samples in the same cluster? Village 1 Village 2 Village 4 Village 3

Clustering (3/4) Village 1 Village 2 Village 4 Village 3

Clustering (4/4) Takeaway Larger within cluster correlation (guys in same cluster are similar) lower marginal value per extra sampled unit in the cluster higher sample size/more clusters needed than a simple random sample. Rule of thumb: at least 40 clusters per treatment arm

What influences the sample size I need? Size of impactVariation in outcome Level of clusteringTake up

Example:  We can only offer matching grant. We cannot force businesses to use it  Offer grant to 500 businesses  Only 50 participate  In practice, because of low take up rate, we end up with a less precise measuring device  We won’t be able to detect differences with precision  Can only find an effect if it is really large  Take-up  Low take-up (rate) lowers precision of our comparisons  Effectively decreases sample size (remember rule of inverse square) 30 Why is take up important for sample size?

Take up vs. sample size 31

A real-life example Matching grant application vs. completion rates

Overview  Who to interview is ultimately determined by our research/policy questions  How Many: 33 Elements:Implication for Sample Size: The smaller effects that we want to detect The larger the sample size will have to be The more underlying heterogeneity (variance) The more clustering in samples The lower take up

How can we boost power Focus on homogenous group (if applicable) High frequency data on core indicators Increase take up better quality data (its worth it…) Avoid clustering where possible