How do we Randomize? Jenny C. Aker Tufts University.

Slides:



Advertisements
Similar presentations
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation Muna Meky Impact Evaluation Cluster, AFTRL Slides by Paul J.
Advertisements

Survey design. What is a survey?? Asking questions – questionnaires Finding out things about people Simple things – lots of people What things? What people?
Choosing the level of randomization
How to randomize? povertyactionlab.org. Quick Review: Why Randomize Choosing the Sample Frame Choosing the Unit of Randomization Options How to Choose.
STATISTICS FOR MANAGERS LECTURE 2: SURVEY DESIGN.
Povertyactionlab.org How to Randomize? Abhijit Banerjee Massachusetts Institute of Technology.
Experimental Research Designs
What could go wrong? Deon Filmer Development Research Group, The World Bank Evidence-Based Decision-Making in Education Workshop Africa Program for Education.
1 Managing Threats to Randomization. Threat (1): Spillovers If people in the control group get treated, randomization is no more perfect Choose the appropriate.
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Chapter 7 Sampling Distributions
Sampling Prepared by Dr. Manal Moussa. Sampling Prepared by Dr. Manal Moussa.
11 Populations and Samples.
Chapter 12 Sample Surveys
Course Content Introduction to the Research Process
SAMPLING AND STATISTICAL POWER Erich Battistin Kinnon Scott Erich Battistin Kinnon Scott University of Padua DECRG, World Bank University of Padua DECRG,
SAMPLING METHODS Chapter 5.
Cross-Country Workshop for Impact Evaluations in Agriculture and Community Driven Development Addis Ababa, April 13-16, 2009 AIM-CDD Using Randomized Evaluations.
1 Randomization in Practice. Unit of randomization Randomizing at the individual level Randomizing at the group level –School –Community / village –Health.
Chapter 1: Introduction to Statistics
Measuring Impact: Experiments
AADAPT Workshop South Asia Goa, December 17-21, 2009 Nandini Krishnan 1.
Shawn Cole Harvard Business School Threats and Analysis.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Why Use Randomized Evaluation? Presentation by Shawn Cole, Harvard Business School and J-PAL Presenter: Felipe Barrera-Osorio, World Bank 1 APEIE Workshop.
Africa Impact Evaluation Program on AIDS (AIM-AIDS) Cape Town, South Africa March 8 – 13, Causal Inference Nandini Krishnan Africa Impact Evaluation.
Impact Evaluation Designs for Male Circumcision Sandi McCoy University of California, Berkeley Male Circumcision Evaluation Workshop and Operations Meeting.
Nigeria Impact Evaluation Community of Practice Abuja, Nigeria, April 2, 2014 Measuring Program Impacts Through Randomization David Evans (World Bank)
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Chapter 7 Sampling Distributions Basic Business Statistics.
Why Use Randomized Evaluation? Isabel Beltran, World Bank.
Applying impact evaluation tools A hypothetical fertilizer project.
Measuring Impact 1 Non-experimental methods 2 Experiments
Africa Impact Evaluation Program on AIDS (AIM-AIDS) Cape Town, South Africa March 8 – 13, Steps in Implementing an Impact Evaluation Nandini Krishnan.
TRANSLATING RESEARCH INTO ACTION What and How to randomize? July 9, 2011 Dhaka Raymond Guiteras, Assistant Professor University of Maryland povertyactionlab.org.
CHOOSING THE LEVEL OF RANDOMIZATION. Unit of Randomization: Individual?
Chapter Ten Copyright © 2006 John Wiley & Sons, Inc. Basic Sampling Issues.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
Cross-Country Workshop for Impact Evaluations in Agriculture and Community Driven Development Addis Ababa, April 13-16, 2009 Steps in Implementing an Impact.
Implementing an impact evaluation under constraints Emanuela Galasso (DECRG) Prem Learning Week May 2 nd, 2006.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
IPDET Module 9: Choosing the Sampling Strategy. IPDET © Introduction Introduction to Sampling Types of Samples: Random and Nonrandom Determining.
Sampling technique  It is a procedure where we select a group of subjects (a sample) for study from a larger group (a population)
Social Experimentation & Randomized Evaluations Hélène Giacobino Director J-PAL Europe DG EMPLOI, Brussells,Nov 2011 World Bank Bratislawa December 2011.
Global Workshop on Development Impact Evaluation in Finance and Private Sector Rio de Janeiro, June 6-10, 2011 Using Randomized Evaluations to Improve.
1 Ongoing Agriculture Projects in Nigeria and Malawi Ahmed Mushfiq Mobarak Yale University.
Africa Impact Evaluation Program on AIDS (AIM-AIDS) Cape Town, South Africa March 8 – 13, Randomization.
Impact Evaluation for Evidence-Based Policy Making Arianna Legovini Lead Specialist Africa Impact Evaluation Initiative.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
Sampling Chapter 5. Introduction Sampling The process of drawing a number of individual cases from a larger population A way to learn about a larger population.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
Cross-Country Workshop for Impact Evaluations in Agriculture and Community Driven Development Addis Ababa, April 13-16, Causal Inference Nandini.
Class 7 – Randomization* *(how to design an RCT that accommodates real world constraints)
CHAPTER 6: SAMPLING. Populations and Samples Population: a group of individuals a researcher seeks to learn about from a research study. Sample: the group.
4.05 Understand marketing-research design considerations to evaluate their appropriateness for the research problem/issue 4.00 Understand promotion and.
Using Randomized Evaluations to Improve Policy
Sampling.
Threats and Analysis.
How to Randomize.
Principles of Experiment
Quasi-Experimental Methods
Using Randomized Evaluations to Improve Policy
Development Impact Evaluation in Finance and Private Sector
Implementation Challenges
Randomization This presentation draws on previous presentations by Muna Meky, Arianna Legovini, Jed Friedman, David Evans and Sebastian Martinez.
Randomization This presentation draws on previous presentations by Muna Meky, Arianna Legovini, Jed Friedman, David Evans and Sebastian Martinez.
Sampling and Power Slides by Jishnu Das.
Impact Evaluation Designs for Male Circumcision
Applying Impact Evaluation Tools: Hypothetical Fertilizer Project
Presentation transcript:

How do we Randomize? Jenny C. Aker Tufts University

Introduction to ATAI and J-PAL Using randomized evaluations to test adoption constraints From adoption to impact Alternative strategies for randomizing programs Power and sample size Managing and minimizing threats to analysis Common pitfalls Randomized evaluation: Start-to-finish Course Overview

What is the impact of the intervention? o What is the impact of NERICA on rice yields? o What is the impact of improved bags on cowpea losses? Was this (observed) impact due to the program or something else? o Unbiased treatment or program effect o Attribution Why are we here?

Measuring Impact Offer farmers improved quality seeds Farmers use seeds Farmers don’t use seeds “Treated” farmers yields “Control” farmers yields 10 kg Is this unbiased? Too big or too small?

Measuring Impact Choose villages away from a paved road to get seeds Farmers in treated villages use seeds Farmers in control villages don’t use seeds “Treated” farmers yields “Control” farmers yields 10 kg Is this unbiased? Too big or too small?

5 Randomizati on RDDDDMatching PSM IV ExperimentalQuasi- Experiment Non—Experimental High internal validity Lower external validity Lower internal validity Higher external validity

What is randomization? Randomization involves randomly assigning a potential participant (individual, household or village) to the treatment or control group It gives each potential participant a (usually equal) chance of being assigned to each group The objective is to ensure that the only systematic difference between the program participants (treatment) and non-participants (control) is the presence of the program

Basic setup of a randomized evaluation 7 Target Population Not in evaluation Evaluation Sample Random Assignment Treatment group Participants No-Shows Control group Non- Participants Cross-overs Random number generator, calebasse, Excel, STATA

How can randomization be useful to measure a program effect? It gives each potential participant a (usually equal) chance of being assigned to each group On average, and if the sample size is large enough, observable and unobservable characteristics between the program participants (treatment) and non-participants (control) is the same The only difference is the presence of the program (But we need to check that it worked)

Simple lottery Randomization in the “bubble” Randomized phase-in Rotation Encouragement design These are not mutually exclusive. Possible Randomization Designs

Unit and method of randomization Real-world constraints Revisiting unit and method Variations on simple treatment-control Lecture Overview

Unit and method of randomization Real-world constraints Revisiting unit and method Variations on simple treatment-control Lecture Overview

1.Randomizing at the individual level 2.Randomizing at the group level “Cluster Randomized Trial” At which level should we randomize? 12 Unit of Randomization: Options

Unit of Randomization: Individual?

Unit of Randomization: Clusters ? “ Groups of individuals”: Cluster Randomized Trial

Unit of Randomization: Farmers’ Group?

Unit of Randomization: Village?

How do we choose the level? What unit does the program target for treatment? What is the unit of analysis?

Nature of the Treatment o How is the intervention administered? o What is the catchment area of each “unit of intervention” o How wide is the potential impact? Aggregation level of available data Power requirements Generally, it is best to randomize at the level at which the treatment is administered. How do we choose the level?

Intervention: Tree-planting contract for farmers (same subsidy, different payment schemes) Treatment Level: Farmers (or farm households) Catchment level: Agricultural extension agent Randomization level: Individual (500 farmers across 23 villages) Oversubscription, so additional public lottery Example: Tree-Planting in Malawi (Jack 2011)

Intervention: Promotion of fertilizer “trees” Treatment level: Farmers’ group and individual farmers Randomization level: o Farmer group (each group randomly offered different price for tree) o Individual: Reward cards for maintaining trees randomized across farmers within the group Example: Fertilizer Tree Adoption in Zambia

Intervention: SMS-based price information in India Treatment level: Farmers within the village Randomization level: Village o Ten farmers selected in each village o Farmers received subscription to a SMS-based price information service (Reuters Market Light) o Intensity of treatment varied across villages (ie, number of farmers) Example: SMS-Based Price Information in India

Unit and method of randomization Real-world constraints Revisiting unit and method Variations on simple treatment-control Lecture Overview

Fairness and ethical issues Political Concerns Resources Crossovers/spillovers Logistics Sample size Real-World Constraints

Randomizing at the individual level within a farmers’ association o Non-treated farmers might be unhappy Randomizing at the household-level within the village o Non-recipient households or the village chief might be unhappy Randomizing at the village or farmers’ association level o Ministry of Agriculture might be unhappy Fairness

Lotteries are simple and common Randomly chosen from applicant pool Participants know the “winners” and “losers” Simple lottery is useful when there is no a priori reason to discriminate Can be perceived as fair Transparent How to Randomize, Part I - 28 Political Concerns

Many programs have limited resources o Vouchers, Subsidies, Training o More eligible recipients than resources How will program recipients be chosen? o Clear-cut criteria o Arbitrary criteria o Random process o Some combination of the above How to Randomize, Part I - 29 Resources

Contamination of the control group can be due to: Spillovers – positive or negative Crossovers – movement to treatment (or control) group Spillovers/Crossovers

Is it possible or feasible for staff to implement different programs in the same catchment area? Agricultural extension agent provides training in improved NRM techniques – Training is one of many responsibilities of the agent – The agent might serve farmers from both treatment and control villages within his/her catchment areas – It might be difficult to train them to follow different procedures for different groups, and to keep track of what to give whom Logistics

The program is only large enough to serve a handful of communities Might not be able to survey (or implement the program in) enough communities to detect a (statistical) effect Sample Size

Unit and method of randomization Real-world constraints Revisiting unit and method Variations on simple treatment-control Lecture Overview

Simple lottery Randomization in the “bubble” Randomized phase-in Rotation Encouragement design These are not mutually exclusive. Possible Randomization Designs

A partner may not be willing to randomize among eligible people. However, a partner might be willing to randomize in “the bubble.” People “in the bubble” are those who are borderline in terms of eligibility – Just above the threshold  not eligible, but almost What treatment effect do we measure? What does it mean for external validity? Randomization in “the bubble”

Within the bubble, compare treatment to control Participants <=.25 ha Non-participants >.25 ha Treatment Control

Randomization in the Bubble Must receive the program Ineligible Randomized assignment to the program

Program still has discretion to treat “necessary” groups Example: Agricultural grant program in Niger (PRODEX) Example: Expansion of consumer credit in South Africa Randomization in “the bubble”

Takes advantage of the program expansion (ie, the NGO cannot implement in all villages the first year) Everyone gets program eventually If everyone is eligible for the program, what determines which villages, schools, branches, etc. will be covered in which year? Randomized Phase-In

Round 3 Treatment: 3/3 Control: Round 1 Treatment: 1/3 Control: 2/3 Round 2 Treatment: 2/3 Control: 1/3 Randomized evaluation ends

Advantages Everyone gets something eventually Provides incentives to maintain contact Concerns Can complicate estimating long-run effects Be careful with phase-in windows Do expectations of change actions today? Randomized Phase-In

Groups get treatment in turns – Group A gets treatment in the first period – Group B gets treatment in the second period How to Randomize, Part I - 42 Rotation

Round 1 Treatment: 1/2 Control: 1/2 Rotation design Round 2 Treatment from Round 1  Control —————————————————————————— Control from Round 1  Treatment Round 1 Treatment: 1/2 Control: 1/2

Advantages: – Might be perceived as fairer, therefore easier to get accepted Disadvantages: – If those in Group B anticipate treatment, they might change their behavior – Cannot measure long-term impact because no pure control group How to Randomize, Part I - 44 Rotation

Sometimes it’s not possible to randomize program access (vaccines, savings program, etc) But many programs have less than 100% take- up Randomize encouragement to receive treatment Randomized Encouragement

Encouragement design Encourage Do not encourage participated did not participate Complying Not complying compare encouraged to not encouraged do not compare participants to non- participants adjust for non-compliance in analysis phase These must be correlated

Something that makes some individuals more likely to use program than others – Not in itself a treatment – E.g., vouchers, training, visit from agent, etc For whom are we estimating the treatment effect? – Think about who responds to encouragement (compliers) What is “encouragement”?

Simple lottery Randomization in the “bubble” Randomized phase-in Rotation Encouragement design These are not mutually exclusive. Summary of Possible Designs

How to Randomize, Part I - 49 Methods of randomization - recap DesignMost useful when… AdvantagesDisadvantages Basic Lottery Program oversubscribed Familiar Easy to understand Easy to implement Can be implemented in public Control group may not cooperate Differential attrition

How to Randomize, Part I - 50 Methods of randomization - recap DesignMost useful when…AdvantagesDisadvantages Phase-In Expanding over time Everyone must receive treatment eventually Easy to understand Constraint is easy to explain Control group complies because they expect to benefit later Anticipation of treatment may impact short-run behavior Difficult to measure long-term impact

How to Randomize, Part I - 51 Methods of randomization - recap DesignMost useful when…AdvantagesDisadvantages Rotation Everyone must receive something at some point Not enough resources per given time period for all More data points than phase-in Difficult to measure long-term impact

How to Randomize, Part I - 52 Methods of randomization - recap DesignMost useful when… AdvantagesDisadvantages Encouragement Program has to be open to all comers When take-up is low, but can be easily improved with an incentive Can randomize at individual level even when the program is not administered at that level Measures impact of those who respond to the incentive Need large enough inducement to improve take-up Encouragement itself may have direct effect

Intervention: Provide capacity-building grants to farmers’ association (World Bank, GoN) Treatment level: Farmers’ association Randomization level: Farmers’ association Randomization design: – Partial lottery – Randomized phase-in – Encouragement scheme But huge implications for sample size! Grants to Farmers’ Association in Niger

Consider non-standard lottery designs Could increase outreach activities Is this ethical? How to Randomize, Part I - 54 What if you have 500 applicants for 500 slots?

There are 2000 applicants Application screening identifies 500 “eligible” candidates for 500 slots In this case, a simple lottery will not work What are our options? Non-Standard Lottery Designs

Selection procedures may exist only to reduce eligible candidates in order to meet a capacity constraint What are they screening for? Which elements are essential? If certain screening mechanisms appear “arbitrary” (although not random), randomization can serve the purpose of filtering and help with evaluation Non-Standard Lottery Designs

Unit and method of randomization Real-world constraints Revisiting unit and method Variations on simple treatment-control Lecture Overview

Multiple treatments Crossing or interacting treatments Varying levels of treatment Stratified randomization Multiple-stage randomization Variations on Simple Treatment and Control

Sometimes the core question is deciding among different possible interventions – Ie, in-person extension agent visits versus a call-in hotline You can randomize these interventions Does this teach us about the benefit of any one intervention? Do you have a control group? How to Randomize, Part I - 59 Multiple treatments

Treatment 1 Treatment 2 Treatment 3 Multiple treatments

Test different components of treatment in different combinations – Improved seeds only, improved seeds plus training, training only, no treatment Test whether components serve as substitutes or complements What is most cost-effective combination? Advantage: win-win for operations, can help answer questions for them, beyond simple “impact”! Cross-cutting treatments

Some villages are assigned full treatment – All households receive access to SMS-based price information Some villages are assigned partial treatment – 50% of households receive access to SMS-based price information Testing subsidies and prices – Vary the price of seeds, inputs or access to market information via mobile phone Varying levels of treatment

Randomization should, in principle, ensure balance in the treatment and control groups if the sample size is large enough What happens when it is small? Stratified randomization can help to ensure balance across groups when there is a small(er) sample – Divide the sample into different subgroups – Select treatment and control from each subgroup What happens if you don’t stratify? 63 Stratified Randomization

Stratify on variables that could have important impact on outcome variable (bit of a guess) Stratify on subgroups that you are particularly interested in (where may think impact of program may be different) Stratification more important when small data set Can get complex to stratify on too many variables Makes the draw less transparent the more you stratify You can also stratify on index variables you create 64 Stratified Randomization

Need sample frame Pull out of a hat/bucket/calabasse Use random number generator in spreadsheet program to order observations randomly Stata program code What if no existing list? 65 Mechanics of Randomization Source: Jenny Aker

Mechanics of Randomization