Choosing the level of randomization

Slides:



Advertisements
Similar presentations
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation Muna Meky Impact Evaluation Cluster, AFTRL Slides by Paul J.
Advertisements

Designing an impact evaluation: Randomization, statistical power, and some more fun…
The World Bank Human Development Network Spanish Impact Evaluation Fund.
1 Research in Micro Finance: Big Questions and How to Answer Them.
Where do data come from and Why we don’t (always) trust statisticians.
PCS Champions The when “everything is fine” appointment.
How to randomize? povertyactionlab.org. Quick Review: Why Randomize Choosing the Sample Frame Choosing the Unit of Randomization Options How to Choose.
Presented by Malte Lierl (Yale University).  How do we measure program impact when random assignment is not possible ?  e.g. universal take-up  non-excludable.
Surveys and Questionnaires. How Many People Should I Ask? Ask a lot of people many short questions: Yes/No Likert Scale Ask a smaller number.
Povertyactionlab.org How to Randomize? Abhijit Banerjee Massachusetts Institute of Technology.
Benefits and limits of randomization 2.4. Tailoring the evaluation to the question Advantage: answer the specific question well – We design our evaluation.
The counterfactual logic for public policy evaluation Alberto Martini hard at first, natural later 1.
What could go wrong? Deon Filmer Development Research Group, The World Bank Evidence-Based Decision-Making in Education Workshop Africa Program for Education.
Who is the subject in a CRT? Rachel Glennerster Executive Director, J-PAL Department of Economics, MIT PRIM&R November 8, 2013.
1 Managing Threats to Randomization. Threat (1): Spillovers If people in the control group get treated, randomization is no more perfect Choose the appropriate.
What do we know about gender and agriculture in Africa? Markus Goldstein Michael O’Sullivan The World Bank Cross-Country Workshop for Impact Evaluations.
PAI786: Urban Policy Class 2: Evaluating Social Programs.
Experiments and Observational Studies.  A study at a high school in California compared academic performance of music students with that of non-music.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
IMPACTS OF A WORK-BASED POVERTY REDUCTION PROGRAM ON CHILDREN’S ACHIEVEMENT AND SOCIAL BEHAVIOR: THE NEW HOPE PROJECT Aletha C. Huston, Greg J. Duncan,
Cross-Country Workshop for Impact Evaluations in Agriculture and Community Driven Development Addis Ababa, April 13-16, 2009 AIM-CDD Using Randomized Evaluations.
1 Randomization in Practice. Unit of randomization Randomizing at the individual level Randomizing at the group level –School –Community / village –Health.
Where Do Data Come From? ● Conceptualization and operationalization of concepts --> measurement strategy --> data. ● Different strategies --> different.
Modifying Risk and Protective Factors. What can I do as a school-based health provider? In the context of an established relationship with a child/adolescent.
Measuring Impact: Experiments
AADAPT Workshop South Asia Goa, December 17-21, 2009 Nandini Krishnan 1.
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo MIT and Poverty Action Lab.
 Is there a comparison? ◦ Are the groups really comparable?  Are the differences being reported real? ◦ Are they worth reporting? ◦ How much confidence.
Rebecca Thornton University of Michigan and J-PAL.
Shawn Cole Harvard Business School Threats and Analysis.
Headlines What “sells” the news?. Appealing Headlines Simplify complex issues Certainties not possibilities Quick and easy solutions.
CAUSAL INFERENCE Shwetlena Sabarwal Africa Program for Education Impact Evaluation Accra, Ghana, May 2010.
Sampling Techniques 19 th and 20 th. Learning Outcomes Students should be able to design the source, the type and the technique of collecting data.
Why Use Randomized Evaluation? Isabel Beltran, World Bank.
Applying impact evaluation tools A hypothetical fertilizer project.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
What is randomization and how does it solve the causality problem? 2.3.
Africa Impact Evaluation Program on AIDS (AIM-AIDS) Cape Town, South Africa March 8 – 13, Steps in Implementing an Impact Evaluation Nandini Krishnan.
TRANSLATING RESEARCH INTO ACTION What and How to randomize? July 9, 2011 Dhaka Raymond Guiteras, Assistant Professor University of Maryland povertyactionlab.org.
CHOOSING THE LEVEL OF RANDOMIZATION. Unit of Randomization: Individual?
Observations vs. Experiments Target Goals: I can distinguish between an observational study and an experiment. I can explain how a lurking variable in.
What do we know about gender and private sector development in Africa? Markus Goldstein Alaka Holla The World Bank.
Africa Program for Education Impact Evaluation Dakar, Senegal December 15-19, 2008 Experimental Methods Muna Meky Economist Africa Impact Evaluation Initiative.
Chapter 6 Conducting & Reading Research Baumgartner et al Chapter 6 Selection of Research Participants: Sampling Procedures.
IMPACT EVALUATION WORKSHOP ISTANBUL, TURKEY MAY
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Impact Evaluations in Good Times and Bad Forum Kajian Pembangunan March 22, 2011 Firman Witoelar, DECHD, Discussant.
Social Experimentation & Randomized Evaluations Hélène Giacobino Director J-PAL Europe DG EMPLOI, Brussells,Nov 2011 World Bank Bratislawa December 2011.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
Impact Evaluation for Evidence-Based Policy Making Arianna Legovini Lead Specialist Africa Impact Evaluation Initiative.
Perceptions of successful ageing by senior adults in Hong Kong: Implications for lifelong learning Panel Discussion at Book Launch 28 February 2012 Dr.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
Raising Awareness Hunger & Obesity By: Carshella, John, & Jeremy.
Class 7 – Randomization* *(how to design an RCT that accommodates real world constraints)
26 Nov 2009 Dar es Salaam 1 Selecting the cohort David Coulter.
Kenya Evidence Forum - June 14, 2016 Using Evidence to Improve Policy and Program Designs How do we interpret “evidence”? Aidan Coville, Economist, World.
Using Randomized Evaluations to Improve Policy
Portland State University
Measuring Results and Impact Evaluation: From Promises into Evidence
Section 5.1 Designing Samples
CHAPTER 4 Designing Studies
How to Randomize.
Using Randomized Evaluations to Improve Policy
Randomization This presentation draws on previous presentations by Muna Meky, Arianna Legovini, Jed Friedman, David Evans and Sebastian Martinez.
Prices How do prices help determine WHAT, HOW, and FOR WHOM to produce? What factors affect prices?
Randomization This presentation draws on previous presentations by Muna Meky, Arianna Legovini, Jed Friedman, David Evans and Sebastian Martinez.
Explanation of slide: Logos, to show while the audience arrive.
Sample Sizes for IE Power Calculations.
TTL: Nyambura Githagui
Presentation transcript:

Choosing the level of randomization Module 4.2

Unit of Randomization: Individual?

Unit of Randomization: Individual?

Unit of Randomization: Cluster?

Unit of Randomization: Which Cluster?

Unit of Randomization: Class?

Unit of Randomization: Class?

Unit of Randomization: Class?

Unit of Randomization: School?

Steps in individual level randomization

Steps in individual level randomization

Steps in group level randomization

Steps in group level randomization

Considerations for level of randomization Unit of measurement Feasibility and fairness Spillovers Attrition Compliance Statistical power Clustering on the ground

Measurement and unit of randomization Randomization cannot take place at a lower level than our outcome is measured Outcome is quality of school building, cant randomize at level of class, teacher, or child Outcome is election of MP from a minority group, cant randomize at level lower than MP constituency Note if our outcome is self reported voting for a minority candidate then can randomize individually

Feasibility and unit of randomization Usually randomization is at the level at which the program is implemented Community health program is randomized by community School improvement program is randomized by school Randomizing below this level means changing how program implemented Eg nutritional supplement program normally done by school but randomized by individual has to keep track of which children are treatment, adding to cost of program

Political feasibility and perceived fairness Randomizing at a higher level can make randomization be perceived as fair Important to understand what is seen as fair in a given context It may seem fair that some schools are selected for a pilot program, if schools have a fair (random) chance of being selected for the pilot Providing some individual students with a program and excluding others within a school may be seen as less fair

Spillovers When a program changes outcomes for those not directly participating we call this impacts “spillovers” Physical spillovers a farmer draws water from an aquifer and there is less for others (negative spillover) Behavioral spillovers if some children go to school more this may encourage others to go (positive spillover) Informational if one mother learns the benefits of breastfeeding she may tell her friends (positive spillover) Market or general equilibrium If some firms reduce cost of production and price this may affect the price other firms can charge for their product (negative spillover)

Spillovers and randomization unit Spillovers mean outcomes for the comparison groups may be impacted by the program The comparison group is no longer a good counterfactual Q: if positive benefits of the program spillover to the comparison group, will this lead to an under or over estimate of program impact?

Positive spillover bias measured impact true impact bias True impact = 3 months Measured impact = 2 months Bias = Measured impact – True impact = - 1 month

Negative spillover bias measured impact true impact bias True impact = 3 months Measured impact = 2 months Bias = Measured impact – True impact = - 1 month

Containing spillovers within treatment group Child given free meal at school will share with friends treatment comparison share with friends Spillovers (would have been comparison but are influenced by treatment)

Containing spillovers within treatment group Containing spillovers by randomizing at a higher level Child given free meal at school will share with friends Treatment child has no need to share with friends Comparison child has no lunch to share with friends Randomizing at level of school solves the problem treatment comparison friends

Creating a buffer to absorb spillovers We can randomize at the individual level if our sample individuals are far enough apart to avoid spillovers Farmers given new variety of rice, sample farmers live far from each other. Live on their farms so no obvious “community” to randomize by Not sampled

A design to measure spillovers We can randomize at a level where some of the comparison experience spillovers and some don’t In our farmer example, farmers near treatment farmers are spillover farmers (purple), and can be compared to those near comparison farmers (green) The spillover effect is the difference in outcomes between these two groups (purple vs green farmers)

Example: deworming If a child is dewormed they are less likely to infect children in the neighborhood While most of the spillovers would occur within schools, but some children lived near those who went to neighboring schools Miguel and Kremer (2004) randomized at the school level but also measured outcomes in near by schools

Attrition and unit of randomization Attrition is when we cannot collect data on all our sample Usually attrition occurs when people move or are not at home when enumerators call If people feel a study is unfair they can refuse to cooperate, leading to attrition Sometimes providing benefits to some but not their neighbors is seen as unfair and can cause attrition It is important that randomization is seen as fair for many reasons, reducing attrition is one of them Randomizing at a higher level can help

Compliance and unit of randomization Full compliance is when all those in the treatment group get the program and all those in the comparison don’t get it Low level randomization may lead to sharing which undermines compliance Example: iron supplements in Indonesia Iron supplements aimed at older people Only providing supplements to some in the family might lead to sharing with others, diluting effect Providing supplements to everyone in the family increased compliance Source: Thomas et al, 2006.

Compliance and unit of randomization Simple to implement designs achieve higher compliance Avoid having one person implement two versions of a program Example: adding health insurance to micro credit One microcredit officer serves 10 groups in an area If randomize some groups to be offered health insurance and some not, credit officer will need to keep straight who can apply for insurance and who cant May need to randomize at the credit officer level

Power and unit of randomization Statistical power is the ability to detect an effect of a given size Individual randomization gives more power than group randomization (for a given sample size) We get more information from 20 children individually randomized than from 20 children in two schools (one treatment and one comparison) We will return to this subject in much more detail Many factors push as to randomize at higher levels, power pushes us to randomize lower

Clustering on the ground Clusters by which we randomize must make sense on the ground Administrative boundaries do not always correspond to how people interact with each other People may cluster and interact in smaller groups than administrative neighborhoods or villages We can randomize by these natural neighborhoods and have more power than if we randomized by administrative neighborhood Or administrative neighborhoods may blend into each other with little real difference evident on the ground Risk that implementers may not know who is in treatment and who in comparison

Example: hamlets vs. villages in India Rural areas in India are divided into villages or panchayats, but in much of India a lot of social interaction happens within a hamlet Some interventions can be randomized by hamlet Hamlets Villages

Example: farmers in Western Kenya In some parts of Kenya people live on their farms and randomizing by administrative “village” may make little sense Some researchers have randomized farming programs by which school households send their children to as this may represent a more natural locus of interaction (Duflo, Kremer, Robinson, 2007) Villages Schools School related clusters