Missing Data Issues in RCTs: What to Do When Data Are Missing? Analytic and Technical Support for Advancing Education Evaluations REL Directors Meeting.

Slides:



Advertisements
Similar presentations
Value Added in CPS. What is value added? A measure of the contribution of schooling to student performance Uses statistical techniques to isolate the.
Advertisements

Random Assignment Experiments
Brian A. Harris-Kojetin, Ph.D. Statistical and Science Policy
Systematic Review of Literature Part XIX Analyzing and Presenting Results.
Designs to Estimate Impacts of MSP Projects with Confidence. Ellen Bobronnikov March 29, 2010.
CONCEPTS UNDERLYING STUDY DESIGN
Experimental Research Designs
19.Multivariate Analysis Using NLTS2 Data. 1 Prerequisites Recommended modules to complete before viewing this module  1. Introduction to the NLTS2 Training.
WHY LARGE-SCALE RANDOMIZED CONTROL TRIALS? David Myers Senior Vice President IES 2006 Research Conference David Myers Senior Vice President IES 2006 Research.
Validity, Sampling & Experimental Control Psych 231: Research Methods in Psychology.
Who are the participants? Creating a Quality Sample 47:269: Research Methods I Dr. Leonard March 22, 2010.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Technology’s Edge: The Educational Benefits of Computer-Aided Instruction Lisa Barrow Federal Reserve Bank of Chicago Lisa Markman Princeton University.
Treatment Effects: What works for Whom? Spyros Konstantopoulos Michigan State University.
How to deal with missing data: INTRODUCTION
Agenda: Block Watch: Random Assignment, Outcomes, and indicators Issues in Impact and Random Assignment: Youth Transition Demonstration –Who is randomized?
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
Chapter 4 Selecting a Sample Gay, Mills, and Airasian
Impact Evaluation Session VII Sampling and Power Jishnu Das November 2006.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Chapter 5 Copyright © Allyn & Bacon 2008 This multimedia product and its contents are protected under copyright law. The following are prohibited by law:
Experimental Research
Multiple Linear Regression A method for analyzing the effects of several predictor variables concurrently. - Simultaneously - Stepwise Minimizing the squared.
The World Bank Human Development Network Spanish Impact Evaluation Fund.
Experimental Design The Gold Standard?.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Dr. Engr. Sami ur Rahman Assistant Professor Department of Computer Science University of Malakand Research Methods in Computer Science Lecture: Research.
Determining Sample Size
Overview of MSP Evaluation Rubric Gary Silverstein, Westat MSP Regional Conference San Francisco, February 13-15, 2008.
RESEARCH A systematic quest for undiscovered truth A way of thinking
Sampling Techniques LEARNING OBJECTIVES : After studying this module, participants will be able to : 1. Identify and define the population to be studied.
Research Strategies, Part 2
Chapter 5 Selecting a Sample Gay, Mills, and Airasian 10th Edition
Module 4 Notes Research Methods. Let’s Discuss! Why is Research Important?
Assisting GPRA Report for MSP Xiaodong Zhang, Westat MSP Regional Conference Miami, January 7-9, 2008.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
Rigorous Quasi-Experimental Evaluations: Design Considerations Sung-Woo Cho, Ph.D. June 11, 2015 Success from the Start: Round 4 Convening US Department.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
CHAPTER 12 Descriptive, Program Evaluation, and Advanced Methods.
MSRP Year 1 (Preliminary) Impact Research for Better Schools RMC Corporation.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Evaluating Impacts of MSP Grants Hilary Rhodes, PhD Ellen Bobronnikov February 22, 2010 Common Issues and Recommendations.
DATA PREPARATION: PROCESSING & MANAGEMENT Lu Ann Aday, Ph.D. The University of Texas School of Public Health.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Research Strategies. Why is Research Important? Answer in complete sentences in your bell work spiral. Discuss the consequences of good or poor research.
Evaluating Impacts of MSP Grants Ellen Bobronnikov Hilary Rhodes January 11, 2010 Common Issues and Recommendations.
McMillan Educational Research: Fundamentals for the Consumer, 6e © 2012 Pearson Education, Inc. All rights reserved. Educational Research: Fundamentals.
Evaluating Impacts of MSP Grants Ellen Bobronnikov January 6, 2009 Common Issues and Potential Solutions.
Evaluation Requirements for MSP and Characteristics of Designs to Estimate Impacts with Confidence Ellen Bobronnikov February 16, 2011.
Evaluation Research Dr. Guerette. Introduction Evaluation Research – Evaluation Research – The purpose is to evaluate the impact of policies The purpose.
Design of Clinical Research Studies ASAP Session by: Robert McCarter, ScD Dir. Biostatistics and Informatics, CNMC
Developing an evaluation of professional development Webinar #2: Going deeper into planning the design 1.
Education Intervention in the Clinical Setting for Inappropriate Use of Antibiotics in Children Katie Butterfield.
Effectiveness of Selected Supplemental Reading Comprehension Interventions: Impacts on a First Cohort of Fifth-Grade Students June 8, 2009 IES Annual Research.
Outcomes Evaluation A good evaluation is …. –Useful to its audience –practical to implement –conducted ethically –technically accurate.
 Describe the defining characteristics of quantitative research studies.  List and describe the basic steps in conducting quantitative research studies.
Common Pitfalls in Randomized Evaluations Jenny C. Aker Tufts University.
1 Study Design Imre Janszky Faculty of Medicine, ISM NTNU.
Research design By Dr.Ali Almesrawi asst. professor Ph.D.
Patricia Gonzalez, OSEP June 14, The purpose of annual performance reporting is to demonstrate that IDEA funds are being used to improve or benefit.
TRANSLATING RESEARCH INTO ACTION Randomized Evaluation Start-to-finish Abdul Latif Jameel Poverty Action Lab povertyactionlab.org.
IMPACT EVALUATION PBAF 526 Class 5, October 31, 2011.
Analysis for Designs with Assignment of Both Clusters and Individuals
Introduction to Survey Data Analysis
Geoscience Projects and The Online Evaluation Resource Library (OERL)
Explanation of slide: Logos, to show while the audience arrive.
Rerandomization to Improve Baseline Balance in Educational Experiments
Presentation transcript:

Missing Data Issues in RCTs: What to Do When Data Are Missing? Analytic and Technical Support for Advancing Education Evaluations REL Directors Meeting February 8, 2008

2 What Are We Doing? Purpose: To provide guidance on how to deal with missing baseline and longitudinal follow-up data (including attrition from data collection) Audience: – Researchers measuring the impact of an educational intervention on student achievement using a randomized control trial (RCT) design – Of particular benefit to study teams just entering the impact analysis phase of their projects

3 Scope of the Monograph Restricted to Missing Data Issues in RCTs: – Missing data on outcome variables, control variables, and subgroup variables – Excludes quasi-experimental studies, does not address attrition from the intervention Focus on Analysis Strategies—We take the design and study sample as already fixed Provide Practical Guidance— Intent is to provide solutions that RELs can actually implement (not cutting edge or overly costly methods)

4 Today’s Meeting Status of the Monograph: – We are just writing the first draft, due March 1 st – No solutions today – just an informal discussion What do we want to accomplish today? – Let you know where the paper is heading – Get your feedback What’s not here? What help do you need?

5 General Approach to ITT Impact Estimation in the REL Studies The RCTs are multi-site cluster randomized controlled trials with random assignment at the classroom or school level Given this design, RELs are using 2- or 3-level HLM models to estimate impacts, e.g., with 3 levels, students at level 1, classrooms at level 2, and schools at level 3 Covariates are usually included at the student level and sometimes at the class or school level

6 Data Used in RCTs Conducted by the RELs Outcome Measures: – Most common outcomes are student test score measures – But other outcomes include student attitudes, teacher practice, and teacher knowledge Baseline Measures: – Most RCTs are collecting baseline data to define subgroups and improve the precision of the impact estimates – These data include pre-intervention test scores and demographic characteristics, e.g., age, gender, race, ethnicity, Limited English Proficiency, and eligibility for Free or Reduced Price Lunches

7 Most Common Types of Data Collection for Creating Outcome Measures 1. Student tests conducted for the study (n = 20) 2. Teacher surveys (n = 16) 3. Student tests required by the state (n = 11) 4. Student surveys (n = 6) 5. Teacher tests (n = 5)

8 Missing Data and Measurement Issues Missing Data (focus of this presentation): – Studies are likely to encounter missing values for one or more outcome variables, subgroup variables, or control variables Other Measurement Issues (included in paper): – Baseline measures collected after random assignment – Outcomes measures collected over time and perhaps on average later for one experimental group than the other – Others? Pooling state test scores across states?

9 Student Tests for the Study Reasons for Missing Data:  The student’s parent did not give consent  The student failed to attend school on testing day  The student transferred to another school  The student’s classroom was unavailable for testing (fire drill)  Student refused to do the test Missing data rates should be low if tests are administered in the usual classroom setting (most will be)

10 Student Tests Required by the State Reasons for Missing Data:  The student was absent and there was no follow-up testing  The student was exempt from taking the state test  The student’s school failed to provide test score data  The student transferred to another school Missing data rates should be low since data can be collected at district or state level, and only a small fraction of the sample will transfer outside of the district

11 Student Surveys Reasons for Missing Data:  The student’s parent did not give consent  The student’s teacher failed to administer the survey  The student did not attend school on the day of the survey  The student chose not to complete the survey  The student transferred to another school Missing data rates should be low if the survey is administered in school, could be high if the survey is administered outside of school

12 Teacher Surveys Reasons for Missing Data:  The teacher refused to complete the survey  The teacher was on temporary leave  The teacher left the school and never received the survey Missing data rates can be high because teachers are busy and may have little incentive to complete the survey (unless required by the school or district)

13 Teacher Tests Some REL RCTs are testing teachers on their knowledge using tests conducted online or in school Reasons for Missing Data:  The teacher refused to complete the test  The teacher was on temporary leave  The teacher left the school and never received the test Missing data rates can be high because teachers may find such test offensive or onerous (unless required by the school or district)

14 Why Worry About Missing Data? Standard software typically drop the cases with missing variables, but this can lead to biased impact estimates To avoid this problem, researchers may choose to drop the variables for which some cases are missing, but this can have negative consequences

15 Dropping Cases with Missing Data Non-Response Bias—Dropping cases with missing data can lead to “non-response” bias if there is a relationship between the outcome and “missingness” – e.g., if student achievement is lower for students exempted from the state test) Biased Impact Estimates Bias—Dropping cases with missing data can lead to biased impact estimates if the rate of missing data or the mechanism behind the “missingness” differs between treatment and control (give example)

16 Dropping Variables from the Analysis Dropping either outcome variables (dependent variables) or subgroup variables because the data are missing for some cases is like throwing the baby out with the bath water! Dropping control variables (independent variables) because data are missing will reduce the precision of the impact estimates (since controls are included to increase precision)

17 Assessing the Size of the Problem Learn why certain data are missing—In some cases this may shed light on whether non-response bias is likely to be severe Compare respondents to nonrespondents using data available for both—For student outcomes, it is especially important to compare the two groups in their pre-intervention test scores

18 Addressing the Problem: The Toolbox All these methods rely on the information that we do have for the sample, but they vary in their assumptions and technical approach Some methods work only for baseline covariates, some work only for outcome measures, and some work for both

19 Different Methods for Baseline Covariates and Outcome Variables Baseline Covariates (Control Variables) Only: – Dummy variable indicators for cases with missing data Outcome Variables Only: – Weighting methods—Re-weight respondents to better represent the population of interest (e.g., weight “up” groups with high rates of missing data) – Bounding the impacts—Make assumptions about nonrespondents that maximize and minimize the estimated impacts (e.g., make best and worst case assumptions for the true values of the outcome when data are missing)

20 Methods for Both Types of Variables Imputation-based Procedures: – Mean value imputation—Replace missing values with the mean for the variable – Regression imputation—Replace missing values with a predicted value from regression model – Stochastic regression imputation—Adds residual to predicted value to maintain correct variance. Stochastic regression imputation can be implemented as a single imputation or a multiple imputation Model-based Procedures—This is a broad class of procedures that includes maximum likelihood based methods, the EM algorithm, and pattern-mixture models (we are still investigating)

21 How Do We Plan To Recommend Strategies? Our current thinking is to assess alternatives on the basis of four criteria: 1. Accessibility—Is this “tool” something that be done with standard software? 2. Bias reduction—How effective is the method likely to be in addressing non-response bias? 3. Correct inference—Will the tool produce standard errors that are at least “not too biased”? 4. Power—Does this method generate estimates that are reasonable precise (relative to alternative options)?

22 Discussion Time for your feedback: – What’s not here? – What help do you need?