EEF Archive analysis overview

Slides:



Advertisements
Similar presentations
A Spreadsheet for Analysis of Straightforward Controlled Trials
Advertisements

Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo J-PAL.
Meta-analysis: summarising data for two arm trials and other simple outcome studies Steff Lewis statistician.
Building Evidence in Education: Workshop for EEF evaluators 2 nd June: York 6 th June: London
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
Overview of Meta-Analytic Data Analysis. Transformations Some effect size types are not analyzed in their “raw” form. Standardized Mean Difference Effect.
BCOR 1020 Business Statistics
2 Accuracy and Precision Accuracy How close a measurement is to the actual or “true value” high accuracy true value low accuracy true value 3.
School of Education Archive Analysis: On EEF Trials Adetayo Kasim, ZhiMin Xiao, and Steve Higgins.
Analysis of Variance. ANOVA Probably the most popular analysis in psychology Why? Ease of implementation Allows for analysis of several groups at once.
Mediator analysis within field trials Laura Stapleton UMBC.
Povertyactionlab.org Planning Sample Size for Randomized Evaluations Esther Duflo MIT and Poverty Action Lab.
Chapter 11: Estimation Estimation Defined Confidence Levels
Biostatistics: Measures of Central Tendency and Variance in Medical Laboratory Settings Module 5 1.
MAS2317 Presentation Jake McGlen. Assuming vague prior knowledge, obtain the posterior distribution for µ and hence construct a 95% Bayesian confidence.
Slide 1 Estimating Performance Below the National Level Applying Simulation Methods to TIMSS Fourth Annual IES Research Conference Dan Sherman, Ph.D. American.
Introduction Multilevel Analysis
One-with-Many Design: Estimation David A. Kenny June 22, 2013.
ANOVA (Analysis of Variance) by Aziza Munir
Data Analysis – Statistical Issues Bernd Genser, PhD Instituto de Saúde Coletiva, Universidade Federal da Bahia, Salvador
1st meeting: Multilevel modeling: introduction Subjects for today:  Basic statistics (testing)  The difference between regression analysis and multilevel.
Effect Size Calculation for Meta-Analysis Robert M. Bernard Centre for the Study of Learning and Performance Concordia University February 24, 2010 February.
Empirical Efficiency Maximization: Locally Efficient Covariate Adjustment in Randomized Experiments Daniel B. Rubin Joint work with Mark J. van der Laan.
RCTs and instrumental variables Anna Vignoles University of Cambridge.
2 Accuracy and Precision Accuracy How close a measurement is to the actual or “true value” high accuracy true value low accuracy true value 3.
- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.
Precision Gains from Publically Available School Proficiency Measures Compared to Study-Collected Test Scores in Education Cluster-Randomized Trials June.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
1 Sampling Distribution of Arithmetic Mean Dr. T. T. Kachwala.
Multilevel and multifrailty models. Overview  Multifrailty versus multilevel Only one cluster, two frailties in cluster e.g., prognostic index (PI) analysis,
Confidence Interval for a Single Proportion p-hat, not phat.
Indirect and mixed treatment comparisons Hannah Buckley Co-authors: Hannah Ainsworth, Clare Heaps, Catherine Hewitt, Laura Jefferson, Natasha Mitchell,
Matrix Models for Population Management & Conservation March 2014 Lecture 10 Uncertainty, Process Variance, and Retrospective Perturbation Analysis.
Anders Nielsen Technical University of Denmark, DTU-Aqua Mark Maunder Inter-American Tropical Tuna Commission An Introduction.
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 10: Correlational Research 1.
Approaches to Linking Process and Outcome Data in a Cross-Site Evaluation Laura Elwyn, Ph.D. Kristin Stainbrook, Ph.D. American Evaluation Association.
Methods of Presenting and Interpreting Information Class 9.
The Conditional Random-Effects Variance Component in Meta-regression
N. V. Hartvig - Setting Release Limits
Analysis for Designs with Assignment of Both Clusters and Individuals
Sample Size Determination
H676 Week 3 – Effect Sizes Additional week for coding?
Lurking inferential monsters
Ling Ning & Mayte Frias Senior Research Associates Neil Huefner
Challenges arising from the analysis of randomized trials in education
Intro to Single Paper Meta-Analyses
Gian Luca Di Tanna Vladislav Berdunov Richard Hooper
Statistical Models for the Analysis of Single-Case Intervention Data
Challenges of statistical analysis in surgical trials
Statistical Analysis Plan review
Partial Credit Scoring for Technology Enhanced Items
Power, Sample Size, & Effect Size:
2 independent Groups Graziano & Raulin (1997).
EVAAS Overview.
BA 275 Quantitative Business Methods
Bootstrap Confidence Intervals using Percentiles
Estimation of Sampling Errors, CV, Confidence Intervals
ESTIMATION
Narrowing the evaluation gap
Linear Hierarchical Modelling
Brahm Fleisch Research supported by the Zenex Foundation October 2017
Narrative Reviews Limitations: Subjectivity inherent:
Analysing educational trials: the Education Endowment Foundation Archive Steve Higgins, Adetayo Kasim, ZhiMin Xiao, with Nasima Akhter, Ewoud De Troyer,
Evaluating Impacts: An Overview of Quantitative Methods
Sampling and Power Slides by Jishnu Das.
Publication Bias in Systematic Reviews
CCSSO National Conference on Student Assessment June 21, 2010
Sample Sizes for IE Power Calculations.
Hypothesis Tests with Means of Samples
Presentation transcript:

EEF Archive analysis overview Steve Higgins and Adetayo Kasim With support from ZhiMin Xiao, Nasima Akhter and Ewoud De Troyer s.e.higgins@durham.ac.uk a.s.kasim@durham.ac.uk @profstig EEF Conference 1st June, 2017

RESEARCH QUESTIONS How consistent are the results from the different analytic approaches or Methods? What is the impact of non-compliance on effect size calculation? Are low or high performing children more or less likely to benefit from an intervention? What proportion of children benefit from an educational intervention?

STRATEGIES Different effect size calculation Multi-level models Cohen’s d, Hedges’ g and Glass’ Δ Outome type Post-test only, Gain score and ancova Fixed Effect Model Ordinary least squares (OLS) Adjusting for school effects Multi-level models Explore variances within, between, total Frequentist versus Bayesian approach Design specific Individual randomisation, cluster, multi-site Xiao Z., Kasim, A., Higgins, S.E. (2016) Same Difference? Understanding Variation in the Estimation of Effect Sizes from Educational Trials International Journal of Educational Research 77: 1-14 http://dx.doi.org/10.1016/j.ijer.2016.02.001 Xiao Z., Higgins, S.E. & Kasim, A., (under review) Lord’s paradox revisited (post/gain comparisons)

KEY FINDINGS 17 EEF projects Four analytic models Results converge in larger, unproblematic trials Point estimates and estimates of precision vary when trials are problematic Results tend to diverge if ICC ≥ 0.2 and there are few clusters/schools MLM total variance ‘most conservative’ model (wider Confidence Intervals) Bayesian with vague priors give similar results as frequentist methods but more precise (narrower Credible Intervals)

CHALLENGES FACED Identifying the same data used by evaluators The impact of the different randomization approach Matching study design and analysis Impact of the different randomization factors and other covariates

CONCLUSION Overview of other analyses are presented below: Variation Cumulative quantiles Complier average causal effect (CACE) Internal validity Permutation p-values Software R package ‘eefAnalytics’ https://cran.r-project.org/web/packages/eefAnalytics/eefAnalytics.pdf Local influence index SD change Bootstrap confidence intervals