Propensity Score Matching

Slides:



Advertisements
Similar presentations
Promising pathways Understanding and predicting preparation and success and its potential as a model for change for the California Community Colleges and.
Advertisements

BLENDING PROPENSITY SCORE MATCHING AND LOGISTIC REGRESSION IN SUPPORT SERVICE EVALUATIONS TERRENCE WILLETT, CRAIG HAYWARD, AND NATHAN PELLEGRIN CAIR CONFERENCE.
KIPP: Effectiveness and Innovation in Publicly-Funded, Privately-Operated Schools October 4, 2012 Presentation to the APPAM/INVALSI Improving Education.
BENJAMIN GAMBOA, RESEARCH ANALYST CRAFTON HILLS COLLEGE RESEARCHING: ALPHA TO ZETA.
Correlation AND EXPERIMENTAL DESIGN
Logistic Regression and Odds Ratios
The Creation of Two Comparison Groups to Facilitate Evidence-Based Decision Making Prepared and Presented by Keith Wurtz Dean, Institutional Effectiveness,
Promise Pathways: Cohort Board of Trustees Retreat July 24, 2012 The Utility of High School Achievement Data for Placement in Foundational Skills.
PSYC512: Research Methods PSYC512: Research Methods Lecture 19 Brian P. Dyre University of Idaho.
Practical Meta-Analysis -- D. B. Wilson
Today Concepts underlying inferential statistics
Agenda: Block Watch: Random Assignment, Outcomes, and indicators Issues in Impact and Random Assignment: Youth Transition Demonstration –Who is randomized?
Stat 112: Lecture 9 Notes Homework 3: Due next Thursday
McGraw-Hill © 2006 The McGraw-Hill Companies, Inc. All rights reserved. Experimental Research Chapter Thirteen.
Experimental Research
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Chapter 8 Experimental Research
Wednesday PM  Presentation of AM results  Multiple linear regression Simultaneous Simultaneous Stepwise Stepwise Hierarchical Hierarchical  Logistic.
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Harnessing the Power of Alternative Assessment to Advance the Completion Agenda Eloy Ortiz Oakley Superintendent-President, Long Beach City College Terrence.
Evaluating the California Acceleration Project Equity implications of increasing throughput via curricular redesign Craig Hayward Director of Research,
Evaluating the California Acceleration Project Equity implications of increasing throughput via curricular redesign Craig Hayward Director of Research,
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Analysis of Covariance, ANCOVA (GLM2)
Multiple Approaches to Multiple Measures: Four Three Approaches to Improving Student Achievement Through More Holistic, Evidence-Based Multiple Measures.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Chapter 1: Introduction to Statistics
URBDP 591 A Lecture 8: Experimental and Quasi-Experimental Design Objectives Basic Design Elements Experimental Designs Comparing Experimental Design Example.
Correlational Research Chapter Fifteen Bring Schraw et al.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Intro: “BASIC” STATS CPSY 501 Advanced stats requires successful completion of a first course in psych stats (a grade of C+ or above) as a prerequisite.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
A Randomized Experiment Comparing Random to Nonrandom Assignment William R Shadish University of California, Merced and M.H. Clark Southern Illinois University,
Chapter 3.1.  Observational Study: involves passive data collection (observe, record or measure but don’t interfere)  Experiment: ~Involves active data.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Political Science 30: Political Inquiry. Linear Regression II: Making Sense of Regression Results Interpreting SPSS regression output Coefficients for.
Stat 112 Notes 9 Today: –Multicollinearity (Chapter 4.6) –Multiple regression and causal inference.
Sierra College ERIK COOPER. A Brief History  The RP Conference 2013  STEPS Project  Sierra College  Paul Neal & Craig Kelly  Working with the English.
What Use Are International Assessments for States? 30 May 2008 Jack Buckley Deputy Commissioner National Center for Education Statistics Institute of Education.
Background In , City Year Greater Philadelphia (CYGP) provided academic and behavioral supports for students in grades 6-9. Supports include one-on-one.
Evaluating Risk Adjustment Models Andy Bindman MD Department of Medicine, Epidemiology and Biostatistics.
ACT College-Readiness Measures: Explore, Plan and the ACT Hal L. Sanderson, Ph.D. Director of Institutional Research & Assessment Evidence-Based Learning.
Release of Preliminary Value-Added Data Webinar August 13, 2012 Florida Department of Education.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Multiple Measures Project  California Community Colleges are required to use multiple measures Title 5 § 55502(i) clearly mandates that California community.
1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.
Logistic Regression Analysis Gerrit Rooks
Logistic Regression An Introduction. Uses Designed for survival analysis- binary response For predicting a chance, probability, proportion or percentage.
Nonparametric Statistics
RESULTS OF THE 2009 ADMINISTRATION OF THE COMMUNITYCOLLEGE SURVEY OF STUDENT ENGAGEMENT Office of Institutional Effectiveness, September 2009.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Estimating Effects and Inferring Implications 2016 Academic Academy Sacramento, CA Terrence Willett Senior Consulting Researcher RP Group.
Patricia Gonzalez, OSEP June 14, The purpose of annual performance reporting is to demonstrate that IDEA funds are being used to improve or benefit.
BUS 308 Entire Course (Ash Course) For more course tutorials visit BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308.
BUS 308 Entire Course (Ash Course) FOR MORE CLASSES VISIT BUS 308 Week 1 Assignment Problems 1.2, 1.17, 3.3 & 3.22 BUS 308 Week 1.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
STUDENT DIVERSITY AND HOW IT RELATES TO STUDENT SUCCESS Dr. Michael Conyette.
Classroom Network Technology as a Support for Systemic Mathematics Reform: Examining the Effects of Texas Instruments’ MathForward Program on Student Achievement.
Methods of Presenting and Interpreting Information Class 9.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Nonparametric Statistics
Constructing Propensity score weighted and matched Samples Stacey L
Propensity Score Matching Makes Program Evaluation Easy
Nonparametric Statistics
Access Center Assessment Report
An Introduction to Correlational Research
Within-Subjects Experiments
Nazmus Saquib, PhD Head of Research Sulaiman AlRajhi Colleges
Presentation transcript:

Propensity Score Matching A Practical Demonstration Looking at Results from the Promise Pathways Initiative at Long Beach City College Andrew Fuenmayor, Research Analyst John Hetts, Director of Institutional Research Long Beach City College Office of Institutional Effectiveness

The Long Beach College Promise: Extending the promise of a college education to every student in the Long Beach Unified School District.

LBCC’s Promise Pathways: Background First year experience program for students matriculating directly from high school Alternative assessment using multiple measures Prescriptive scheduling emphasizing full-time enrollment and early completion of basic skills courses Priority registration Achievement coaches and other pilot experiments

How to evaluate? How to communicate? As with many interventions or programs in education, students who receive a treatment are non-random self-selection to participate (e.g., honors programs, tutoring) including election not to participate in required interventions selection by risk factor (e.g., summer bridge programs, EOPS programs) selection by performance (e.g., probation workshops) Must reduce potential bias of confounding variables particularly variables potentially correlated with key outcomes. Multiple regression, ANOVA and ANCOVA, Propensity Score Matching are can help isolate the effect of the treatment itself. Each has advantages and disadvantages.

Advantages of Logistic Regression Allows researcher to observe individual relationships between covariates and outcomes. Can more easily test for interactions between treatments and other student characteristics. Allows researcher to set up a hypothetical model of relationships useful in predictive analysis. But…

SPSS Logistic Regression Output: People respond to this …

With This

Advantages of Propensity Score Matching Propensity Score Matching is conceptually easier for a non-technical audience. Regression or Analysis of Covariance forces the audience to conceptualize hypothetical students who might or might not experience an intervention sometimes at locations in variable distributions where there are no students PSM allows comparison of real students to real students. And in Monte Carlo simulations, PSM also tends to: exhibit higher empirical power than regression especially when there are fewer observations per confounding variable perform better when the relationship between the confounding variable and the treatment are higher Additionally Propensity Score Matching has the stamp has been named something of the “technique of choice” by the state - ??? Overall, the propensity score exhibited more empirical power than logistic regression.

Propensity to what? Be in the Treatment Group. Calculates a students likelihood of being in the treatment group. Matching algorithm pairs each student in treatment group with a student not in the treatment group but whose other variables indicate a high likelihood of being in the treatment group. Outputs two groups of equal sizes of students with very similar propensity of being in treatment group

PSM in SPSS (with R plugin)

Start with an Unduplicated Dataset The practical demonstration will go through the process of running a propensity score on our actual promise pathways data. Estimated time 10-15 minutes Overview of dataset. How it needs to be formatted (ie unduplicated at the student level). The kinds of variables we have our file and why they might be useful. The necessary outcome variables PSM guts what exactly spss is doing as detailed in the output. A look at the matched data set. Descriptive examination of the matched data set. How well did the match work? Are there still obvious differences between the two treatment group and the matched group. Especially on variables that correlate with your outcome variables. Options with a matched data set. Simple comparisons between groups now become much more useful. Simple t-tests. Can say “the difference between the treatment groups performance on measure X vs the match group was statistically significant etc.

Propensity Score Matching Wizard The propensity score matching wizard in SPSS. New version uses the built in r-module to run a propensity score matching algorithm. Show in detail each of the features of the wizard explain how they works. Group indicator, Predictors, Ps variable name, Match tolerance, Case id, Match id variable, Output dataset name, Overview of options

Propensity Score Matching Wizard

Propensity Score Matching Wizard The propensity score matching wizard in SPSS. New version uses the built in r-module to run a propensity score matching algorithm. Show in detail each of the features of the wizard explain how they works. Group indicator, Predictors, Ps variable name, Match tolerance, Case id, Match id variable, Output dataset name, Overview of options

Propensity Score Matching Wizard

Execution of PSM: Behind the Scenes

Execution of PSM: Behind the Scenes

How Well Did it Work?

How Well Did it Work?

Outcome Metrics Transfer Level English and Math Completion Completion of basic skills is an important indicator of long term likelihood of transfer Intent to Complete and Behavioral Intent to Transfer Key metrics used by chancellors office Complete 30 Units and Attempt 48 Units Indicators of mid term student successfulness. Students are staying with the institution and completing units.

Logistic Regression Output – Transfer Math Success

Logistic Regression Output – Transfer English Success

Converting Logged Odds Ratios into Effect Sizes Effect size is dependent value of other covariates . Because Logistic Regression uses the logit link function a given logged odds ratio corresponds to a changing relative percentage likelihood. We must arbitrarily construct one or many hypothetical students and calculate the effect size for each using the link function:

Converting Logged Odds Ratios into Effect Sizes For a student with average high school grades and CST scores: English: B of 1.372  33 Percentage Points Math: B of 0.824  16 Percentage Points How do these figures compare to PSM?

Completion of Transfer-Level Math and English

Predictive Early Momentum Points

Statistical Significance Can be Tested with a Simple T-Test Outcome T-Stat Significance Transfer Math Success 3.693 .000 Transfer English Success 11.628 Achieve Intent to Complete 2.553 .011 Achieve Behavioral Intent to Transfer 8.352

Response from Faculty and Administration

Ease of Presentation Telling this story is much simpler than telling the same story using ANOVA or Regression. The figures in these tables are real differences as opposed to estimated relationships. While the estimated relationship has significant meaning to the researcher explaining it often bogs down the discussion.

Contact information Andrew Fuenmayor: afuenmayor@lbcc.edu John Hetts: jhetts@edresults.org