1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.

Slides:



Advertisements
Similar presentations
Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Sep. 16, 2005 Lilly Yue, Ph.D.* CDRH, FDA, Rockville MD * No official support.
Advertisements

The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
Statistical vs. Practical Significance
1-Way Analysis of Variance
A workshop introducing doubly robust estimation of treatment effects
Introduction to Propensity Score Weighting Weimiao Fan 10/10/
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Analysis of variance (ANOVA)-the General Linear Model (GLM)
3.2 OLS Fitted Values and Residuals -after obtaining OLS estimates, we can then obtain fitted or predicted values for y: -given our actual and predicted.
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
Method Reading Group September 22, 2008 Matching.
More on ANOVA. Overview ANOVA as Regression Comparison Methods.
Understanding Variables Emily H. Wughalter, Ed.D. Professor, Department of Kinesiology Spring 2010.
Clustered or Multilevel Data
Review: The Logic Underlying ANOVA The possible pair-wise comparisons: X 11 X 12. X 1n X 21 X 22. X 2n Sample 1Sample 2 means: X 31 X 32. X 3n Sample 3.
What Is Multivariate Analysis of Variance (MANOVA)?
Analysis of Variance & Multivariate Analysis of Variance
1 A MONTE CARLO EXPERIMENT In the previous slideshow, we saw that the error term is responsible for the variations of b 2 around its fixed component 
Chapter 14 Inferential Data Analysis
1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Advanced Statistics for Interventional Cardiologists.
Propensity Score Matching
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Repeated Measures Designs. In a Repeated Measures Design We have experimental units that may be grouped according to one or several factors (the grouping.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
1 Multivariate Analysis (Source: W.G Zikmund, B.J Babin, J.C Carr and M. Griffin, Business Research Methods, 8th Edition, U.S, South-Western Cengage Learning,
Statistical Power 1. First: Effect Size The size of the distance between two means in standardized units (not inferential). A measure of the impact of.
Educational Research: Competencies for Analysis and Application, 9 th edition. Gay, Mills, & Airasian © 2009 Pearson Education, Inc. All rights reserved.
Psychology 301 Chapters & Differences Between Two Means Introduction to Analysis of Variance Multiple Comparisons.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
Assumptions of value-added models for estimating school effects sean f reardon stephen w raudenbush april, 2008.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Discriminant Analysis Discriminant analysis is a technique for analyzing data when the criterion or dependent variable is categorical and the predictor.
A Randomized Experiment Comparing Random to Nonrandom Assignment William R Shadish University of California, Merced and M.H. Clark Southern Illinois University,
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Analysis of Covariance Combines linear regression and ANOVA Can be used to compare g treatments, after controlling for quantitative factor believed to.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
General Linear Model.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Considering model structure of covariates to estimate propensity scores Qiu Wang.
Todd Wagner, PhD February 2011 Propensity Scores.
Rerandomization to Improve Covariate Balance in Randomized Experiments Kari Lock Harvard Statistics Advisor: Don Rubin 4/28/11.
Ten things about Experimental Design AP Statistics, Second Semester Review.
Propensity Score Matching in SPSS: How to turn an Audit into a RCT
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Chapter 9 Sampling Distributions 9.1 Sampling Distributions.
CWR 6536 Stochastic Subsurface Hydrology Optimal Estimation of Hydrologic Parameters.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Analysis for Designs with Assignment of Both Clusters and Individuals
Multiple Imputation using SOLAS for Missing Data Analysis
Analysis of Covariance (ANCOVA)
Applied Statistical Analysis
Combining Effect Sizes
کارگاه حجم نمونه با نرم افزار G*Power
Treatment effect: Part 2
12 Inferential Analysis.
Chapter 9: Differences among Groups
The European Statistical Training Programme (ESTP)
Comparisons Among Treatments/Groups
Chapter: 9: Propensity scores
Factor Analysis.
Presentation transcript:

1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010

2 The propensity score (1) The propensity score is “…the conditional probability of assignment to a particular treatment given a vector of observed covariates.” (Rosenbaum en Rubin, 1983: 41). Used in non-randomized studies to control for selection bias Balance observed pretreatment variables among patient Find an estimate of the average treatment effect But, treatment effect can be different within subgroups

3 The propensity score (2) Univariate propensity scoreMultivariate propensity score Bartak and colleagues (2009)Spreeuwenberg and colleagues (2010) Used for 2 treatment categoriesUsed for > 2 treatment categories Propensity score used in: Matching Stratification Regression Inverse probability weight Combinations of …

Methods in this study To find a treatment effect within subgroups, if the propensity score is applied: Method 1: Regression analysis with propensity score, subgroups and interaction with treatment assignment; Method 2: Weighted regression analysis with inverse of the propensity score (to weight observations), subgroups and interaction with treatment assignment; Method 3: Propensity score applied for groups defined on treatment assignment and subgroups; then, regression analysis with propensity score and dummies for groups Two treatment categories and two subgroups are used in this study 4

Variable selection for propensity score Does the variable for subgroups has to be included? Discussion about variable selection for propensity score; Only variables related to outcome? Only variables related to treatment assignment? Both variables…? In this study; 8 different propensity scores (PS) formulated, based on: Variables related to outcome, with and without subgroup Variables related to treatment assigment, with and without subgroup Both variables…, with and without subgroup Only variables related to both outcome and treatment assignment, with and without subgroup 5

How to test? (1) Real dataset not useful because effects unknown beforehand;  You cannot test whether the effect found is accurate Monte Carlo simulation study to test methods and different propensity scores: Simulate data with known treatment effects Estimate different propensity scores for this data Apply different methods for different propensity scores, for this data  Repeat this process 1000 times 6

How to test? (2) What do you want to know? If the treatment effect estimated is (almost) equal to the treatment effect you used to simulate the data Bias of estimator: difference between estimated treatment effect and the true value of parameter Want to have an unbiased estimate; Less bias indicates a more accurate estimate of the treatment effect Bias is estimated for overall treatment effect and for the treatment effect within subgroups 7

Results 8 N=250;ρ=0N=250;ρ=0.3N=250; ρ=0.7N=500;ρ=0N=500;ρ=0.3N=500; ρ=0.7N=1000;ρ=0N=1000;ρ=0.3N=1000; ρ=0.7 RowMethodBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSE Regression 1PS1, func PS1, func PS1, func PS1, subgr PS2, func PS2, func PS2, func PS2, subgr PS3, func PS3, func PS3, func PS3, subgr PS4, func PS4, func PS4, func PS4, subgr PS5, func PS5, func PS5, func PS5, subgr PS6, func PS6, func PS6, func PS6, subgr PS7, func PS7, func PS7, func PS7, subgr PS8, func PS8, func PS8, func PS8, subgr N=250;ρ=0N=250;ρ=0.3N=250; ρ=0.7N=500;ρ=0N=500;ρ=0.3N=500; ρ=0.7N=1000;ρ=0N=1000;ρ=0.3N=1000; ρ=0.7 RowMethodBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSE Regression 1PS1, func PS1, func PS1, func PS1, subgr PS2, func PS2, func PS2, func PS2, subgr PS3, func PS3, func PS3, func PS3, subgr PS4, func PS4, func PS4, func PS4, subgr PS5, func PS5, func PS5, func PS5, subgr PS6, func PS6, func PS6, func PS6, subgr PS7, func PS7, func PS7, func PS7, subgr PS8, func PS8, func PS8, func PS8, subgr N=250;ρ=0N=250;ρ=0.3N=250; ρ=0.7N=500;ρ=0N=500;ρ=0.3N=500; ρ=0.7N=1000;ρ=0N=1000;ρ=0.3N=1000; ρ=0.7 RowMethodBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSEBiasMSE Inverse PS 1PS1, func PS1, func PS1, func PS1, subgr PS2, func PS2, func PS2, func PS2, subgr PS3, func PS3, func PS3, func PS3, subgr PS4, func PS4, func PS4, func PS4, subgr PS5, func PS5, func PS5, func PS5, subgr PS6, func PS6, func PS6, func PS6, subgr PS7, func PS7, func PS7, func PS7, subgr PS8, func PS8, func PS8, func PS8, subgr

Results (1) Which propensity score is the most accurate within each method tested (tested with ANOVA): But, some values for bias per propensity score where not very different from each other… 9 General treatment effectTreatment effect within subgroups Method 1PS with variables related to outcome Method 2PS with variables related to outcome and variable for subgroups PS with variables only related to outcome and treatment assignment and variables for subgroups Method 3PS with variables related to outcome NA

Results (2) Which method is most accurate when the most accurate propensity scores are compared? Decide on partial effect size of method in ANOVA* For general treatment effect, the partial effect size is 0,028, where method 1 gives the lowest bias (followed by method 3) For treatment effect within subgroups, the partial effect size is 0,051, where method 1 gives the lowest bias too Although the effect sizes for method are not very large, regression analysis with treatment assignment, subgroup, interaction between these and the propensity score, which is estimated with variables related to outcome, seems to be the most accurate method to find treatment effects within subgroups *Effect size – 0,010 = small; 0,059 = medium; 0,138 = large (Cohen, 1988) 10

Discussion (1) Data simulation is done for different settings: Sample size, correlation between covariates and correlation with covariate for subgroups are changed over simulations Results for most accurate propensity score are based on sum of bias over all these settings; comparisons between methods for all propensity scores could give more in depth results The overall bias for different propensity scores was sometimes not very different Model for simulation of data was simple, linear; the relation between variables and outcome in practice can be more complicated …. 11

Discussion (2) Questions? 12