Estimating a causal effect using observational data Aad van der Vaart Afdeling Wiskunde, Vrije Universiteit Amsterdam Joint with Jamie Robins, Judith Lok,

Slides:



Advertisements
Similar presentations
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Brief introduction on Logistic Regression
Chapter 12 Simple Linear Regression
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
Uncertainty in Engineering The presence of uncertainty in engineering is unavoidable. Incomplete or insufficient data Design must rely on predictions or.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Nguyen Ngoc Anh Nguyen Ha Trang
10 Further Time Series OLS Issues Chapter 10 covered OLS properties for finite (small) sample time series data -If our Chapter 10 assumptions fail, we.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Topic 6: Introduction to Hypothesis Testing
The Multiple Regression Model Prepared by Vera Tabakova, East Carolina University.
Multiple Linear Regression Model
Maximum likelihood Conditional distribution and likelihood Maximum likelihood estimations Information in the data and likelihood Observed and Fisher’s.
Simultaneous Equations Models
Copyright © 2006 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide Are the Means of Several Groups Equal? Ho:Ha: Consider the following.
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Using Regression Models to Analyze Randomized Trials: Asymptotically Valid Tests Despite Incorrect Regression Models Michael Rosenblum, UCSF TAPS Fellow.
Maximum likelihood (ML)
Ordinary Least Squares
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Regression Method.
Chapter 13 Notes Observational Studies and Experimental Design
1 Least squares procedure Inference for least squares lines Simple Linear Regression.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
AP Statistics Chapter 9 Notes.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 15 Multiple Regression n Multiple Regression Model n Least Squares Method n Multiple.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
From Theory to Practice: Inference about a Population Mean, Two Sample T Tests, Inference about a Population Proportion Chapters etc.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
HSRP 734: Advanced Statistical Methods July 17, 2008.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
Pro gradu –thesis Tuija Hevonkorpi.  Basic of survival analysis  Weibull model  Frailty models  Accelerated failure time model  Case study.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
1 B IVARIATE AND MULTIPLE REGRESSION Estratto dal Cap. 8 di: “Statistics for Marketing and Consumer Research”, M. Mazzocchi, ed. SAGE, LEZIONI IN.
28. Multiple regression The Practice of Statistics in the Life Sciences Second Edition.
AP Statistics Semester One Review Part 2 Chapters 4-6 Semester One Review Part 2 Chapters 4-6.
Multiple Logistic Regression STAT E-150 Statistical Methods.
Measures of Association: Pairwise Correlation
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
8-3: Probability and Probability Distributions English Casbarro Unit 8.
T tests comparing two means t tests comparing two means.
Chapter Eight: Quantitative Methods
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Statistical NLP: Lecture 4 Mathematical Foundations I: Probability Theory (Ch2)
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Multiple Regression Chapter 14.
CJT 765: Structural Equation Modeling Final Lecture: Multiple-Group Models, a Word about Latent Growth Models, Pitfalls, Critique and Future Directions.
Ten things about Experimental Design AP Statistics, Second Semester Review.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Estimating standard error using bootstrap
Bayesian Semi-Parametric Multiple Shrinkage
Basic Estimation Techniques
Chapter 8: Inference for Proportions
Simple Linear Regression - Introduction
The Practice of Statistics in the Life Sciences Fourth Edition
Basic Estimation Techniques
Lecture 15 Sections 7.3 – 7.5 Objectives:
Econometric Models The most basic econometric model consists of a relationship between two variables which is disturbed by a random error. We need to use.
Discrete Event Simulation - 4
Ten things about Experimental Design
Statistical NLP: Lecture 4
Sampling Distributions
Indicator Variables Response: Highway MPG
Testing Causal Hypotheses
Presentation transcript:

Estimating a causal effect using observational data Aad van der Vaart Afdeling Wiskunde, Vrije Universiteit Amsterdam Joint with Jamie Robins, Judith Lok, Richard Gill

CAUSALITY Operational Definition: If individuals are randomly assigned to a treatment and control group, and the groups differ significantly after treatment, then the treatment causes the difference We want to apply this definition with observational data

Counter factuals treatment indicator A  {0,1} outcome Y Given observations (A, Y) for a sample of individuals, mean treatment effect might be defined as E( Y | A=1 ) – E( Y | A=0 ) However, if treatment is not randomly assigned this is NOT what we want to know

Counter factuals (2) treatment indicator A  {0,1} outcome Y outcome Y 1 if individual had been treated outcome Y 0 if individual had not been treated mean treatment effect E Y 1 – E Y 0 Unfortunately, we observe only one of Y 1 and Y 0, namely: Y= Y A

Counter factuals (3) ASSUMPTION: there exists a measured covariate Z with A  (Y 0, Y 1 ) given Z  means “are statistically independent” Under ASSUMPTION: E Y 1 – E Y 0 =  {E (Y | A=1, Z=z) - E (Y | A=1, Z=z) } dP Z (z) CONSEQUENCE: under ASSUMPTION the mean treatment effect is estimable from the observed data (Y,Z,A) ASSUMPTION is more likely to hold if Z is “bigger”

Longitudinal Data times:       treatments: a = (a 0, a 1,..., a K ) observed treatments: A = (A 0, A 1,..., A K ) counterfactual outcomes: Y a observed outcome: Y A We are interested in E Y a for certain a

Longitudinal Data (2) times:       treatments: a = (a 0, a 1,..., a K ) observed treatments: A = (A 0, A 1,..., A K ) ASSUMPTION: Y a  A k given ( Z k, A k-1 ), for all k Under ASSUMPTION E Y a can be expressed in the distribution of the observed data (Y, Z, A ) “It is the task of an epidemiologist to collect enough information so that ASSUMPTION is satisfied” observed covariates: Z = (Z 0, Z 1,..., Z K )

Estimation and Testing Under ASSUMPTION it is possible, in principle to test whether treatment has effect to estimate the mean counterfactual treatment effects A standard statistical approach would be to model and estimate all unknowns. However there are too many. We look for a “semiparametric approach” instead.

Shift function The quantile-distribution shift function is the (only monotone) function that transforms a variable “distributionally” into another variable. It is convenient to model a change in distribution.

Structural Nested Models shift map corresponding to these distributions, transforms into IDEA: model by a parameter and estimate it treatment until time k outcome of this treatment

Structural Nested Models (2) treatment until time k outcome of this treatment transforms into positive effect no effect negative effect no effect negative effect time  k-1  k

Estimation Make regression model for Make model for  Add as explanatory variable Estimate  by the value such that does NOT add explanatory value. Under ASSUMPTION: is distributed as 

Estimation (2) Example: if treatment A is binary, then we might use a logistic regression model We estimate (  by standard software for given  The “true”  is the one such that the estimated  is zero. We can also test whether treatment has an effect at all by testing H 0 :  =0 in this model with Y instead of Y  

End Lok, Gill, van der Vaart, Robins, 2004, Estimating the causal effect of a time-varying treatment on time-to-event using structural nested failure time models Lok, 2001 Statistical modelling of causal effects in time Proefschrift, Vrije Universiteit