Partially Missing At Random and Ignorable Inferences for Parameter Subsets with Missing Data Roderick Little Rennes 20151.

Slides:



Advertisements
Similar presentations
Sampling Research Questions
Advertisements

A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
EMNLP, June 2001Ted Pedersen - EM Panel1 A Gentle Introduction to the EM Algorithm Ted Pedersen Department of Computer Science University of Minnesota.
Treatment of missing values
Mixture Models and the EM Algorithm
1 Multiple Frame Surveys Tracy Xu Kim Williamson Department of Statistical Science Southern Methodist University.
Cox Model With Intermitten and Error-Prone Covariate Observation Yury Gubman PhD thesis in Statistics Supervisors: Prof. David Zucker, Prof. Orly Manor.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Goodness of Fit of a Joint Model for Event Time and Nonignorable Missing Longitudinal Quality of Life Data – A Study by Sneh Gulati* *with Jean-Francois.
How to Handle Missing Values in Multivariate Data By Jeff McNeal & Marlen Roberts 1.

1 Unsupervised Learning With Non-ignorable Missing Data Machine Learning Group Talk University of Toronto Monday Oct 4, 2004 Ben Marlin Sam Roweis Rich.
How to deal with missing data: INTRODUCTION
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
Increasing Survey Statistics Precision Using Split Questionnaire Design: An Application of Small Area Estimation 1.
Separate multivariate observations
Target population-> Study Population-> Sample 1WWW.HIVHUB.IR Target Population: All homeless in country X Study Population: All homeless in capital shelters.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
1 Multiple Imputation : Handling Interactions Michael Spratt.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Performance of Resampling Variance Estimation Techniques with Imputed Survey data.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
Chapter 7 The Logic Of Sampling. Observation and Sampling Polls and other forms of social research rest on observations. The task of researchers is.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Imputation for Multi Care Data Naren Meadem. Introduction What is certain in life? –Death –Taxes What is certain in research? –Measurement error –Missing.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Empirical Efficiency Maximization: Locally Efficient Covariate Adjustment in Randomized Experiments Daniel B. Rubin Joint work with Mark J. van der Laan.
GEE Approach Presented by Jianghu Dong Instructor: Professor Keumhee Chough (K.C.) Carrière.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Assessing Estimability of Latent Class Models Using a Bayesian Estimation Approach Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University Departments.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
Inference for the mean vector. Univariate Inference Let x 1, x 2, …, x n denote a sample of n from the normal distribution with mean  and variance 
A shared random effects transition model for longitudinal count data with informative missingness Jinhui Li Joint work with Yingnian Wu, Xiaowei Yang.
Linear Correlation (12.5) In the regression analysis that we have considered so far, we assume that x is a controlled independent variable and Y is an.
Tutorial I: Missing Value Analysis
Armando Teixeira-Pinto AcademyHealth, Orlando ‘07 Analysis of Non-commensurate Outcomes.
INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.
Pre-Processing & Item Analysis DeShon Pre-Processing Method of Pre-processing depends on the type of measurement instrument used Method of Pre-processing.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Yu Cheng Chen Author: Lynette.
Computacion Inteligente Least-Square Methods for System Identification.
Synthetic Approaches to Data Linkage Mark Elliot, University of Manchester Jerry Reiter Duke University Cathie Marsh Centre.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
1 Borgan and Henderson: Event History Methodology Lancaster, September 2006 Session 8.1: Cohort sampling for the Cox model.
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Missing data: Why you should care about it and what to do about it
MISSING DATA AND DROPOUT
Inference for the mean vector
The Centre for Longitudinal Studies Missing Data Strategy
Maximum Likelihood & Missing data
Multiple Imputation.
Multiple Imputation Using Stata
How to handle missing data values
Making Statistical Inferences
The European Statistical Training Programme (ESTP)
Ch11 Curve Fitting II.
Product moment correlation
Missing Data Mechanisms
Chapter 13: Item nonresponse
Presentation transcript:

Partially Missing At Random and Ignorable Inferences for Parameter Subsets with Missing Data Roderick Little Rennes 20151

Outline Inference with missing data: Rubin's (1976) paper on conditions for ignoring the missing-data mechanism Rubin’s standard conditions are sufficient but not necessary: example Propose definitions of partially MAR, ignorability for likelihood (and Bayes) inference for subsets of parameters (Little and Zanganeh, 2013) Application: Subsample ignorable methods for regression with missing covariates (Little and Zhang, 2011) Joint work with Nanhua Zhang, Sahar Zanganeh Rennes 20152

v 3

Rubin (1976 Biometrika) Landmark paper (5000+ citations, after being rejected by many journals!) –I wrote my first referee’s report (11 pages!), and an obscure discussionon ancillarity Modeled the missing data mechanism by treating missingness indicators as random variables, assigning them a distribution Sufficient conditions under which missing data mechanism can be ignored for likelihood and frequentist inference about parameters –Focus here on likelihood, Bayes Rennes 20154

Ignoring the mechanism Full likelihood: Likelihood ignoring mechanism: Missing data mechanism can be ignored for likelihood inference when Rennes 20155

Rubin’s sufficient conditions for ignoring the mechanism Missing data mechanism can be ignored for likelihood inference when –(a) the missing data are missing at random (MAR): –(b) distinctness of the parameters of the data model and the missing-data mechanism: MAR is the key condition: without (b), inferences are valid but not fully efficient Rennes 20156

More on Rubin (1976) Seaman et al. (2013) propose a more complex but precise notation Distinguish between “direct” likelihood inference and “frequentist likelihood inference –“Realized MAR” sufficient for direct likelihood inference – R depends only on realized observed data –“Everywhere MAR” sufficient for frequentist likelihood inference: MAR condition needs to hold for observed values in future repeated sampling –Rubin (1976) uses term “always MAR”. See also Mealli and Rubin (2015, forthcoming) Rennes 20157

“Sufficient for ignorable” is not the same as “ignorable” These definitions have come to define ignorability (e.g. Little and Rubin 2002) However, Rubin (1976) described (a) and (b) as the "weakest simple and general conditions under which it is always appropriate to ignore the process that causes missing data". These conditions are not necessary for ignoring the mechanism in all situations. Rennes 20158

Example 1: Nonresponse with auxiliary data ???? ???? Not linked Or whole population N Rennes 20159

MAR, ignorability for parameter subsets MAR and ignorability are defined in terms of the complete set of parameters in the data model for D It would be useful to have a definition of MAR that applies to subsets of parameters of substantive interest. Example: inference for regression parameters might be “partially MAR” when parameters for a model for full data are not. Rennes

MAR, ignorability for parameter subsets Rennes

MAR, ignorability for parameter subsets Rennes

Partial MAR given a function of mechanism Rennes

Example 1: Auxiliary Survey Data ???? ???? Not linked Rennes

Ex. 2: MNAR Monotone Bivariate Data ???? Rennes

More generally… Rennes

Application: missing data in covariates ? Target: regression of Y on X, Z; missing data on X BUT: if Pr(X missing)= g(Z, X) CC analysis is consistent, but IL methods (or weighted CC) are inconsistent since mechanism is not MAR Simulations favoring IL often generate data under MAR, hence are biased against CC IL methods include information for the regression in the incomplete cases (particularly intercept and coefficients of Z) and are valid assuming MAR: Pr(X missing)= g(Z, Y) Rennes

PatternObservation, i P1i = 1,…,m√√√ P2i = m +1,…,n√?√ More general missing data in X Key: √ denotes observed, ? denotes observed or missing Could be vector P1 P2 Rennes

Ignorable Likelihood methods Rennes

CC analysis MNAR mechanism: Missingness can depend on missing values of X Rennes Follows from (*)

PatternObservation, i P1i = 1,…,m√√? P2i = m +1,…,n√?? Extension: missing data on X and Y Key: √ denotes observed, ? denotes observed or missing Could be vector P1: covariates complete P2: x incomplete Rennes

SSIL analysis, X and Y missing Rennes

SSIL likelihood, X and Y missing Rennes

PatternObservation, i P1i = 1,…,m√√√? P2i = m +1,…,m+r√√?? P3i = m +r+1,…,n√??? Two covariates X, W with different mechanisms SSIL: analyze cases in patterns 1 and 2 P1: covariates complete P2: x obs, w, y may be mis P3: x mis, w, y may be mis Rennes

Subsample Ignorable Likelihood (SSIL) Target: regression of Y on Z, X, and W Assume: By similar proof to previous case, data are SSIL applies IL method (e.g. ML) to the subsample of cases for which X is observed, but W or Y may be incomplete Rennes

Simulation Study For each of 1000 replications, 5000 observations Z, W, X and Y generated as: 20-35% of missing values of W and X generated by four mechanisms Rennes

Simulation: missing data mechanisms Mechanisms I: All valid II: CC valid III: IML valid IV: SSIML valid Rennes

RMSEs*1000 of Estimated Regression Coefficients for Before Deletion (BD), Complete Cases (CC), Ignorable Maximum Likelihood (IML) and Subsample Ignorable Maximum Likelihood (SSIML), under Four Missing Data Mechanisms. I*IIIIIIVIIIIIIIV BD CC IML SSIML Valid: ALL CC IML SSIML ALL CC IML SSIML Rennes

Missing Covariates in Survival Analysis Rennes

How to choose X, W Choice requires understanding of the mechanism: Variables that are missing based on their underlying values belong in W Variables that are MAR belong in X Collecting data about why variables are missing is obviously useful to get the model right But this applies to all missing data adjustments… Rennes

Other questions and points –How much is lost from SSIL relative to full likelihood model of data and missing data mechanism? In some special cases, SSIL is efficient for a pattern-mixture model In other cases, trade-off between additional specification of mechanism and loss of efficiency from conditional likelihood –MAR analysis applied to the subset does not have to be likelihood-based E.g. weighted GEE, AIPWEE –Pattern-mixture models (Little, 1993) can also avoid modeling the mechanism Rennes

Conclusions Defined partial MAR for a subset of parameters Application to regression with missing covariates: sometimes discarding data is useful! Subsample ignorable likelihood: apply likelihood method to data, selectively discarding cases based on assumed missing- data mechanism –More efficient than CC –Valid for P-MAR mechanisms where IL, CC are inconsistent Rennes

References Harel, O. and Schafer, J.L. (2009). Partial and Latent Ignorability in missing data problems. Biometrika, 2009, 1-14 Little, R.J.A. (1993). Pattern ‑ Mixture Models for Multivariate Incomplete Data. JASA, 88, Little, R. J. A., and Rubin, D. B. (2002). Statistical Analysis with Missing Data (2 nd ed.) Wiley. Little, R.J. and Zangeneh, S.Z. (2013). Missing at random and ignorability for inferences about subsets of parameters with missing data. University of Michigan Biostatistics Working Paper Series. Little, R. J. and Zhang, N. (2011). Subsample ignorable likelihood for regression analysis with missing data. JRSSC, 60, 4, 591–605. Rubin, D. B. (1976). Inference and Missing Data. Biometrika 63, Seaman, S., Galati, J., Jackson, D. and Carlin, J. (2013). What Is Meant by “Missing at Random”? Statist. Sci. 28, 2, Zhang, N. & Little, R.J. (2014). Lifetime Data Analysis, published online Aug doi:10.​1007/​s x. Rennes