1 Meta-analysis with missing data: metamiss Ian White and Julian Higgins MRC Biostatistics Unit, Cambridge, UK Stata users’ group, London 10 September.

Slides:



Advertisements
Similar presentations
Ian White MRC Biostatistics Unit, Cambridge
Advertisements

Notes Sample vs distribution “m” vs “µ” and “s” vs “σ” Bias/Variance Bias: Measures how much the learnt model is wrong disregarding noise Variance: Measures.
Meta-analysis: summarising data for two arm trials and other simple outcome studies Steff Lewis statistician.
Comments on Hierarchical models, and the need for Bayes Peter Green, University of Bristol, UK IWSM, Chania, July 2002.
Bayesian Estimation in MARK
BAYESIAN POSTERIOR DISTRIBUTIONS FOR PROBABILISTIC SENSITIVITY ANALYSIS Gordon B. Hazen and Min Huang, IEMS Department, Northwestern University, Evanston.

Error Propagation. Uncertainty Uncertainty reflects the knowledge that a measured value is related to the mean. Probable error is the range from the mean.
Structural uncertainty from an economists’ perspective
1 Unsupervised Learning With Non-ignorable Missing Data Machine Learning Group Talk University of Toronto Monday Oct 4, 2004 Ben Marlin Sam Roweis Rich.
Results 2 (cont’d) c) Long term observational data on the duration of effective response Observational data on n=50 has EVSI = £867 d) Collect data on.
Sample Midterm question. Sue want to build a model to predict movie ratings. She has a matrix of data, where for M movies and U users she has collected.
Approximate Bayesian Methods in Genetic Data Analysis Mark A. Beaumont, University of Reading,
1 Transforming the efficiency of Partial EVSI computation Alan Brennan Health Economics and Decision Science (HEDS) Samer Kharroubi Centre for Bayesian.
Missing Data.. What do we mean by missing data? Missing observations which were intended to be collected but: –Never collected –Lost accidently –Wrongly.
CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.
1 ADDRESSING BETWEEN-STUDY HETEROGENEITY AND INCONSISTENCY IN MIXED TREATMENT COMPARISONS Application to stroke prevention treatments for Atrial Fibrillation.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
Meta-Analysis and Meta- Regression Airport Noise and Home Values J.P. Nelson (2004). “Meta-Analysis of Airport Noise and Hedonic Property Values: Problems.
Standard Error of the Mean
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
Are the results valid? Was the validity of the included studies appraised?
Lecture 2 Basic Bayes and two stage normal normal model…
ANOVA Greg C Elvers.
ECE 8443 – Pattern Recognition LECTURE 06: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Bias in ML Estimates Bayesian Estimation Example Resources:
Applications of Bayesian sensitivity and uncertainty analysis to the statistical analysis of computer simulators for carbon dynamics Marc Kennedy Clive.
Guide to Handling Missing Information Contacting researchers Algebraic recalculations, conversions and approximations Imputation method (substituting missing.
9 Mar 2007 EMBnet Course – Introduction to Statistics for Biologists Nonparametric tests, Bootstrapping
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Estimating Incremental Cost- Effectiveness Ratios from Cluster Randomized Intervention Trials M. Ashraf Chaudhary & M. Shoukri.
Properties of OLS How Reliable is OLS?. Learning Objectives 1.Review of the idea that the OLS estimator is a random variable 2.How do we judge the quality.
Therapeutic Equivalence & Active Control Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Patricia Guyot1,2, Nicky J Welton1, AE Ades1
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition LECTURE 07: BAYESIAN ESTIMATION (Cont.) Objectives:
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Markov Chain Monte Carlo for LDA C. Andrieu, N. D. Freitas, and A. Doucet, An Introduction to MCMC for Machine Learning, R. M. Neal, Probabilistic.
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
MPS/MSc in StatisticsAdaptive & Bayesian - Lect 51 Lecture 5 Adaptive designs 5.1Introduction 5.2Fisher’s combination method 5.3The inverse normal method.
Analysis of Experimental Data; Introduction
Designing Factorial Experiments with Binary Response Tel-Aviv University Faculty of Exact Sciences Department of Statistics and Operations Research Hovav.
Tutorial I: Missing Value Analysis
Univariate Gaussian Case (Cont.)
1 Probability and Statistics Confidence Intervals.
Systematic reviews and meta-analyses: when and how to do them Andrew Smith Royal Lancaster Infirmary 18 May 2015.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
Scheduling with Uncertain Resources Eugene Fink, Jaime G. Carbonell, Ulas Bardak, Alex Carpentier, Steven Gardiner, Andrew Faulring, Blaze Iliev, P. Matthew.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Density Estimation in R Ha Le and Nikolaos Sarafianos COSC 7362 – Advanced Machine Learning Professor: Dr. Christoph F. Eick 1.
Institute of Statistics and Decision Sciences In Defense of a Dissertation Submitted for the Degree of Doctor of Philosophy 26 July 2005 Regression Model.
Markov Chain Monte Carlo in R
H676 Week 3 – Effect Sizes Additional week for coding?
Multiple Imputation using SOLAS for Missing Data Analysis
Bayesian data analysis
MISSING DATA AND DROPOUT
The Centre for Longitudinal Studies Missing Data Strategy
Multiple Imputation.
How to handle missing data values
The Binomial and Geometric Distributions
Multidimensional Integration Part I
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Example Human males have one X-chromosome and one Y-chromosome,
Linear Hierarchical Modelling
Learning From Observed Data
Summary risk ratios (RRs) as estimated when the missing data are ignored (available case), by naïve imputation methods (best-case and worst-case) and by.
Clinical prediction models
Presentation transcript:

1 Meta-analysis with missing data: metamiss Ian White and Julian Higgins MRC Biostatistics Unit, Cambridge, UK Stata users’ group, London 10 September 2007

2 Motivation Missing outcome data compromise trials So they also compromise meta-analyses We may want to –correct for bias due to missing data –down-weight trials with more missing data NB missing data within trials, not missing trials

3 Plan Meta-analysis of binary data Haloperidol example Standard approaches to missing data Imputation methods IMORs Methods that allow for uncertainty Demonstration

4 Haloperidol meta-analysis HaloperidolPlacebo % missing r1f1m1n1r2f2m2n2 Arvanitis % Beasley % Bechelli % Borison % Chouinard % Durost % Garry % Howard % Marder % Nishikawa % Nishikawa % Reschke % Selman % Serafetinides % Simpson % Spencer % Vichaiya % r=successes f=failures m=missing n=total

5 Standard approaches to missing data Available cases (complete cases): ignore the missing data –assumes MAR: missingness is independent of outcome given arm Assume missing=failure –implausible, but not too bad for health-related behaviours Neither assumption is likely to be correct

6 Other ideas Sensitivity analyses, e.g. do both missing=failure and available cases –but these could agree by chance Explore best / worst cases Use reasons for missingness Explicit assumptions about informative missingness ( IM) –IM: missingness is dependent on outcome

7 metamiss.ado Processes data on successes, failures and missing by arm & feeds results to metan Available cases analysis (ACA) Imputed case analyses (ICA): –impute as failure: ICA-0 –impute as success: ICA-1 –best-case: ICA-b (missing=success in E, failure in C) –worst-case: ICA-w –impute with same probability as in control arm: ICA-pC –impute with same probability as in experimental arm: ICA-pE –impute with same probability as in own arm: ICA-p (agrees with ACA) –impute using IMORs: ICA-IMOR (see next slide)

8 More general imputation: IMORs Measure Informative Missingness using the Informative Missing Odds Ratio (IMOR): –Odds ratio between outcome and missingness Can’t estimate IMOR from the data, but given any value of IMOR, we can analyse the data Generalises other ideas: e.g. –ICA-0 uses IMORs 0, 0 –ICA-1 uses IMORs ,  –ICA-b uses IMORs , 0 –ICA-p uses IMORs 1, 1 –ICA-pC uses IMORs OR, 1 where OR is odds ratio between arm and outcome in available cases

9 Getting standard errors (weighting) right Weight 1: treat imputed data as real Weight 2: use standard errors from ACA Weight 3: scale imputed data to same sample size as available cases Weight 4: algebraic standard errors –same as weight 1 for ICA-0, ICA-1, ICA-b, ICA-w –same as weight 2 for ICA-p –uses Taylor expansion for ICA-IMOR –for ICA-pC & ICA-pE, we condition on the IMOR (I can explain…)

10

11

12

13

14

15

16

17

18

19

20

21 Allowing for reasons (ICA-R) Specify number of missing individuals in each arm to be imputed by each scheme ICA-0, ICA-1, ICA-pC, ICA-pE, ICA-p, ICA-IMOR. Can take these data from a different outcome: metamiss scales to #missing If missing in a particular study, metamiss imputes using combined studies

22

23 Allowing for uncertainty So far we have pretended we really know the IMORs This is never really correct Now we allow them to be unknown but from a user-specified distribution

24 Bayesian approach allowing for uncertain IMORs (Rubin, 1977)

25 Bayesian analysis Elicit prior for  E,  C or use N(0,1 2 ) or N(0,2 2 ) Get posterior distribution by integrating over the 2- dimensional distribution of  E,  C. metamiss does this fast & accurately by: 1.Standard normal approximation to posterior given  E,  C 2.Integrate using Gauss-Hermite quadrature. Alternatives: –Taylor expansion (inaccurate for large SD of log IMOR) –Full Bayesian Monte Carlo (slow, little gain in accuracy)

26 Understanding priors for log IMOR: implied prior for P(success | missing) when P(success | observed) = 1/2

27

28

29

30 Proposal: 4 sensitivity analyses IMORsOptions (e.g.) Sensitive to:Works via: fixed equal imor(2 2) Imbalance in missingness Point estimates fixed opposite imor(2 1/2) Amount of missing data random equal sdlogimor(2) corr(1) Imbalance in missingness Weightings random uncorrelated sdlogimor(2) corr(0) Amount of missing data

31 Summary Tool for sensitivity analysis Requires thought about plausible missing data mechanisms Would be nice to overlay sensitivity analysis with ACA Further work includes combining uncertainty with reasons I also have a program mvmeta for multivariate meta-analysis

32 References 1 st part: Higgins JPT, White IR, Wood A. Imputation methods for missing outcome data in meta-analysis of clinical trials. Clinical Trials, submitted. 2 nd part: White IR, Higgins JPT, Wood AM. Allowing for uncertainty due to missing data in meta-analysis. 1. Two-stage methods. Statistics in Medicine, in press. Related: White IR, Welton NJ, Wood AM, Ades AE, Higgins JPT. Allowing for uncertainty due to missing data in meta- analysis. 2. Hierarchical models. Statistics in Medicine, in press. metamiss.ado available from

33 Extra slides

34