  Meta-Analysis of Correlated Data. Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same.

Slides:



Advertisements
Similar presentations
Repeated Measures/Mixed-Model ANOVA:
Advertisements

Meta-Regression & Mixed Effects T i ~ N(  i  i 2 )  i ~ N(  2 )
I OWA S TATE U NIVERSITY Department of Animal Science Using Basic Graphical and Statistical Procedures (Chapter in the 8 Little SAS Book) Animal Science.
Topic 12: Multiple Linear Regression
Likelihood & Hierarchical Models  ij  j,  j 2   j ,  2 
Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Statistical Analysis Overview I Session 2 Peg Burchinal Frank Porter Graham Child Development Institute, University of North Carolina-Chapel Hill.
SPH 247 Statistical Analysis of Laboratory Data 1April 2, 2013SPH 247 Statistical Analysis of Laboratory Data.
3-Dimensional Gait Measurement Really expensive and fancy measurement system with lots of cameras and computers Produces graphs of kinematics (joint.
Uncertainty and confidence intervals Statistical estimation methods, Finse Friday , 12.45–14.05 Andreas Lindén.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Lecture 4 (Chapter 4). Linear Models for Correlated Data We aim to develop a general linear model framework for longitudinal data, in which the inference.
Analysis of Clustered and Longitudinal Data Module 3 Linear Mixed Models (LMMs) for Clustered Data – Two Level Part A 1 Biostat 512: Module 3A - Kathy.
Statistical Models in Meta- Analysis T i ~ N(  i  i 2 )  i ~ N(  2 )
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
1 Test a hypothesis about a mean Formulate hypothesis about mean, e.g., mean starting income for graduates from WSU is $25,000. Get random sample, say.
Multilevel Models 2 Sociology 8811, Class 24
ANCOVA Psy 420 Andrew Ainsworth. What is ANCOVA?
Chapter 14 Conducting & Reading Research Baumgartner et al Chapter 14 Inferential Data Analysis.
Comparing Means: Independent-samples t-test Lesson 13 Population APopulation B Sample 1Sample 2 OR.
What Is Multivariate Analysis of Variance (MANOVA)?
Heterogeneity in Hedges. Fixed Effects Borenstein et al., 2009, pp
Data Analysis Statistics. Inferential statistics.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Objectives of Multiple Regression
Introduction to Multilevel Modeling Using SPSS
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Data Analysis and Interpretation: Computing effect sizes.
Overview of Meta-Analytic Data Analysis
The Campbell Collaborationwww.campbellcollaboration.org Introduction to Robust Standard Errors Emily E. Tanner-Smith Associate Editor, Methods Coordinating.
Guide to Handling Missing Information Contacting researchers Algebraic recalculations, conversions and approximations Imputation method (substituting missing.
Error Component Models Methods of Economic Investigation Lecture 8 1.
Scientific question: Does the lunch intervention impact cognitive ability? The data consists of 4 measures of cognitive ability including:Raven’s score.
Introduction Multilevel Analysis
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Module 8: Estimating Genetic Variances Nested design GCA, SCA Diallel
Statistical Applications for Meta-Analysis Robert M. Bernard Centre for the Study of Learning and Performance and CanKnow Concordia University December.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
How to Evaluate the Effects of Potential Bias in Meta-analysis in R.
Kirsten Fiest, PhD June 23, CONDUCTING META-ANALYSES IN HEALTH RESEARCH.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Dependencies Complex Data in Meta-analysis. Common Dependencies Independent subgroups within a study (nested in lab?) Multiple outcomes on the same people.
Statistical Models for the Analysis of Single-Case Intervention Data Introduction to:  Regression Models  Multilevel Models.
Lecture 3 Linear random intercept models. Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The.
Schmidt & Hunter Approach to r Bare Bones. Statistical Artifacts Extraneous factors that influence observed effect Sampling error* Reliability Range restriction.
Mixed Effects Models Rebecca Atkins and Rachel Smith March 30, 2015.
Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.
Bootstraps and Jackknives Hal Whitehead BIOL4062/5062.
Data Analysis in Practice- Based Research Stephen Zyzanski, PhD Department of Family Medicine Case Western Reserve University School of Medicine October.
Environmental Modeling Basic Testing Methods - Statistics III.
1 Summarizing Performance Data Confidence Intervals Important Easy to Difficult Warning: some mathematical content.
Fixed- v. Random-Effects. Fixed and Random Effects 1 All conditions of interest – Fixed. Sample of interest – Random. Both fixed and random-effects meta-analyses.
Analysis of Experiments
September 28, 2000 Improved Simultaneous Data Reconciliation, Bias Detection and Identification Using Mixed Integer Optimization Methods Presented by:
Jessaca Spybrook Western Michigan University Multi-level Modeling (MLM) Refresher.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Genotype x Environment Interactions Analyses of Multiple Location Trials.
An Application of Multilevel Modelling to Meta-Analysis, and Comparison with Traditional Approaches Alison O’Mara & Herb Marsh Department of Education,
Review Statistical inference and test of significance.
Statistical Concepts Basic Principles An Overview of Today’s Class What: Inductive inference on characterizing a population Why : How will doing this allow.
Estimating standard error using bootstrap
From t-test to multilevel analyses Del-2
Linear Mixed Models in JMP Pro
Mixed models and their uses in meta-analysis
G Lecture 6 Multilevel Notation; Level 1 and Level 2 Equations
Methods of Economic Investigation Lecture 12
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
MetaForest Using random forests to explore heterogeneity in meta-analysis Caspar J. van Lissa, Utrecht University NL
Presentation transcript:

  Meta-Analysis of Correlated Data

Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same control Phylogenetic non-independence Measurements of multiple responses to a common treatment Unknown correlations…

Multiple Sample Points per Study! StudyExperiment in StudyHedges DV Hedges D Ramos & Pinto Ramos & Pinto Ramos & Pinto Ellner & Vadas Ellner & Vadas Moria & Melian

Hierarchical Models Study-level random effect Study-level variation in coefficients Covariates at experiment and study level

Hierarchical Models Random variation within study (j) and between studies (i) T ij  ij,  ij 2 )  ij  j,  j 2   j ,  2 

Study Level Clustering

Hierarchical Partitioning of One Study Grand Mean Study Mean Variation due to  Variation due to 

Example: Data Set 1 Group Effect Variance 1 A A A A B B B C

A Two-Step Solution T ij  ij,  ij 2 )  ij  j,  j 2   j ,  2  library(plyr) data1_study <- ddply(data1,.(Group), function(adf){ mod <- rma(Effect, Variance, data=adf) cbind(theta_j = coef(mod), se_theta_j = coef(summary(mod))[1,2], omega2 = mod$tau2) })

A Two-Step Solution T ij  ij,  ij 2 )  ij  j,  j 2   j ,  2  > data1_study Group theta_j se_theta_j omega2 1 A B C D E jj jj

A Two-Step Solution T ij  ij,  ij 2 )  ij  j,  j 2   j ,  2  > rma(theta_j, I(se_theta_j^2), data=data1_study) Random-Effects Model (k = 5; tau^2 estimator: REML) tau^2 (estimated amount of total heterogeneity): (SE = )... estimate se zval pval ci.lb ci.ub < *** 22 

Multiple Effects per Research Group

Solutions to Multiple Hierarchies Multiple-Step Meta-analyses Multi-level hierarchical model fits –Better estimator –Accommodates more complex data structures –May need to go Bayesian (don't be scared!) Model correlation…

Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same control Phylogenetic non-independence Measurements of multiple responses to a common treatment Unknown correlations…

Multiple Effect Sizes with Common Control Effect of each treatment calculated using same control!

The Control Keeps Showing Up! n c and sd c are going to be the same for all treatments Effect sizes will covary

Calculating Covariance Formulae available or derivable for all effect sizes

A Mixed Effect Group Model Group means, random study effect, and then everything else is error T i  im,  i 2 ) where  im  m,  2 

A Mixed Effect Group Model Group means, random study effect, and then everything else is error T i  MVN  i,  i ) where  i  MVN  X i ,  2 

What are  i and  i ? i =i=i =i= T i  MVN  i,  i )

What about the treatment effects? X i =   i =  i  MVN  X i ,  2 

What if treatments are correlated?  i = T i  MVN  i,  i )

Why does covariance matter?   x-y =   x +   y + 2   x,y In asking if two treatments differ, cov helps tighten confidence intervals High cov  more weight for a study as treatments share information

Multiple Treatments study trt m1i m2i sdpi n1i n2i Common Control!

Calculating the Variance/Covariance Matrix [,1] [,2] [,3] [,4] [,5] [,6] [1,] [2,] [3,] [4,] [5,] [6,]

Fitting a Model with a VCOV Matrix > rma.mv(yi ~ factor(trt)-1, V, random =~ 1|study, data=dat)

Comparison to No Correlation Model With correlation estimate se zval pval ci.lb ci.ub factor(trt) < factor(trt) < Without correlation estimate se zval pval ci.lb ci.ub factor(trt) < factor(trt) <

Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same control Phylogenetic non-independence Measurements of multiple responses to a common treatment Unknown correlations…

Effect Size on Related Organisms Not Independent Warming on Litterfall Pine Trees Redwoods Fir Trees Oak Trees {

Phylogenetic Distances Determines Covariances for Weights

What about Multiple Studies of Some Species?

Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same control Phylogenetic non-independence Measurements of multiple responses to a common treatment Unknown correlations…

Common Treatments Treatment Response 1Response 2Response 3

Common Treatments CO 2 CO 2 Assimilation GS Stomatal Conductance PN

Correlation Between Responses

What does Correlation between effects mean? X i =   i =  i  MVN  X i ,  2 

What Do We Do? 1. Create a 'composite' measure –Average –Weighted Average 2. Estimate different coefficients directly 3. Robust Variance Estimation (RVE)

The CO 2 Effect Data experiment Paper Measurement Hedges Var GS PN GS PN GS PN GS PN GS PN

Direct Estimation rma.mv(Hedges ~ Measurement, Var, random =~ Measurement|Paper, data=co2data, struct="HCS")

 and Different Correlation Structures Different structures for different data We do not always know which one is correct!

Estimates of Variance, Covariance Multivariate Meta-Analysis Model (k = 68; method: REML) Variance Components: outer factor: Paper (nlvls = 18) inner factor: Measurement (nlvls = 2) estim sqrt k.lvl fixed level tau^ no GS tau^ no PN rho no

Disadvantages to Multivariate Meta-Analysis 1. Difficult to estimate with few studies 2. Additional assumptions of covariance structure 3. Often little improvement over univariate meta-analysis 4. Publication bias exacerbated if data not missing at random Jackson et al Satist. Med.

Robust Variance Estimation Essentially, bound weights within a group j to 1/mean var j and assume a value of  –Test sensitivity to choice of  –Correct DF for small sample sizes Methods developed by Hedges, Tipton, and others robumeta package in R

robumeta & RVE library(robumeta) robu(Hedges ~ Measurement, data=co2data, studynum=Paper, var.eff.size=Var)

RVE: Correlated Effects Model with Small-Sample Corrections Model: Hedges ~ Measurement Number of studies = 18 Number of outcomes = 68 (min = 2, mean = 3.78, median = 4, max = 10 ) Rho = 0.8 I2 = Tau.Sq = Struct="CS" only so far

Often, Choice of  Matters Little > sensitivity(co2modRVE) Type Variable rho=0 rho=0.2 rho=0.4 rho=0.6 rho=0.8 rho=1 1 Estimate intercept MeasurementPN Std. Err. intercept MeasurementPN Tau.Sq

Results May Differ… Multivariate Meta-Analysis Model Results: estimate se zval pval ci.lb ci.ub intrcpt MeasurementPN * Robust Variance Estimation Model Results: Estimate StdErr t-value df P(|t|>) 95% CI.L 95% CI.U Sig 1 intercept MeasurementPN

Other Sources of Unknown Correlations Shared system types Shared environmental events Labs or investigators Re-sampling experiments Experiments repeated in a region More…

Why Model Correlation instead of Hierarchy? Depends on question Analytical difficulty Leveraging correlation to aid with missing data