The Campbell Collaborationwww.campbellcollaboration.org Introduction to Robust Standard Errors Emily E. Tanner-Smith Associate Editor, Methods Coordinating.

Slides:



Advertisements
Similar presentations
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Advertisements

Analysis of variance (ANOVA)-the General Linear Model (GLM)
The Campbell Collaborationwww.campbellcollaboration.org Moderator analyses: Categorical models and Meta-regression Terri Pigott, C2 Methods Editor & co-Chair.
Multivariate distributions. The Normal distribution.
Econometric Details -- the market model Assume that asset returns are jointly multivariate normal and independently and identically distributed through.
EVAL 6970: Meta-Analysis Meta-Regression and Complex Data Structures Dr. Chris L. S. Coryn Spring 2011.
Longitudinal Experiments Larry V. Hedges Northwestern University Prepared for the IES Summer Research Training Institute July 28, 2010.
Chapter 10 Simple Regression.
CHAPTER 3 ECONOMETRICS x x x x x Chapter 2: Estimating the parameters of a linear regression model. Y i = b 1 + b 2 X i + e i Using OLS Chapter 3: Testing.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Generalized Regression Model Based on Greene’s Note 15 (Chapter 8)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Clustered or Multilevel Data
Chapter 8 Estimation: Single Population
Chapter 15 Panel Data Analysis.
Overview of Meta-Analytic Data Analysis. Transformations Some effect size types are not analyzed in their “raw” form. Standardized Mean Difference Effect.
Analysis of Variance & Multivariate Analysis of Variance
Maximum likelihood (ML)
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
GEE and Generalized Linear Mixed Models
Regression and Correlation Methods Judy Zhong Ph.D.
Chapter 9 Two-Sample Tests Part II: Introduction to Hypothesis Testing Renee R. Ha, Ph.D. James C. Ha, Ph.D Integrative Statistics for the Social & Behavioral.
Inference for regression - Simple linear regression
Overview of Meta-Analytic Data Analysis
  Meta-Analysis of Correlated Data. Common Forms of Dependence Multiple effects per study –Or per research group! Multiple effect sizes using same.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
ANOVA Greg C Elvers.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Error Component Models Methods of Economic Investigation Lecture 8 1.
Introduction Multilevel Analysis
Andrew Thomson on Generalised Estimating Equations (and simulation studies)
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Meta-analysis 統合分析 蔡崇弘. EBM ( evidence based medicine) Ask Acquire Appraising Apply Audit.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
The Campbell Collaborationwww.campbellcollaboration.org C2 Training: May 9 – 10, 2011 Introduction to meta-analysis.
Dependencies Complex Data in Meta-analysis. Common Dependencies Independent subgroups within a study (nested in lab?) Multiple outcomes on the same people.
Regression Chapter 16. Regression >Builds on Correlation >The difference is a question of prediction versus relation Regression predicts, correlation.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
ANOVA: Analysis of Variance.
Generalised method of moments approach to testing the CAPM Nimesh Mistry Filipp Levin.
Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
ANCOVA. What is Analysis of Covariance? When you think of Ancova, you should think of sequential regression, because really that’s all it is Covariate(s)
FIXED AND RANDOM EFFECTS IN HLM. Fixed effects produce constant impact on DV. Random effects produce variable impact on DV. F IXED VS RANDOM EFFECTS.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Analysis of Experiments
Tutorial I: Missing Value Analysis
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
ANCOVA.
CHAPTER- 3.2 ERROR ANALYSIS. 3.3 SPECIFIC ERROR FORMULAS  The expressions of Equations (3.13) and (3.14) were derived for the general relationship of.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Canadian Bioinformatics Workshops
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
BINARY LOGISTIC REGRESSION
Esman M. Nyamongo Central Bank of Kenya
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
Multiple Regression.
Comparisons among methods to analyze clustered multivariate biomarker predictors of a single binary outcome Xiaoying Yu, PhD Department of Preventive Medicine.
Confidence Interval Estimation
Presentation transcript:

The Campbell Collaborationwww.campbellcollaboration.org Introduction to Robust Standard Errors Emily E. Tanner-Smith Associate Editor, Methods Coordinating Group Research Assistant Professor, Vanderbilt University Campbell Collaboration Colloquium Copenhagen, Denmark May 30 th, 2012

The Campbell Collaborationwww.campbellcollaboration.org Outline Types of dependencies Dealing with dependencies Robust variance estimation Practical considerations – Choosing weights – Handling covariates Robust variance estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Types of Dependencies Most meta-analysis techniques assume effect sizes are statistically independent But there are many instances when you might have dependent effect sizes – Multiple measures of the same underlying outcome construct – Multiple measures across different follow-up periods – Multiple treatment groups with a common control group – Multiple studies from the same research laboratory

The Campbell Collaborationwww.campbellcollaboration.org Types of Dependencies Assume T i = θ i + ε i, where – T i is the effect size estimate – θ i is the effect size parameter – ε i is the estimation error Statistical dependence can arise because – ε i are correlated – θ i are correlated – or both

The Campbell Collaborationwww.campbellcollaboration.org Correlated Effects Model (ε i correlated) kj correlated estimates of the study specific ES τ 2 = between study variation in ES Meta-regression Dependent effect size meta- regression Study 1, θ1Estimate 1.1 of θ1Estimate 1.2 of θ1Estimate 1.3 of θ1Study 2, θ2Estimate 2.1 of θ2Study 3, θ3Estimate 3.1 of θ3Estimate 3.2 of θ3

The Campbell Collaborationwww.campbellcollaboration.org Hierarchical Model (θ i correlated) ω 2 = between-study, within-cluster variation in ES τ 2 = between cluster variation in average ES Meta- regression Dependent effect size meta- regression Cluster 1, θ1 Study 1.1 estimate of θ1.1 Study 1.2 estimate of θ1.2 Study 1.3 estimate of θ1.3 Cluster 2, θ2Study 2.1 estimate of θ2.1Cluster 3, θ3Study 3.1 estimate of θ3.1 Study 3.2 estimate of θ3.2

The Campbell Collaborationwww.campbellcollaboration.org Dealing with Dependencies Ignore it and analyze the effect sizes as if they are independent ( not recommended ) Select a set of independent effect sizes – Create a synthetic mean effect size – Randomly select one effect size – Choose the “best” effect size Model the dependence with full multivariate analysis – This requires information on the covariance structure Use robust variance estimation (Hedges, Tipton, & Johnson, 2010)Hedges, Tipton, & Johnson, 2010

The Campbell Collaborationwww.campbellcollaboration.org Variance Estimation Assume T = Xβ + ε, where T = (T 1,…,T m )’ T j is a k j x 1 vector of effect sizes for study j X = (X 1,…,X m )’ X j is a k j x p design matrix for study j β = (β 1,…,β p )’ β is a 1 x p vector of unknown regression coefficients ε = (ε 1,…,ε m )’ ε j is a k j x 1 vector of residuals for study j E( ε j ) = 0, V( ε j ) = Σ j

The Campbell Collaborationwww.campbellcollaboration.org Variance Estimation We can estimate β by: And the covariance matrix for this estimate is The problem is that although the variances in Σ j are known, the covariances are UNKNOWN

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimator (RVE) The RVE of b is where e j = T j – X jb is the k j x 1 estimated residual vector for study j

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimator (RVE) A robust test of H 0 : β a = 0 uses the statistic where v R aa is the a th diagonal of the V R matrix Note: the t-distribution with df = m - p should be used for critical values

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimator (RVE) Under regularity conditions and as m -> ∞, V m R is a consistent estimator of the true covariance matrix RVE theorem is asymptotic in the number of studies m, not the number of effect sizes Results apply to any type of dependency No distributional assumptions needed for the effect sizes Correlations do not need to be known or specified, though may impact the standard errors RVE theorem applies for any set of weights

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Choosing Weights Although the RVE works for any weights, the most efficient weights are inverse-variance weights – In the hierarchical model:

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Choosing Weights In the correlated effects model, we can estimate approximately efficient weights by assuming a simplified correlation structure: – Within each study j, the correlation between all pairs of effect sizes is a constant ρ – ρ is the same in all studies – k j sampling variances within the study are approximately equal with average V  j

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Choosing Weights In the correlated effects model, we can take a conservative approach when calculating weights by also assuming ρ = 1, and weights become: Conservative approach, because studies do not receive additional weight for contributing multiple effect sizes

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Choosing Weights In the correlated effects model, ρ also occurs in the estimator of τ 2 : Use external information about ρ if available (test reliabilities, correlations reported in studies, etc.) Take a sensitivity approach when estimating τ 2 by estimating the model with various values of ρ in (0, 1)

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Choosing Weights For the correlated effects model, RVE software is currently programmed to default to (per Hedges, Tipton, & Johnson recommendation): – Conservative approach to estimate weights (assume ρ = 1) – User must specify ρ for estimation of τ 2 ; sensitivity tests recommended

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates Some covariates may vary within groups (i.e., studies or clusters) and between groups, e.g., – Length of follow-up after intervention – Time frame of outcome measure – Outcome reporter (self-report vs. parent-report) – Type of outcome construct (frequency vs. quantity of alcohol use) When modeling the effects of a covariate, ask if the effect of interest is between- or within-groups

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates In a standard meta-regression with independent effect sizes, T j = β 0 + X j β 1 + … where X j is length to follow-up, β 0 and β 1 can be interpreted as: – β 0 = the average effect size when X j = 0 e.g. the average effect size in studies in which the intervention just occurred – β 1 = the effect of a 1-unit increase in X j on T j e.g. the effect size change associated with moving from a study in which the intervention just occurred to a study in which the effect size was measured at a 1 month posttest follow-up

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates In the correlated effects model, for a fixed study ( j = 1), now assume there are multiple outcomes. This study has its own regression equation: T i1 = β 01 + X i1 β 11 +… The coefficients β 01 and β 11 can be interpreted as: – β 01 = the average effect size when X i1 = 0 e.g. the average effect size for units in the study ( j = 1) when the intervention just occurred – β 11 = the effect of a 1-unit increase in X i1 on T i1 e.g. the effect size change for units in the study ( j = 1) at the time of intervention and at follow-up 1 month later

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates When using the RVE, these two types of regression occur in one analysis: Within GroupT ij = β 0j + X ij β 2 + … Between Groupβ 0j = β 0 + X  j β 1 + … These two regressions are combined into one analysis and model: T ij = β 0 + X ij β 2 + X  j β 1 + …

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates To properly separate estimation of within- and between- group effects of covariates, use group mean centering: X cij = X ij – X  j where X  j is the mean value of X ij in group j (and where group is either study or cluster). So now, T ij = β 0 + X cij β 2 + X  j β 1 + … If you don’t center X ij you are actually modeling a weighted combination of the within- and between-study effect, which is difficult to interpret

The Campbell Collaborationwww.campbellcollaboration.org Practical Issues: Handling Covariates When using a covariate, ask if the effect of interest is between- or within-groups Make sure to group-center your within-group variables Acknowledge that if only a few groups have variability in X ij – Within-group estimate of β 2 (associated with X cij ) will be imprecise (i.e. have a large standard error) – The types of groups in which X ij varies may be different than (i.e. not representative of) groups in which X ij does not vary

The Campbell Collaborationwww.campbellcollaboration.org Calculating Robust Variance Estimates Variables you will need in your dataset – Group identifier (e.g., study/cluster identification number) – Effect size estimate – Variance estimate of the effect size – Any moderator variables or covariates of interest Additional pieces of information you will need to specify – In a correlated effects model: assumed correlation between all pairs of effect sizes ( ρ ) – Fixed, random, hierarchical, or user-specified weights

The Campbell Collaborationwww.campbellcollaboration.org Stata ado file available at SSC archive: type ssc install robumeta SPSS macro available at: R functions available at: Calculating Robust Variance Estimates

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata Install robumeta.ado file from the Statistical Software Components (SSC) archive

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata Access example datasets and syntax/output files here:

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata robumeta.ado command structure robumeta depvar [indepvars] [if] [in], study(studyid) variance(variancevar) weighttype(weightingscheme) rho(rhoval) [options]

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata Example of a correlated effects model (correlated ε) Fictional meta-analysis on the effectiveness of alcohol abuse treatment for adolescents Effect sizes represent post-treatment differences between treatment and comparison groups on some measure of alcohol use (positive effect sizes represent beneficial treatment effects) – Number of effect sizes k = 172 – Number of studies m = 39 – Average number of effect sizes per study = 4.41

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata 4 moderators of interest: 2 vary within and between studies, 2 vary between studies only

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata To model both the within- (X cij ) and between- effects (X  j ) of the type of alcohol outcome and follow-up time frame, create group mean and group mean centered variables

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata Let’s say we have a similar meta-analysis, but now need to estimate a hierarchical model (correlated θ ) Effect sizes represent post-treatment differences between treatment and comparison groups on some measure of alcohol use (positive effect sizes represent beneficial treatment effects) – Number of effect sizes k = 68 – Number of research labs m = 15 – Average number of effect sizes per research lab = 4.5

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata τ 2 – between lab variance component; ω 2 between-study within-lab variance component

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata 5 moderators of interest: all vary within and between clusters (research labs)

The Campbell Collaborationwww.campbellcollaboration.org Robust Variance Estimation in Stata To model both the within- (X cij ) and between- effects (X  j ) of the covariates of interest, create group mean and group mean centered variables

The Campbell Collaborationwww.campbellcollaboration.org

Conclusions & Recommendations Robust variance estimation is one way to handle dependencies in effect size estimates, and allows estimation of within- and between-study effects of covariates – Method performs well when there are 20 or more studies with an average of 2 or more effect size estimates per study Choose the proper model for the type of dependencies in your data (correlated ε or correlated θ )

The Campbell Collaborationwww.campbellcollaboration.org Conclusions & Recommendations When using the correlated effects model (correlated ε ), with efficient weights, if you have no information on ρ: – Use a sensitivity approach for estimating τ 2 – Assume ρ = 1 in your weights, i.e., For each covariate X ij in your model, remember that you can estimate: – Between-group effect: group mean (X  j ) – Within-group effect: group mean centered variable (X cij = X ij – X  j )

The Campbell Collaborationwww.campbellcollaboration.org Recommended Reading Hedges, L. V., Tipton, E., & Johnson, M. C. (2010). Robust variance estimation in meta-regression with dependent effect size estimates. Research Synthesis Methods, 1,

The Campbell Collaborationwww.campbellcollaboration.org P.O. Box 7004 St. Olavs plass 0130 Oslo, Norway