1 Introduction to Statistical Mediation David P. MacKinnon Arizona State University Center for AIDS Prevention Studies, UCSF, June 12-13, 2007 Brown, Cheong, Fairchild, Fritz, Lockwood, Morgan-Lopez, Taylor, Tein, Williams, West, Wang, Yoon Undergraduate Social Psychology Class Graduate School UCLA Quantitative Psychology Drug Prevention Research at USC Support from the National Institute on Drug Abuse MacKinnon, D. P. (2007) Introduction to Statistical Mediation Analysis, Mahwah, NJ: Erlbaum.
2 Goals of CAPS Presentation Describe many mediating variable examples. Describe reasons for mediation analysis--it can help improve prevention programs and reduce their cost. It is also useful for testing theories. Describe the latest methods to assess mediation. Describe limitations of mediation analysis. Describe experimental as well as non-experimental designs to investigate mediating variables.
3 Overview of Presentation Mediation Examples and Definition Statistical Mediation Analysis New tests for Mediation Limitations of Statistical Mediation Analysis Designs to address limitations of Mediation Analysis Summary and Future Directions
4 Psychology Example Stimulus: Multiply 24 and 16 Organism:You Response: Your Answer Organism as a Black Box Stimulus>Organism >Response (SOR) theory whereby the effect of a Stimulus on a Response depends on mechanisms in the organism (Woodworth, 1928). These mediating mechanisms translate the Stimulus to the Response. SOR theory is ubiquitous in psychology.
5 Mediation Statements If norms become less tolerant about smoking then smoking will decrease. If you increase positive parental communication then there will be reduced symptoms among children of divorce. If children are successful at school they will be less anti-social. If unemployed persons can maintain their self-esteem they will be more likely to be reemployed. If pregnant women know the risk of alcohol use for the fetus then they will not drink alcohol during pregnancy.
6 Mediator Definition and Examples A variable that is intermediate in the causal process relating an independent to a dependent variable. Attitudes cause intentions which then cause behavior (Azjen & Fishbein, 1980) Prevention programs change norms which promote healthy behavior (Judd & Kenny, 1981) Exposure to an argument affects agreement with the argument which affects behavior (McGuire, 1968)
7 More Mediation Examples P Psychotherapy induces catharsis, insight, and other mediators which lead to a better outcome (Freedheim & Russ, 1981) P Psychotherapy changes attributional style which reduces depression (Hollon, Evans, & DeRubies, 1991) P Parenting programs reduce parents’ negative discipline which reduces symptoms among children with ADHD (Hinshaw, 2002).
8 CAPS Mediation Examples Social problem solving affects psychological health which affects adherence to HIV medications (Johnson et al., 2006) Girl/boy friend in 7 th grade affects peer norms about sexual behavior which affects sexual behavior in 9 th grade (VanOss et al., 2006) Condom promotion program changes attitudes about sexual enjoyment from condoms which changes condom use (Choi et al., 2007). Affective regulation affects stimulant use and nonadherence to medications which affects viral load (Carrico et al., 2007).
9 Mediation Analysis in Treatment and Prevention Research Mediation is important for prevention and treatment research. Practical implications include reduced cost and more effective treatments. Mediation analysis is based on theory for the processes underlying treatments. Action theory corresponds to how the treatment will affect mediators—the X to M relation. Conceptual Theory focuses on how the mediators are related to the outcome variables—the M to Y relation (Chen, 1990, Lipsey, 1993).
10 Questions about mediators for treatment and prevention. Are these the right mediators? Are they causally related to the outcome? Is self-esteem causally related to symptoms? Conceptual Theory Can these mediators be changed? Can personality be changed? Action Theory Will the change in these mediators that we can muster with our treatment be sufficient to lead to desired change in the outcome? Do we have the resources to change self-esteem in four sessions? Both Action and Conceptual Theory.
11 Quotes about mediation analysis In the absence of a concern for such mediating or intervening mechanisms, one ends up with facts, but with incomplete understanding (Rosenberg, 1968, p. 63)... much of what social psychologists do is attempt to understand how internal processes mediate the effect of the situation on behavior (Kenny, Kashy, & Bolger, 1998, p. 259).
12 More Quotes Nursing “.. Should consider hypotheses about mediators …. that could provide additional information about why an observed phenomenon occurs” (Bennett, 2000). Children’s programs “.. Including even one mediator ….. in a program theory and testing it with the evaluation.. will yield more fruit….” (Petrosino, 2000) Child mental health “rapid progress … depends on efforts to identify … mediators of treatment outcome. We recommend randomized clinical trials routinely include and report such analyses” (Kraemer et al., 2002).
13 “Everyone talks about the weather but nobody does anything about it.” (Mark Twain)
14 Mediation Examples Residential instability reduced collective efficacy which increased violence (neighborhoods, Sampson et al., 1997) Anabolic prevention program affects norms regarding healthy behavior which reduced intentions to use steroids (Krull & MacKinnon, 1999; 2001). Alcohol prevention program affected norms which reduced alcohol use, (Komro et al., 2001)
15 Mediation is important because … Central questions in many fields are about mediating processes. Important for basic research on mechanisms of effects. Critical for applied research, especially prevention and treatment. Many interesting statistical and mathematical issues.
16 2, 3, or 4, variable effects Two variables: X Y, Y X, X Y are reciprocally related. Measures of effect include the correlation, covariance, regression coefficient, odds ratio, mean difference. Three variables: X M Y, X Y M, Y X M, and all combinations of reciprocal relations. Special names for third-variable effects, confounder, mediator, moderator/interaction. Four variables: many possible relations among variables, e.g., X Z M Y
17 Mediator versus Confounder Confounder is a variable related to two variables of interest that falsely obscures or accentuates the relation between them (Meinert & Tonascia, 1986). The definition below is also true of a confounder because a confounder also accounts for the relation but it is not intermediate in a causal sequence. In general, a mediator is a variable that accounts for all or part of the relation between a predictor and an outcome (Baron & Kenny, 1986, p.1176).
18 Mediator versus Moderator Moderator is a variable that affects the strength of the relation between two variables. The variable is not intermediate in the causal sequence so it is not a mediator. Moderator is usually an interaction, the relation between X and Y depends on a third variable. There are other more detailed definitions of a moderator.
19 Other names for Variables in the Mediation Model Antecedent to Mediating to Consequent (James & Brett, 1984). Initial to Mediator to Outcome (Kenny, Kashy & Bolger, 1998). Program to surrogate endpoint to ultimate endpoint (Prentice, 1989). Independent to Mediating to Dependent used in this presentation.
20 Three ways to specify a model Verbal description: A variable M is intermediate in the causal sequence relating X to Y. Diagram Equations
21 Mediation Regression Equations -Start here with the simplest mediation model with one mediator. -Tests of mediation use information from some or all of three equations -The coefficients in the equations may be obtained using methods such as ordinary least squares regression, covariance structure analysis, or logistic regression.
22 Single Mediator Model MEDIATOR M INDEPENDENT VARIABLE XY DEPENDENT VARIABLE ab c’
23 Relation of X to Y MEDIATOR M INDEPENDENT VARIABLE XY DEPENDENT VARIABLE c 1.The independent variable is related to the dependent variable: Y = i 1 + c X +
24 Relation of X to M MEDIATOR M INDEPENDENT VARIABLE XY DEPENDENT VARIABLE 2. The independent variable is related to the potential mediator: M = i 2 + a X + a
25 Relation of X and M to Y MEDIATOR M INDEPENDENT VARIABLE XY DEPENDENT VARIABLE a 3. The mediator is related to the dependent variable controlling for exposure to the independent variable: Y = i 3 + c’ X + b M + b c’
26 Mediated Effect Measures Mediated effect=ab Standard error= Mediated effect=ab=c-c’ (MacKinnon et al., 1995) Direct effect= c’ Total effect= ab+c’=c Test for significant mediation: z’=Compare to empirical distribution of the mediated effect abab
27 Assumptions I For each method of estimating the mediated effect based on Equations 1 and 3 (c-c’) or Equations 2 and 3(ab): Predictor variables are uncorrelated with the error in each equation. Errors are uncorrelated across equations. Predictor variables in one equation are uncorrelated with the error in other equation. Reliable and valid measures No omitted influences. Normally distributed variables
28 Assumptions II Data are a random sample from the population of interest. Coefficients, a, b, c’ reflect true causal relations and the correct functional form. Mediation chain is correct: Temporal ordering is correct X before M before Y. Any mediation model is part of a longer mediation chain. The researcher decides what part of the micromediational chain to examine. Homogeneous effects across subgroups: The relation from X to M and from M to Y are homogeneous across subgroups or other characteristics of participants in the study. Routine to test XM interaction in Equation 3. This means there are not moderator effects.
29 Three Major Types of Single Sample Tests for the Mediation Effect (1) Causal Steps: Series of tests described in Baron and Kenny (1986) for example. (2) Difference in Coefficients: c-c’, e.g., from Clogg et al. (1992) (3) Product of Coefficients: ab, e.g., from Sobel (1982) See MacKinnon et al., Psychological Methods (2002) for a review and comparison of single sample tests
30 Causal Steps Tests of Mediation Judd & Kenny (1981), 3 Steps plus Step 4 c’ is nonsignificant Baron & Kenny (1986), 3 Steps plus Step 4 drop from c to c’ Test of whether the a and b paths are statistically significant (MacKinnon et al., 2002).
31 Difference in Coefficients Significance test: t N-2 = (c-c’)/s c-c’ General formula for s 2 c-c’ : s 2 c-c’ = s 2 c + s 2 c’ -2s cc’ Clogg, Petkova, and Shihadeh (1992) s 2 c-c’ =(s c’ |r xm |) 2
32 Product of Coefficients Formulas for the variance of ab Multivariate delta variance: Sobel (1982), Folmer (1981) s 2 ab =a 2 s 2 b + b 2 s 2 a Exact variance: Aroian (1944) s 2 ab =a 2 s 2 b + b 2 s 2 a +s 2 a s 2 b Unbiased variance: Goodman (1960) s 2 ab =a 2 s 2 b + b 2 s 2 a -s 2 a s 2 b Test based on the distribution of the product of two random variables using critical values from Meeker et al. (1988) using a program called PRODCLIN.
33 Empirical Sample size estimates for.8 power to detect the mediated effect TestS-S S-M S-LM-SM-MM-LL-SL-ML-L Baron/Kenny (τ’ = 0) a & b Joint Delta PRODCLIN Note: Table entries are based on empirical simulation so they are not exact. Fritz & MacKinnon (2007).
34 Reasons for Differences Among Methods Requirement for significant total effect, c, and requirement that c’ is nonsignificant reduces accuracy of causal steps methods. Assumption that the mediated effect divided by its standard error has a normal distribution is incorrect for some values. Mediation is a test of two paths corresponding to a and b paths.
35 Distribution of the Product The mediated effect is the product of two coefficients a and b. The distribution of the product has a normal distribution only in special cases. At low values of a and b, the distribution has excess kurtosis and skewness, e.g. when a and b are both zero, kurtosis is 6. It is not surprising that the confidence limits are inaccurate if the distribution is assumed to be normal. One solution is to use the distribution of the product in statistical tests and confidence limits.
36
37 PRODCLIN (distribution of the PRODuct Confidence Limits for the INdirect effect) MacKinnon, Fritz, Williams, and Lockwood, (In Press, Behavior Research Methods) describes program to compute critical values for the distribution of the product. Web location includes programs in SAS, SPSS, and R that access a FORTRAN program. Input a, s a, b, s b, correlation between a and b, and Type I error rate. Output includes the input values and normal and distribution of the product confidence limits.
38 Critical Values for Distribution of the Product Because the distribution of the product is not normal, there are different critical values for the distribution for each value of a/s a and b/s b. The critical values are and for the 95% confidence interval from the normal distribution. There are different upper and lower critical values for the distribution of the product. Confidence limits and significance tests are more accurate using the critical values from the distribution of the product (MacKinnon et al. 2004).
39 Example Calculations using the Distribution of the Product For example, a =.3386, s a =.1224, b=.4510, s b = Enter these values in the PRODCLIN program. PRODCLIN returns the critical value for the 2.5% percentile, M lower = and M upper = the critical value for the 97.5% percentile. Use the critical values to calculate upper and lower confidence limits. LCL= ab + M upper s ab = ( ) (.0741) UCL= ab + M lower s ab = ( )(.0741) Asymmetric Confidence Limits are (.0329,.3197)
40 Resampling Methods -Another good option for data that do not have a normal distribution is resampling methods (MacKinnon et al. 2004). -Bootstrap method for mediated effects was described by Bollen & Stine (1991), Lockwood & MacKinnon (1998), MacKinnon et al., (2004) and Shrout & Bolger (2002) -Purpose is to use the data itself to form a distribution of a statistic (Manly, 1997). Does not make as many assumptions and can handle nonnormal distributions. -The value of a statistic in the observed sample is compared to the distribution of the statistic formed by resampling from the data a large number of times.
41 Bootstrap Test for Mediation -Estimate the mediated effect in the sample. -Make a new data set by sampling N subjects data with replacement and estimating the mediated effect in each of a large number (1000) of bootstrap samples. -Determine significance level by locating the mediated effect for the observed sample in the distribution of the bootstrap sample. Find 2.5% and 97.5% values for confidence interval. -Bias-corrected bootstrap makes a correction for the difference between the observed and average bootstrapped mediated effect.
42 Statistical Mediation Tests Summary Three general types of tests, causal steps, difference in coefficients, and product of coefficients. Tests differ substantially in Type I error and statistical power. Requirement of significant X to Y relation and assumed normal distribution of the mediated effect reduces power. Best tests are based on the distribution of the product and resampling methods.
43 Quotes about mediation analysis In the absence of a concern for such mediating or intervening mechanisms, one ends up with facts, but with incomplete understanding (Rosenberg, 1968, p. 63)... much of what social psychologists do is attempt to understand how internal processes mediate the effect of the situation on behavior (Kenny, Kashy, & Bolger, 1998, p. 259).
44 Reasons for Mediation analysis in prevention research. 1. Manipulation check. Did the program change the mediators it was designed to change? 2. Program Improvement. What do the program effects on mediators suggest about program improvements? 3. Measurement Improvement. Is a lack of program effects due to poor measurement? 4. Delayed effects. Will program effects on the dependent variable emerge later? 5. Test the process of mediation. Was the theory-based prediction of mediation correct? 6. Practical implications. Can the program be redesigned to cost less and be more efficient?
45 Interpretation of Mediation Results in prevention research. Program effect on mediator but not outcome. The mediator may not be causally related to the outcome. Lack of power or insufficient measurement—explanations for all null effects below. Program effect on the outcome but not the mediator. The program did not affect the intended mediator. Other constructs were mediators. No program effects on the outcome or the mediator. Program was ineffective, lack of statistical power. Program effects on the mediator and the outcome but nonsignificant mediation. The mediator may not be causally related to the outcome. Program effects on the mediator and the outcome and significant mediation. Program was effective and there is evidence for the hypothesized mediating mechanism.
46 Causal Inference for Mediation The Rubin Causal Model (RCM, Rubin, 1974) describes a general way to interpret evidence for causal relations, developed to interpret non- experimental as well as experimental research. It is a solution not a problem. Helpful because the RCM clearly displays limits and strengths of models, including mediation.
47 Counterfactual Counterfactual is central to modern causal inference. The counterfactual refers to conditions in which a participant could serve, not just the condition that they did serve in. For example, for a participant in the treatment group, the counterfactual is the same participant in the control group. For a participant in the control group, the counterfactual is the same participant in the treatment group.
48 Why b and c’ do not reflect a causal relation? Because M is not under experimental control, and M is both a dependent and independent variable, b and c’ do not necessarily represent causal effects. Need: The relation between M and Y for participants in the treatment group if they were in the control group; the relation between M and Y for control participants if they instead were in the treatment group. Coefficients b and c’ are not clearly causal effects, because M is not randomly assigned making the counterfactuals for these relations complicated.
49 Causal inference for mediation -Counterfactual idea helps organize causal inference and highlights ambiguity regarding interpretation of c’ and b coefficients as causal effects. -In treatment and prevention, the M to Y, b, relation is based on prior research and theory. It is all we consider known. -Do we need to know the true causal structure to make good decisions based on research? Is a descriptive model sufficient? -Can we ever know the true causal relation among variables? “Science in no case can demonstrate any inherent necessity in a sequence, nor prove with absolute certainty that it must be repeated” (Pearson p. 113, Grammar of Science, 1957).
50 Improving Mediation Inference using the Rubin Causal Model Statistical approaches to improving causal inference from a mediation study: (I). Instrumental Variable Methods, Holland 1988; Sobel (II). Principal Stratification and latent classes; Frangakis & Rubin, 2002; Jo, Both approaches use aspects of the data such as no direct effect or stratifications of types of participants, such as compliers, never compliers etc. to improve inference regarding b and c’.
51 Design Approaches to Causal Inference Statistical mediation analysis answers the following question, “How does a researcher use measures of the hypothetical intervening process to increase the amount of information from a research study?” Another question is, “What is the best next study or studies to conduct after a statistical mediation analysis to further test mediation theory.” Five general approaches: (1) double randomization, (2) blockage, (3) enhancement, (4) purification, (5) pattern matching for multiple variables, subgroups, settings, time, and alternative manipulations (Mark, 1986).
52 (1) Double Randomization If the problem with the b path is that M is not randomly assigned, then how about randomizing both X in the X to M relation and randomizing M in the M to Y relation. Say X is randomized and there was a significant effect of X on M in Study 1. In Study 2, an experiment was set up so that M was randomized to levels defined by how X changed M in Study 1. If there was a significant relation of M to Y in Study 2, then there is more evidence for mediation.
53 Wood et al. (1974) Overview Study of self-fulfilling prophecy cited in Spencer et al., (2005). Race (X) predicts quality of interview (M) and quality of interview predicts performance (Y). Confederate—Person assisting with the experiment. The confederates are used to manipulate factors. Confederate applicants were used in Study 1 for the X to M relation and confederate interviewers were used in Study 2 for the M to Y relation.
54 Wood et al., (1974) Study 1. White participants interviewed either Black or White confederate applicants (X). The dependent variable M, was interview quality and participants with Black confederate applicants gave poorer quality interviews (M). Study 2. Confederates gave either an interview (M) like White applicants were interviewed in Study 1 or like Black applicants in Study 1. This manipulation had a significant effect on applicant performance. So randomization was used for the X to M relation and the M to Y relation.
55 Prevention Example (MacKinnon et al., 2002) Norms increase exercise which decreases depression. Study 1, X to M: Similar to existing prevention studies, participants either receive a social norm manipulation to increase exercise or not (X) and exercise is measured (M). Study 2, M to Y: Participants are randomly assigned to conduct an amount of exercise (M) obtained in the program group or the control from Study 1 and depression is measured (Y). Help. If you know or think of other studies like this please let me know!
56 Double Randomization Problems Most problems center around the randomization of the mediator so that it corresponds to the change in the mediator in the X to M study. Study 2 is a mediation model with a manipulation (X) that should change M in the same way as X changed M in Study 1. So Study 2 data is analyzed with statistical mediation analysis.
57 (2) Blockage Designs The goal of blockage designs is to test a mediation relation with a manipulation that blocks the mediator from operating. For example, lets say that an exercise program appears to reduce depression by increasing endorphins-- the hypothesized mediator. A blockage manipulation would administer a drug to prevent endorphins so that persons receiving the exercise program would no longer experience reduced depression if the endorphins is the mediator.
58 (3) Enhancement Designs The goal of enhancement designs is to deliver interventions that enhance the effects of a hypothesized mediator. For example, lets say that an addiction treatment program reduces remission by improving social support. An enhancement design would increase social support even more to demonstrate a larger effect on remission. Social support may be increased by more exposure to a therapist, additional contact with friends and family etc.
59 (4) Purification Designs The goal of purification designs is to reduce a manipulation to its critical ingredients. For example, in drug prevention research, it appears that changes in norms, beliefs about positive consequences of drugs, and intentions to avoid drugs appear to the most important mediators of drug prevention programs. A purification design would retain only those program components that address these mediators to test whether the purer program changes drug use.
60 (5) Pattern Matching The goal of pattern matching is to specify patterns of results based on mediation theory. Different types of studies and information are used to assess whether the pattern of results is consistent with mediation theory. Multiple variables: a mediation relation is observed for one variable but not another. For example, change in beliefs about positive consequences of alcohol use is a mediator for alcohol use but not for tobacco use. Changes in beliefs about positive consequences is a statistical mediator but changes in beliefs about negative consequences is not.
61 More Pattern Matching Examples Moderators: For example, prevention program effects are most effective for persons low on the mediator at baseline. Setting: An intervention to change norms that then changes behavior should be more successful in a setting where more norm change may occur. Different Manipulations: A different manipulation that should change the same theoretical mediator should lead to the same results.
62 Summary Mediation theory is central to many fields and critical for treatment and prevention research. Statistical mediation analysis of a single study yields important but potentially limited information. Experimental designs to follow mediation analysis provide more evidence for a mediation relation. Note that statistical mediation analysis of data from experimental designs may also yield additional information.
63 Future Directions Causal inference for mediation will continue to be an active area of research. Programs of research are needed to investigate mediators. Must consider other evidence including clinical judgment, theory, case studies, and replication studies. Statistical mediation analysis for some methods is still needed, e.g. survival analysis, longitudinal data, generalized linear model. Need more applications of mediation analysis.
64 Hypothesized Effects of a Presentation on Mediation Analysis CAPS Talk on Mediation Analysis # Studies with Mediation Analysis Interest in Mediation Methods Norms Regarding Reporting Results of Studies Comprehension of Reasons for Mediation Analysis Beliefs About the Importance of Theory Testing
65 THE END