Estimating and Testing Mediation David A. Kenny University of Connecticut http://davidakenny.net/cm/MediationN.ppt Overview Introduction (click to go there) The Four-Step Approach (click) The Modern Approach (click) Test the Indirect Effect (click) Power (click) Assumptions (click) DataToText (click) 2 Introduction 3 Interest in Mediation • Mentions of “mediation” or “mediator” in psychology abstracts: – 1980: 36 – 1990: 122 – 2000: 339 – 2010: 1,198 4 Why the Interest in Mediation? • Understand the mechanism – theoretical concerns – cost and efficiency concerns • Find more proximal endpoints • Understand why the intervention did not work – finding the missing link – compensatory processes 5 The Beginning Model: Basic Research 6 The Mediational Model 7 The Beginning Model: Applied Research 8 The Four Paths • • • • X Y: path c X M: path a M Y (controlling for X): path b X Y (controlling for M): path c′ (standardized or unstandardized) 9 The Four-Step Approach 10 In the 1980s Different Pairs of Researchers Proposed a Series of Steps to Test Mediation • Judd & Kenny (1981) • James & Brett (1984) • Baron & Kenny (1986) 11 The Four Steps • Step 1: X Y (test path c) • Step 2: X M (test path a) • Step 3: M (and X) Y (test path b) • Step 4: X (and M) Y (test path c′) Note that Steps 3 and 4 use the same regression equation. 12 Total and Partial Mediation • Total Mediation • Meet steps 1, 2, and 3 and find that c′ equal zero. • Partial Mediation • Meet steps 1, 2, and 3 and find that c′ is smaller in absolute value than c. 13 Example Dataset • Morse et al. – J. of Community Psychology, 1994 – treatment housing contacts days of stable housing – persons randomly assigned to treatment groups. – 109 people (davidakenny.net/papers/twmr/example.txt) 14 Variables in the Example • Treatment – 1 = treated (intensive case management) – 0 = treatment as usual • Housing Contacts: total number of contacts per during the 9 months after the intervention began • Stable Housing – days per month with adequate housing (0 to 30) – Averaged over 7 months from month 10 to month 16, after the intervention began 15 Morse et al. Example Step 1: X Y c = 6.558, p = .009 Step 2: X M a = 5.502, p = .013 Step 3: M (and X) Y b = 0.466, p < .001 Step 4: X (and M) Y c′ = 3.992, p = .090 16 When Can We Conclude Complete Mediation? When c′ is not statistically different from zero, but c is? NOT REALLY Ideally when a and b are substantial and c′ is small, not just non-significant. 17 Standardized vs. Unstandardized For most everything in this presentation, the variables can be standardized making the coefficients Betas. The key thing is to be consistent: If M is standardized in Step 2, it is also standardized in Step 3. 18 Presenting Mediation 19 The Modern Approach 20 Dissatisfaction with the Steps Approach Low power in the test of Step 1 No single measure and test of mediation Work by MacKinnon, Hayes, Preacher and others has led to the “Modern Approach.” 21 Decomposition of Effects Total Effect = Direct Effect + Indirect Effect c = c′ + ab (This equality exactly holds for multiple regression, but not necessarily for other estimation methods.) The Indirect Effect or ab provides one number that summarizes the amount of mediation. Example: 6.558 = (5.502)(0.466) And 100(2.56/6.56) = 39% of the total effect is explained (ab/c or equivalently 1 - c′/c). 22 Two Sides of the Same Coin Note that ab = c - c′ That is, the indirect effect exactly equals the amount of the reduction in the total effect (c) after the mediator is introduced. Some papers (e.g., Baron & Kenny) emphasize one side and others emphasize the other. Current work in mediation is now focused on the “ab” side. 23 Estimating the Total Effect (c) The total effect or c can be inferred from direct and indirect effect as c′ + ab. We need not perform the Step 1 regression to estimate c. This can be useful in situations when c does not exactly equally c′ + ab. 24 Inconsistent Mediation • ab and c′ have a different sign • X as a “suppressor” variable • Example: Stress and Mood with Coping as a Mediator • Consequences – The Total Effect or c may not be significant – Percent mediated greater than 100% • Do we have mediation? – Yes. There is an indirect effect (ab > 0). – No. There is no effect that is “mediated.” 25 What to Present Decomposition of Effects Total Effect Indirect Effect(s) Direct Effects Less of an emphasis on complete versus partial mediation. 26 Test of the Indirect Effect 27 Work on Determining How to Test ab = 0 • The key piece of information in the modern approach is the indirect effect. • Many researchers developed approaches to this problem. 28 Strategies to Test ab = 0 • Test a and b separately • Sobel test • Bootstrapping • Monte Carlo Method 29 Test a and b Separately • Easy to do • Works fairly well • Does not provide a method for a confidence interval for ab. • Seem too much like the oldfashioned Four-Step Approach. 30 Sobel Test of Mediation Compute the square root of a2sb2 + b2sa2 which is denoted as sab Note that sa and sb are the standard errors of a and b, respectively; ta = a/sa and tb = b/sb. Divide ab by sab and treat that value as a Z. So if ab/sab greater than 1.96 in absolute value, reject the null hypothesis that the indirect effect is zero. 31 Example a = 5.502 and b = 0.466 sa = 2.182 and sb = 0.100 ab = 2.56; sab = 1.157 Sobel test Z is 2.218, p = .027 We conclude that the indirect effect is statistically different from zero. Website: http://www.people.ku.edu/~preacher/sobel/sobel.htm 32 Large values of “ab” are more variable than small values (i.e., 0). The distribution of ab is highly skewed which lowers the power of the test. 33 Bootstrapping • “Nonparametric” way of computing a sampling distribution. • Re-sampling (with replacement) • Many trials (computationally intensive) • Current thinking is NOT to correct for bias (i.e., the mean of the bootstrap estimate differs slightly from the estimate). • Compute a confidence interval which is asymmetric. • Slight changes because empirically derived. 34 Results of Bootstrapping 95% Percentile Confidence Interval: Lower Upper .5325 5.0341 Note that the CI is asymmetric for an estimate of 2.598. Also values differ to sampling error. (Done using the Hayes & Preacher macro from http://www.afhayes.com/spss-sas-and-mplus-macros-and-code.html.) 35 Monte Carlo Method • Save a, b, sa (the standard error of a) and sb from the mediation analysis. • For some methods you have the covariance of ab or sab to save. • Use them and the assumption of normality to generate estimates of a and b and so ab. • Use this distribution of ab’s to get a confidence interval or a p value. • Less computationally intensive than bootstrapping and many trials are feasible. 36 Results of the Monte Carlo Method 95% Monte Carlo Confidence Interval (20,000 trials): Lower Upper 0.551 5.158 Done using the Selig & Preacher web program at http://www.quantpsy.org/medmc/medmc.htm; if correlated use Tofighi’s Rmediation at http://www.amp.gatech.edu/RMediation 37 Power 38 Power of the Test of c If c′ is zero, then c equals ab. If both a and b have a moderate effect size, then c has a “smaller than small” effect size (assuming c′ is zero). Thus, it is very possible to have tests of significance for a and b be statistically significant, but c is not. Note that for N = 100, if a and b have moderate effect sizes (r = .3) and c′ = 0, the power of the test of c is only .14 whereas 39 the power for a is .87 and b is .83. Relative Power of c and ab Several authors, as early as Cox (1960), have noted that the indirect effect has much more power than the test of c even when the two equal the same value as when c′ is zero. Kenny and Judd (2014) show that sometimes the test of c need 75 times the number of cases to have the same power as the test of ab! 40 Relative Power of the Test of c and c′ when b = 0 When b = 0, c = c′ and in this case c might be statistically significant, yet c′ might not be significant even though. Why? There is multicollinearity between X and M due to path a. Note that as M becomes a more successful mediator and path a gets larger, multicollinearity becomes more of a issue in the testing of c′ (and b). 41 Power and the Test of b Let b = .3 (standardized). What happens to N need to have 80% power as path a gets larger: a N .1 86 .3 93 .5 112 (a is standardized; sample size needed for 80% power to reject the null hypothesis that b = 0) Conclusion: Power of the test of b declines as a increases. 42 Tool MedPowR R based program to help in power analyses. Can either give the power for a given value of N and a, b, and c’ or give the N needed to achieve a desired level of power. Program can be downloaded at http://davidakenny.net/progs/PowMed.R 43 44 Power calculations have begun... Effect Size Power N c .390 .986 100 a .300 .868 100 b .300 .890 100 c' .300 .890 100 ab .090 .773 100 Alpha for all power calculations set to .050. Power calculations complete. 45 Assumptions 46 Taking Assumptions Seriously • Older Mediation Analysis ignored the strong assumptions required for the analysis. • Current work, especially that within the Causal Inference tradition, focuses much more on them. • Researchers need to conduct “Sensitivity Analyses” or “What If?”analyses. 47 Assumptions: Multiple Regression • Linearity – Is a problem in the example; housing contacts has a quadratic effect on stable housing (p = .034). • Normal Distribution of Errors – Interval level of measurement of M and Y • Equal Error Variance • Independence – No clustering • X and M Do Not Interact to Cause Y 48 No XM Interaction: Linear Mediation • Called “Moderation” in Baron & Kenny • Add XM (and possibly other interaction terms, e.g., X2M) when explaining Y. • Many contemporary analysts now see XM interaction as part of mediation. • Not significant for the example (p = .476) 49 Causal Assumptions • Perfect Reliability – for M and X • No Omitted Variables – all common causes of M and Y, X and M, and X and Y measured and controlled • No Reverse Causal Effects – M and Y not cause X – Y may not cause M (Guaranteed if X is manipulated.) 50 Basic Mediational Causal Model Note that U1 and U2 are theoretical variables and not “errors” from a regression equation. U1 1 M a b U2 1 c' X Y 51 Unreliability • Usually safe to assume that X is perfectly reliable. • Measurement error in Y does not bias unstandardized regression coefficients. • Measurement error in M is problematic. 52 Unreliability in M Error 1 U1 M 1 1 M Latent b a U2 1 c' X Y 53 Effect of Unreliability in M • b is attenuated (closer to zero) • c′ is inflated (given consistent mediation) – more as a increases – more as b increases – Note that the bigger the indirect, the greater the bias in c′. 54 What to Do about Unreliability in M? • Improve the reliability • Adjust estimates using Structural Equation Modeling • Conduct Sensitivity Analyses assuming different values of reliability. 55 Omitted Variables U1 e 1 Omitted Variable M f a b U2 1 c' X Y 56 Other Terms for an Omitted Variable • Third variable. • Confounder – Term used in epidemiology – Becoming increasingly popular 57 What is the Effect of Omitted Variables? • Usually, but not always, the sign of ef is the same as b. – Inflating the estimate of b – Deflates the estimate of c′ (could produce inconsistent mediation). 58 Effect of Vitamin A Supplements in Northern Sumatra 59 "Standard" Analysis 60 Compliance should totally mediate this effect. Why is c' negative? Results with an Omitted Variable 61 Path c' fixed to zero. Omitted variables that causes M and Y. What to Do about Omitted Variables? • Do not omit them: Include them in the analysis as covariates. • If there is good reason to believe that c′ = 0, they can be allowed for. • Sometimes the omitted variable is “shared method effects.” If an issue, measure M and Y by different methods. • Conduct sensitivity analyses. 62 A Comforting Fact • Unreliability in M deflates ab and inflates c′. • An omitted variable usually deflates ab and inflates c′. • As Fritz, Kenny, & MacKinnon have shown these two biases can almost exactly offset each other. 63 Reverse Causation U1 1 M a b g U2 1 c' X Y 64 Effect of Reverse Causation • Typically, b and g have the same sign, which likely makes the value of b inflated and the value of c′ deflated. 65 What to Do about Reverse Causation? • Longitudinal designs • If c′ = 0, then the model can be estimated. • Instrumental variable method. 66 Timing of the Measurement of the Mediator • Mediator should be measured after X but before Y. • X might be measured at the same time as Y (e.g., number of treatment sessions), but it must be assumed that X has not changed since when it affected Y. 67 Controlling for Prior Values • Obtain baseline measures of M and Y. • Control for baseline M and Y in the analysis. 68 DataToText 69 DataToText Project Have the researcher tell DataToText what is the research question. DataToText performs the requisite analyses. DataToText gives the results from those analyses: computer output a written description 70 DataToText Macros • Macro developed to provide text, tables, and figures of a simple mediational analysis. – SPSS version: MedText http://davidakenny.net/dtt/mediate.htm – R version: MedTextR http://davidakenny.net/dtt/mediateR.htm 71 Advantages of DataToText • Does the analyses that should be done, but often are not, e.g., tests for outliers and nonlinearity. • MedTextR issues up to 20 different warnings. • Produces a 3 page text describing the results. • Surprisingly “intelligent” • Graphics 72 73 Links davidakenny.net/dtt/datatotext.htm davidakenny.net/dtt/mediate.htm davidakenny.net/dtt/mediateR.htm davidakenny.net/dtt/MedTextR.pdf 74 Topics Not Discussed Use of SEM simultaneous estimation latent variables (reflective and formative) instrumental variable estimation Causal Inference Approach Mediators or outcomes that are categorical or counted Clustering and multilevel mediation Mediated moderation and moderated mediation 75 Conclusion • Mediational Analyses Are Very Simple • Mediational Analyses Are Very Difficult –Difficulties are more in conceptualization and measurement than in the statistical analysis. davidakenny.net/cm/mediationN.ppt 76