36.1 Traditional Approach

Baron and Kenny (1986) is outdated because of step 1, but we could still see the original idea.

3 regressions

  • Step 1: \(X \to Y\)

  • Step 2: \(X \to M\)

  • Step 3: \(X + M \to Y\)

where

  • \(X\) = independent (causal) variable

  • \(Y\) = dependent (outcome) variable

  • \(M\) = mediating variable

Note: Originally, the first path from \(X \to Y\) suggested by (Baron and Kenny 1986) needs to be significant. But there are cases in which you could have indirect of \(X\) on \(Y\) without significant direct effect of \(X\) on \(Y\) (e.g., when the effect is absorbed into M, or there are two counteracting effects \(M_1, M_2\) that cancel out each other effect).

Unmediated model
Unmediated model

where \(c\) is the total effect

where

  • \(c'\) = direct effect (effect of \(X\) on \(Y\) after accounting for the indirect path)

  • \(ab\) = indirect effect

Hence,

\[ \begin{aligned} \text{total effect} &= \text{direct effect} + \text{indirect effect} \\ c &= c' + ab \end{aligned} \]

However, this simple equation does not only hold in cases of

  1. Models with latent variables
  2. Logistic models (only approximately). Hence, you can only calculate \(c\) as the total effect of \(c' + ab\)
  3. Multi-level models (Bauer, Preacher, and Gil 2006)

To measure mediation (i.e., indirect effect),

  1. \(1 - \frac{c'}{c}\) highly unstable (D. P. MacKinnon, Warsi, and Dwyer 1995), especially in cases that \(c\) is small (not re* recommended)
  2. Product method: \(a\times b\)
  3. Difference method: \(c- c'\)

For linear models, we have the following assumptions:

  1. No unmeasured confound between \(X-Y\), \(X-M\) and \(M-Y\) relationships.

  2. \(X \not\rightarrow C\) where \(C\) is a confounder between \(M-Y\) relationship

  3. Reliability: No errors in measurement of \(M\) (also known as reliability assumption) (can consider errors-in-variables models)

Mathematically,

\[ Y = b_0 + b_1 X + \epsilon \]

\(b_1\) does not need to be significant.

  1. We examine the effect of \(X\) on \(M\). This step requires that there is a significant effect of \(X\) on \(M\) to continue with the analysis

Mathematically,

\[ M = b_0 + b_2 X + \epsilon \]

where \(b_2\) needs to be significant.

  1. In this step, we want to the effect of \(M\) on \(Y\) “absorbs” most of the direct effect of \(X\) on \(Y\) (or at least makes the effect smaller).

Mathematically,

\[ Y = b_0 + b_4 X + b_3 M + \epsilon \]

\(b_4\) needs to be either smaller or insignificant.

The effect of \(X\) on \(Y\) then, \(M\) … mediates between \(X\) and \(Y\)
completely disappear (\(b_4\) insignificant) Fully (i.e., full mediation)
partially disappear (\(b_4 < b_1\) in step 1) Partially (i.e., partial mediation)
  1. Examine the mediation effect (i.e., whether it is significant)

Notes:

  • Proximal mediation (\(a > b\)) can lead to multicollinearity and reduce statistical power, whereas distal mediation (\(b > a\)) is preferred for maximizing test power.

  • The ideal balance for maximizing power in mediation analysis involves slightly distal mediators (i.e., path \(b\) is somewhat larger than path \(a\)) (Hoyle 1999).

  • Tests for direct effects (c and c’) have lower power compared to the indirect effect (ab), making it possible for ab to be significant while c is not, even in cases where there seems to be complete mediation but no statistical evidence of a direct cause-effect relationship between X and Y without considering M (Kenny and Judd 2014).

  • The testing of \(ab\) offers a power advantage over \(c’\) because it effectively combines two tests. However, claims of complete mediation based solely on the non-significance of \(c’\) should be approached with caution, emphasizing the need for sufficient sample size and power, especially in assessing partial mediation. Or one should never make complete mediation claim (Hayes and Scharkow 2013)

36.1.1 Assumptions

36.1.1.1 Direction

  • Quick fix but not convincing: Measure \(X\) before \(M\) and \(Y\) to prevent \(M\) or \(Y\) causing \(X\); measure \(M\) before \(Y\) to avoid \(Y\) causing \(M\).

  • \(Y\) may cause \(M\) in a feedback model.

    • Assuming \(c' =0\) (full mediation) allows for estimating models with reciprocal causal effects between \(M\) and \(Y\) via IV estimation.

    • E. R. Smith (1982) proposes treating both \(M\) and \(Y\) as outcomes with potential to mediate each other, requiring distinct instrumental variables for each that do not affect the other.

36.1.1.2 Interaction

  • When M interact with X to affect Y, M is both a mediator and a mediator (Baron and Kenny 1986).

  • Interaction between \(XM\) should always be estimated.

  • For the interpretation of this interaction, see (T. VanderWeele 2015)

36.1.1.3 Reliability

  • When mediator contains measurement errors, \(b, c'\) are biased. Possible fix: mediator = latent variable (but loss of power) (Ledgerwood and Shrout 2011)

    • \(b\) is attenuated (closer to 0)

    • \(c'\) is

      • overestimated when \(ab >0\)

      • underestiamted when \(ab<0\)

  • When treatment contains measurement errors, \(a,b\) are biased

    • \(a\) is attenuated

    • \(b\) is

      • overestimated when \(ac'>0\)

      • underestimated when \(ac' <0\)

  • When outcome contains measurement errors,

    • If unstandardized, no bias

    • If standardized, attenuation bias

36.1.1.4 Confounding

36.1.1.4.1 Design Strategies
  • Randomization of treatment variable. If possible, also mediator

  • Control for the confounder (but still only for measureable observables)

36.1.1.4.2 Statistical Strategies
  • Instrumental variable on treatment

    • Specifically for confounder affecting the \(M-Y\) pair, front-door adjustment is possible when there is a variable that completely mediates the effect of the mediator on the outcome and is unaffected by the confounder.
  • Weighting methods (e.g., inverse propensity) See Heiss for R code

    • Need strong ignorability assumption (i.e.., all confounders are included and measured without error (Westfall and Yarkoni 2016)). Not fixable, but can be examined with robustness checks.

36.1.2 Indirect Effect Tests

36.1.2.1 Sobel Test

Standard Error

\[ \sqrt{\hat{b}^2 s_{\hat{a}} + \hat{a}^2 s_{b}^2} \]

The test of the indirect effect is

\[ z = \frac{\hat{ab}}{\sqrt{\hat{b}^2 s_{\hat{a}} + \hat{a}^2 s_{b}^2}} \]

Disadvantages

  • Assume \(a\) and \(b\) are independent.

  • Assume \(ab\) is normally distributed.

  • Does not work well for small sample sizes.

  • Power of the test is low and the test is conservative as compared to Bootstrapping.

36.1.2.2 Joint Significance Test

  • Effective for determining if the indirect effect is nonzero (by testing whether \(a\) and \(b\) are both statistically significant), assumes \(a \perp b\).

  • It’s recommended to use it with other tests and has similar performance to a Bootstrapping test (Hayes and Scharkow 2013).

  • The test’s accuracy can be affected by heteroscedasticity (Fossum and Montoya 2023) but not by non-normality.

  • Although helpful in computing power for the test of the indirect effect, it doesn’t provide a confidence interval for the effect.

36.1.2.3 Bootstrapping

  • First used by Bollen and Stine (1990)

  • It allows for the calculation of confidence intervals, p-values, etc.

  • It does not require \(a \perp b\) and corrects for bias in the bootstrapped distribution.

  • It can handle non-normality (in the sampling distribution of the indirect effect), complex models, and small samples.

  • Concerns exist about the bias-corrected bootstrapping being too liberal (Fritz, Taylor, and MacKinnon 2012). Hence, current recommendations favor percentile bootstrap without bias correction for better Type I error rates (Tibbe and Montoya 2022).

  • A special case of bootstrapping is a proposed by where you don’t need access to raw data to generate resampling, you only need \(a, b, var(a), var(b), cov(a,b)\) (which can be taken from lots of primary studies)

result <-
    causalverse::med_ind(
        a = 0.5,
        b = 0.7,
        var_a = 0.04,
        var_b = 0.05,
        cov_ab = 0.01
    )
result$plot

36.1.2.3.1 With Instrument
library(DiagrammeR)
grViz("
digraph {
  graph []
  node [shape = plaintext]
    X [label = 'Treatment']
    Y [label = 'Outcome']
  edge [minlen = 2]
    X->Y
  { rank = same; X; Y }
}
")

grViz("
digraph {
  graph []
  node [shape = plaintext]
    X [label ='Treatment', shape = box]
    Y [label ='Outcome', shape = box]
    M [label ='Mediator', shape = box]
    IV [label ='Instrument', shape = box]
  edge [minlen = 2]
    IV->X
    X->M  
    M->Y 
    X->Y 
  { rank = same; X; Y; M }
}
")
library(mediation)
data("boundsdata")
library(fixest)

# Total Effect
out1 <- feols(out ~ ttt, data = boundsdata)

# Indirect Effect
out2 <- feols(med ~ ttt, data = boundsdata)

# Direct and Indirect Effect
out3 <- feols(out ~ med + ttt, data = boundsdata)

# Proportion Test
# To what extent is the effect of the treatment mediated by the mediator?
coef(out2)['ttt'] * coef(out3)['med'] / coef(out1)['ttt'] * 100
#>      ttt 
#> 68.63609


# Sobel Test
bda::mediation.test(boundsdata$med, boundsdata$ttt, boundsdata$out) |> 
    tibble::rownames_to_column() |> 
    causalverse::nice_tab(2)
#>   rowname Sobel Aroian Goodman
#> 1 z.value  4.05   4.03    4.07
#> 2 p.value  0.00   0.00    0.00
# Mediation Analysis using boot
library(boot)
set.seed(1)
mediation_fn <- function(data, i){
    # sample the dataset
    df <- data[i,]
    
    
    a_path <- feols(med ~ ttt, data = df)
    a <- coef(a_path)['ttt']
    
    b_path <-  feols(out ~ med + ttt, data = df)
    b <- coef(b_path)['med']
    
    cp <- coef(b_path)['ttt']
    
    # indirect effect
    ind_ef <- a*b
    total_ef <- a*b + cp
    return(c(ind_ef, total_ef))
    
}

boot_med <- boot(boundsdata, mediation_fn, R = 100, parallel = "multicore", ncpus = 2)
boot_med 
#> 
#> ORDINARY NONPARAMETRIC BOOTSTRAP
#> 
#> 
#> Call:
#> boot(data = boundsdata, statistic = mediation_fn, R = 100, parallel = "multicore", 
#>     ncpus = 2)
#> 
#> 
#> Bootstrap Statistics :
#>       original        bias    std. error
#> t1* 0.04112035  0.0006346725 0.009539903
#> t2* 0.05991068 -0.0004462572 0.029556611

summary(boot_med) |> 
    causalverse::nice_tab()
#>     R original bootBias bootSE bootMed
#> 1 100     0.04        0   0.01    0.04
#> 2 100     0.06        0   0.03    0.06

# confidence intervals (percentile is always recommended)
boot.ci(boot_med, type = c("norm", "perc"))
#> BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS
#> Based on 100 bootstrap replicates
#> 
#> CALL : 
#> boot.ci(boot.out = boot_med, type = c("norm", "perc"))
#> 
#> Intervals : 
#> Level      Normal             Percentile     
#> 95%   ( 0.0218,  0.0592 )   ( 0.0249,  0.0623 )  
#> Calculations and Intervals on Original Scale
#> Some percentile intervals may be unstable

# point estimates (Indirect, and Total Effects)
colMeans(boot_med$t)
#> [1] 0.04175502 0.05946442

Alternatively, one can use the robmed package

library(robmed)

Power test or use app

library(pwr2ppl)

# indirect path ab power
medjs(
    # X on M (path a)
    rx1m1 = .3,
    # correlation between X and Y (path c')
    rx1y  = .1,
    # correlation between M and Y (path b)
    rym1  = .3,
    # sample size
    n     = 100,
    alpha = 0.05,
    # number of mediators
    mvars = 1,
    # should use 10000
    rep   = 1000
)

36.1.3 Multiple Mediation

The most general package to handle multiple cases is manymome

See vignette for an example

library(manymome)

36.1.3.1 Multiple Mediators

library(mma)
36.1.3.1.0.1 Lavaan
# Load required packages
library(MASS)  # for mvrnorm
library(lavaan)

# Function to generate synthetic data with correctly correlated errors for mediators
generate_data <-
  function(n = 10000,
           a1 = 0.5,
           a2 = -0.35,
           b1 = 0.7,
           b2 = 0.48,
           corr = TRUE,
           correlation_value = 0.7) {
    set.seed(12345)
    X <- rnorm(n)
    
    # Generate correlated errors using a multivariate normal distribution
    if (corr) {
        Sigma <- matrix(c(1, correlation_value, correlation_value, 1), nrow = 2)  # Higher covariance matrix for errors
        errors <- mvrnorm(n, mu = c(0, 0), Sigma = Sigma)  # Generate correlated errors
    } else {
        errors <- mvrnorm(n, mu = c(0, 0), Sigma = diag(2))  # Independent errors
    }
    
    M1 <- a1 * X + errors[, 1]
    M2 <- a2 * X + errors[, 2]
    Y <- b1 * M1 + b2 * M2 + rnorm(n)  # Y depends on M1 and M2
    
    data.frame(X = X, M1 = M1, M2 = M2, Y = Y)
}

# Ground truth for comparison
ground_truth <- data.frame(Parameter = c("b1", "b2"), GroundTruth = c(0.7, 0.48))

# Function to extract relevant estimates, standard errors, and model fit
extract_estimates_b1_b2 <- function(fit) {
    estimates <- parameterEstimates(fit)
    estimates <- estimates[estimates$lhs == "Y" & estimates$rhs %in% c("M1", "M2"), c("rhs", "est", "se")]
    estimates$Parameter <- ifelse(estimates$rhs == "M1", "b1", "b2")
    estimates <- estimates[, c("Parameter", "est", "se")]
    fit_stats <- fitMeasures(fit, c("aic", "bic", "rmsea", "chisq"))
    return(list(estimates = estimates, fit_stats = fit_stats))
}

# Case 1: Correlated errors for mediators (modeled correctly)
Data_corr <- generate_data(n = 10000, corr = TRUE, correlation_value = 0.7)
model_corr <- '
  Y ~ b1 * M1 + b2 * M2 + c * X
  M1 ~ a1 * X
  M2 ~ a2 * X
  M1 ~~ M2  # Correlated mediators (errors)
'
fit_corr <- sem(model = model_corr, data = Data_corr)
results_corr <- extract_estimates_b1_b2(fit_corr)

# Case 2: Uncorrelated errors for mediators (modeled correctly)
Data_uncorr <- generate_data(n = 10000, corr = FALSE)
model_uncorr <- '
  Y ~ b1 * M1 + b2 * M2 + c * X
  M1 ~ a1 * X
  M2 ~ a2 * X
'
fit_uncorr <- sem(model = model_uncorr, data = Data_uncorr)
results_uncorr <- extract_estimates_b1_b2(fit_uncorr)

# Case 3: Correlated errors, but not modeled as correlated
fit_corr_incorrect <- sem(model = model_uncorr, data = Data_corr)
results_corr_incorrect <- extract_estimates_b1_b2(fit_corr_incorrect)

# Case 4: Uncorrelated errors, but modeled as correlated
fit_uncorr_incorrect <- sem(model = model_corr, data = Data_uncorr)
results_uncorr_incorrect <- extract_estimates_b1_b2(fit_uncorr_incorrect)

# Combine all estimates for comparison
estimates_combined <- list(
    "Correlated (Correct)" = results_corr$estimates,
    "Uncorrelated (Correct)" = results_uncorr$estimates,
    "Correlated (Incorrect)" = results_corr_incorrect$estimates,
    "Uncorrelated (Incorrect)" = results_uncorr_incorrect$estimates
)

# Combine all into a single table
comparison_table <- do.call(rbind, lapply(names(estimates_combined), function(case) {
    df <- estimates_combined[[case]]
    df$Case <- case
    df
}))

# Merge with ground truth for final comparison
comparison_table <- merge(comparison_table, ground_truth, by = "Parameter")

# Display the comparison table
comparison_table
#>   Parameter       est          se                     Case GroundTruth
#> 1        b1 0.7002984 0.013870433     Correlated (Correct)        0.70
#> 2        b1 0.6973612 0.009859426   Uncorrelated (Correct)        0.70
#> 3        b1 0.7002984 0.010010367   Correlated (Incorrect)        0.70
#> 4        b1 0.6973612 0.009859634 Uncorrelated (Incorrect)        0.70
#> 5        b2 0.4871118 0.013805615     Correlated (Correct)        0.48
#> 6        b2 0.4868318 0.010009908   Uncorrelated (Correct)        0.48
#> 7        b2 0.4871118 0.009963588   Correlated (Incorrect)        0.48
#> 8        b2 0.4868318 0.010010119 Uncorrelated (Incorrect)        0.48

# Display model fit statistics for each case
fit_stats_combined <- list(
    "Correlated (Correct)" = results_corr$fit_stats,
    "Uncorrelated (Correct)" = results_uncorr$fit_stats,
    "Correlated (Incorrect)" = results_corr_incorrect$fit_stats,
    "Uncorrelated (Incorrect)" = results_uncorr_incorrect$fit_stats
)

fit_stats_combined
#> $`Correlated (Correct)`
#>      aic      bic    rmsea    chisq 
#> 77932.45 77997.34     0.00     0.00 
#> 
#> $`Uncorrelated (Correct)`
#>       aic       bic     rmsea     chisq 
#> 84664.312 84721.995     0.000     0.421 
#> 
#> $`Correlated (Incorrect)`
#>       aic       bic     rmsea     chisq 
#> 84453.208 84510.891     0.808  6522.762 
#> 
#> $`Uncorrelated (Incorrect)`
#>      aic      bic    rmsea    chisq 
#> 84665.89 84730.78     0.00     0.00

36.1.3.2 Multiple Treatments

(Hayes and Preacher 2014)

Code in Process

References

Baron, Reuben M, and David A Kenny. 1986. “The Moderator–Mediator Variable Distinction in Social Psychological Research: Conceptual, Strategic, and Statistical Considerations.” Journal of Personality and Social Psychology 51 (6): 1173.
Bauer, Daniel J, Kristopher J Preacher, and Karen M Gil. 2006. “Conceptualizing and Testing Random Indirect Effects and Moderated Mediation in Multilevel Models: New Procedures and Recommendations.” Psychological Methods 11 (2): 142.
Bollen, Kenneth A, and Robert Stine. 1990. “Direct and Indirect Effects: Classical and Bootstrap Estimates of Variability.” Sociological Methodology, 115–40.
Fossum, Jessica L, and Amanda K Montoya. 2023. “When to Use Different Inferential Methods for Power Analysis and Data Analysis for Between-Subjects Mediation.” Advances in Methods and Practices in Psychological Science 6 (2): 25152459231156606.
Fritz, Matthew S, Amanda B Taylor, and David P MacKinnon. 2012. “Explanation of Two Anomalous Results in Statistical Mediation Analysis.” Multivariate Behavioral Research 47 (1): 61–87. https://doi.org/10.1080/00273171.2012.640596.
Hayes, Andrew F, and Kristopher J Preacher. 2014. “Statistical Mediation Analysis with a Multicategorical Independent Variable.” British Journal of Mathematical and Statistical Psychology 67 (3): 451–70.
Hayes, Andrew F, and Michael Scharkow. 2013. “The Relative Trustworthiness of Inferential Tests of the Indirect Effect in Statistical Mediation Analysis: Does Method Really Matter?” Psychological Science 24 (10): 1918–27.
Hoyle, Rick H. 1999. Statistical Strategies for Small Sample Research. sage.
Kenny, David A, and Charles M Judd. 2014. “Power Anomalies in Testing Mediation.” Psychological Science 25 (2): 334–39.
Ledgerwood, Alison, and Patrick E Shrout. 2011. “The Trade-Off Between Accuracy and Precision in Latent Variable Models of Mediation Processes.” Journal of Personality and Social Psychology 101 (6): 1174.
MacKinnon, David P, Amanda J Fairchild, and Matthew S Fritz. 2007. “Mediation Analysis.” Annu. Rev. Psychol. 58: 593–614.
MacKinnon, David P, Ghulam Warsi, and James H Dwyer. 1995. “A Simulation Study of Mediated Effect Measures.” Multivariate Behavioral Research 30 (1): 41–62.
Shrout, Patrick E, and Niall Bolger. 2002. “Mediation in Experimental and Nonexperimental Studies: New Procedures and Recommendations.” Psychological Methods 7 (4): 422.
Smith, Eliot R. 1982. “Beliefs, Attributions, and Evaluations: Nonhierarchical Models of Mediation in Social Cognition.” Journal of Personality and Social Psychology 43 (2): 248.
Sobel, Michael E. 1982. “Asymptotic Confidence Intervals for Indirect Effects in Structural Equation Models.” Sociological Methodology 13: 290–312.
Tibbe, Tristan D, and Amanda K Montoya. 2022. “Correcting the Bias Correction for the Bootstrap Confidence Interval in Mediation Analysis.” Frontiers in Psychology 13: 810258.
VanderWeele, Tyler. 2015. Explanation in Causal Inference: Methods for Mediation and Interaction. Oxford University Press.
Westfall, Jacob, and Tal Yarkoni. 2016. “Statistically Controlling for Confounding Constructs Is Harder Than You Think.” PloS One 11 (3): e0152719.