27.9 Multi-Cutoff Regression Discontinuity Design

The Multi-Cutoff Regression Discontinuity Design extends the standard RD framework by allowing for multiple cutoff points across different groups or geographic regions. Instead of a single threshold c, different subgroups are assigned different cutoffs Ci. This framework allows for a heterogeneous treatment effect function:

τ(x,c)=E[Y1iY0i|Xi=x,Ci=c].

Why Use Multi-Cutoff RD?

  • Policy Variation: Policies often implement different cutoffs across regions or institutions (e.g., different states setting different minimum test scores for scholarship eligibility).
  • Generalizability: Allows estimation of treatment effects across multiple populations instead of relying on a single threshold.
  • Improved Precision: Leveraging multiple thresholds can enhance statistical power compared to a single-cutoff RD.

The multi-cutoff RD framework provides several advantages:

  1. Estimation of Local Heterogeneous Effects
    • Unlike standard RD, which estimates a single treatment effect, multi-cutoff RD allows heterogeneity in effects across groups.
  2. Improved Precision
    • More observations across different thresholds can increase statistical power.
  3. Policy Implications
    • Useful in settings where policy thresholds vary (e.g., different states setting different income eligibility limits for welfare programs).

27.9.1 Identification

Under the potential outcomes framework, each unit i has:

  • A running variable Xi.

  • A cutoff specific to their group Ci.

  • A binary treatment indicator:

Di=I(XiCi).

The observed outcome is:

Yi=DiY1i+(1Di)Y0i.

The treatment effect is the expected difference in potential outcomes:

τ(x,c)=E[Y1iY0i|Xi=x,Ci=c].

27.9.2 Key Assumptions

To ensure causal identification, we extend the standard RD assumptions:

  1. Continuity of Potential Outcomes
    • The expected potential outcomes E[Y(0)|X] and E[Y(1)|X] are smooth functions of X at each cutoff Ci.
    • Formally: lim
  2. No Manipulation of the Running Variable
    • The density of X_i must be continuous at each C_i, ensuring that individuals cannot selectively sort above or below their assigned cutoff.
  3. Local Randomization
    • Near each cutoff, units are as-good-as-randomly assigned to treatment or control.
  4. Independence Across Cutoffs
    • The cutoff assignment rule should be exogenous and not correlated with unobserved determinants of Y.

If these assumptions hold, each cutoff provides a valid local treatment effect estimate.

27.9.3 Estimation Approaches

27.9.3.1 Pooling Cutoffs with Fixed Effects

A straightforward way to estimate multi-cutoff RD is to include cutoff fixed effects:

Y_i = \alpha + \beta (X_i - C_i) + \tau D_i + \gamma C_i + \epsilon_i.

where:

  • \tau captures the average treatment effect across all cutoffs.

  • C_i is included as a fixed effect to account for different intercepts across groups.

27.9.3.2 Separate RD Estimation for Each Cutoff

Instead of pooling, we can estimate separate RD effects for each C_i:

\tau_c = \lim_{x \downarrow C_i}E[Y|X = x, C_i = c] - \lim_{x \uparrow C_i} E[Y|X = x, C_i = c].

This approach allows for heterogeneous treatment effects.

27.9.3.3 Interaction Model for Heterogeneous Effects

To estimate how treatment effects vary with C_i, we interact D_i with C_i:

Y_i = \alpha + \beta (X_i - C_i) + \tau D_i + \lambda D_i C_i + \epsilon_i.

where:

  • \lambda captures how the treatment effect varies with the cutoff.

  • A significant \lambda implies that \tau(x, c) is not constant across cutoffs.

27.9.3.4 Nonparametric Local Estimation

A fully flexible approach estimates \tau(x, c) separately at each cutoff using kernel-based methods:

\hat{\tau}(c) = \frac{\sum_{i=1}^{n} K_h (X_i - C_i) D_i Y_i}{\sum_{i=1}^{n} K_h (X_i - C_i) D_i} - \frac{\sum_{i=1}^{n} K_h (X_i - C_i) (1 - D_i) Y_i}{\sum_{i=1}^{n} K_h (X_i - C_i) (1 - D_i)}.

where:

  • K_h(\cdot) is a kernel function (e.g., Epanechnikov).

  • h is the bandwidth, chosen via cross-validation.

27.9.4 Robustness Checks

  1. Covariate Balance at Each Cutoff
  • Test whether pre-treatment covariates show jumps at each C_i.

  • Run placebo RD regressions on covariates:

    W_i = \alpha + \beta (X_i - C_i) + \gamma D_i + \epsilon_i.

  • A significant \gamma suggests that RD assumptions are violated.

  1. McCrary Density Test
  • Perform a McCrary test separately at each cutoff to check for manipulation:

    f(X) \text{ should be continuous at } X = C_i.

  • If discontinuities exist, individuals may be sorting around cutoffs.

  1. Placebo Cutoffs
  • Implement fake cutoffs and re-estimate \tau(x, c).
  • If significant effects appear, the RD estimates may be biased.
  1. Varying Bandwidths
  • Re-estimate treatment effects using different bandwidths.
  • If \hat{\tau}(x,c) changes drastically, it suggests sensitivity to bandwidth choice.