27.9 Multi-Cutoff Regression Discontinuity Design
The Multi-Cutoff Regression Discontinuity Design extends the standard RD framework by allowing for multiple cutoff points across different groups or geographic regions. Instead of a single threshold c, different subgroups are assigned different cutoffs Ci. This framework allows for a heterogeneous treatment effect function:
τ(x,c)=E[Y1i−Y0i|Xi=x,Ci=c].
Why Use Multi-Cutoff RD?
- Policy Variation: Policies often implement different cutoffs across regions or institutions (e.g., different states setting different minimum test scores for scholarship eligibility).
- Generalizability: Allows estimation of treatment effects across multiple populations instead of relying on a single threshold.
- Improved Precision: Leveraging multiple thresholds can enhance statistical power compared to a single-cutoff RD.
The multi-cutoff RD framework provides several advantages:
- Estimation of Local Heterogeneous Effects
- Unlike standard RD, which estimates a single treatment effect, multi-cutoff RD allows heterogeneity in effects across groups.
- Improved Precision
- More observations across different thresholds can increase statistical power.
- Policy Implications
- Useful in settings where policy thresholds vary (e.g., different states setting different income eligibility limits for welfare programs).
27.9.1 Identification
Under the potential outcomes framework, each unit i has:
A running variable Xi.
A cutoff specific to their group Ci.
A binary treatment indicator:
Di=I(Xi≥Ci).
The observed outcome is:
Yi=DiY1i+(1−Di)Y0i.
The treatment effect is the expected difference in potential outcomes:
τ(x,c)=E[Y1i−Y0i|Xi=x,Ci=c].
27.9.2 Key Assumptions
To ensure causal identification, we extend the standard RD assumptions:
- Continuity of Potential Outcomes
- The expected potential outcomes E[Y(0)|X] and E[Y(1)|X] are smooth functions of X at each cutoff Ci.
- Formally: lim
- No Manipulation of the Running Variable
- The density of X_i must be continuous at each C_i, ensuring that individuals cannot selectively sort above or below their assigned cutoff.
- Local Randomization
- Near each cutoff, units are as-good-as-randomly assigned to treatment or control.
- Independence Across Cutoffs
- The cutoff assignment rule should be exogenous and not correlated with unobserved determinants of Y.
If these assumptions hold, each cutoff provides a valid local treatment effect estimate.
27.9.3 Estimation Approaches
27.9.3.1 Pooling Cutoffs with Fixed Effects
A straightforward way to estimate multi-cutoff RD is to include cutoff fixed effects:
Y_i = \alpha + \beta (X_i - C_i) + \tau D_i + \gamma C_i + \epsilon_i.
where:
\tau captures the average treatment effect across all cutoffs.
C_i is included as a fixed effect to account for different intercepts across groups.
27.9.3.2 Separate RD Estimation for Each Cutoff
Instead of pooling, we can estimate separate RD effects for each C_i:
\tau_c = \lim_{x \downarrow C_i}E[Y|X = x, C_i = c] - \lim_{x \uparrow C_i} E[Y|X = x, C_i = c].
This approach allows for heterogeneous treatment effects.
27.9.3.3 Interaction Model for Heterogeneous Effects
To estimate how treatment effects vary with C_i, we interact D_i with C_i:
Y_i = \alpha + \beta (X_i - C_i) + \tau D_i + \lambda D_i C_i + \epsilon_i.
where:
\lambda captures how the treatment effect varies with the cutoff.
A significant \lambda implies that \tau(x, c) is not constant across cutoffs.
27.9.3.4 Nonparametric Local Estimation
A fully flexible approach estimates \tau(x, c) separately at each cutoff using kernel-based methods:
\hat{\tau}(c) = \frac{\sum_{i=1}^{n} K_h (X_i - C_i) D_i Y_i}{\sum_{i=1}^{n} K_h (X_i - C_i) D_i} - \frac{\sum_{i=1}^{n} K_h (X_i - C_i) (1 - D_i) Y_i}{\sum_{i=1}^{n} K_h (X_i - C_i) (1 - D_i)}.
where:
K_h(\cdot) is a kernel function (e.g., Epanechnikov).
h is the bandwidth, chosen via cross-validation.
27.9.4 Robustness Checks
- Covariate Balance at Each Cutoff
Test whether pre-treatment covariates show jumps at each C_i.
Run placebo RD regressions on covariates:
W_i = \alpha + \beta (X_i - C_i) + \gamma D_i + \epsilon_i.
A significant \gamma suggests that RD assumptions are violated.
- McCrary Density Test
Perform a McCrary test separately at each cutoff to check for manipulation:
f(X) \text{ should be continuous at } X = C_i.
If discontinuities exist, individuals may be sorting around cutoffs.
- Placebo Cutoffs
- Implement fake cutoffs and re-estimate \tau(x, c).
- If significant effects appear, the RD estimates may be biased.
- Varying Bandwidths
- Re-estimate treatment effects using different bandwidths.
- If \hat{\tau}(x,c) changes drastically, it suggests sensitivity to bandwidth choice.