27.8 Regression Kink Design
The Regression Kink Design (RKD) extends the logic of RD by exploiting changes in the slope of the treatment intensity function at a known threshold rather than a discontinuous jump in treatment assignment.
Instead of an RD jump in treatment probability at X=c, the treatment function b(X) exhibits a kink at the cutoff:
- In Sharp RKD, the kink is deterministic, meaning the treatment function b(X) changes its slope exactly at X=c.
- In Fuzzy RKD, treatment assignment remains probabilistic, requiring an instrumental variable approach similar to Fuzzy RD.
Example: Unemployment Benefits
Consider an unemployment insurance program where benefits increase at a diminishing rate as prior earnings increase. The function governing benefits, b(X), exhibits a kink at a threshold X=c. The RKD framework allows us to estimate the marginal causal effect of additional benefits on employment duration.
27.8.1 Identification in Sharp Regression Kink Design
In a Sharp RKD, the treatment intensity function b(X) exhibits a known change in slope at X=c, formally:
Di=b(Xi),where b(X) has a kink at X=c.
The key identification assumption is that the potential outcome function E[Y(d)|X] is smooth in X. Thus, any observed change in the slope of E[Y|X] at X=c can be attributed to the change in b(X).
The causal effect of interest is:
αKRD=lim
where:
b(X) is a known function determining treatment intensity.
The numerator captures the discontinuous change in the slope of the expected outcome at X = c.
The denominator captures the change in the slope of the treatment function at X = c.
If b(X) is known and deterministic, the denominator is non-random, allowing for precise estimation of \alpha_{KRD}.
Assumptions for Identification
- Continuity of Potential Outcomes
- The expected potential outcomes E[Y(d)|X] are smooth in X (no jumps).
- No Manipulation of the Running Variable
- The density of X is continuous at X = c, implying that agents cannot sort themselves based on the kink.
- First-Stage Validity
- The slope of b(X) must change at X = c (i.e., the kink must exist).
If these assumptions hold, \alpha_{KRD} represents the marginal causal effect of treatment intensity on the outcome.
27.8.2 Identification in Fuzzy Regression Kink Design
In Fuzzy RKD, the treatment function D_i does not directly follow a deterministic function b(X) but instead exhibits a kink in its probability distribution:
E[D | X] \text{ has a kink at } X = c.
The treatment intensity function is unknown, requiring an instrumental variable (IV) strategy, analogous to Fuzzy RD.
The causal effect is given by:
\alpha_{KRD} = \frac{\lim\limits_{x \downarrow c} \frac{d}{dx}E[Y |X = x]- \lim\limits_{x \uparrow c} \frac{d}{dx}E[Y |X = x]}{\lim\limits_{x \downarrow c} \frac{d}{dx}E[D |X = x]- \lim\limits_{x \uparrow c} \frac{d}{dx}E[D |X = x]}.
where:
The numerator measures the kink in the expected outcome.
The denominator measures the kink in the probability of treatment.
The ratio provides a local instrumental variable estimate of the causal effect for compliers.
Identification Assumptions
In addition to the Sharp RKD assumptions, Fuzzy RKD requires:
- Monotonicity
- No individuals decrease their treatment intensity while others increase at the kink (analogous to Fuzzy RD monotonicity).
- Relevance of the Kink
- There must be a statistically significant slope change in E[D | X] at X = c.
If these assumptions hold, the Fuzzy RKD estimator identifies a local treatment effect.
27.8.3 Estimation of RKD Effects
RKD estimation involves three main steps:
Step 1: Estimating the Kink in the Outcome Function
Estimate the left- and right-hand derivatives of E[Y | X]:
\frac{d}{dx}E[Y | X] = \lim_{h \to 0} \frac{E[Y | X = c + h] - E[Y | X = c - h]}{h}.
This can be done using:
Local linear regression on either side of the kink.
Higher-order polynomial regression for improved flexibility.
Step 2: Estimating the Kink in the Treatment Function
For Sharp RKD, the kink in b(X) is known.
For Fuzzy RKD, estimate the kink in E[D | X]:
\frac{d}{dx}E[D | X] = \lim_{h \to 0} \frac{E[D | X = c + h] - E[D | X = c - h]}{h}.
Use local regression or piecewise polynomials to estimate this slope.
Step 3: Compute RKD Estimator
For Sharp RKD:
\hat{\alpha}_{KRD} = \frac{\hat{\tau}_Y}{\tau_b},
where:
\hat{\tau}_Y is the estimated kink in E[Y | X].
\tau_b is the known slope change in b(X).
For Fuzzy RKD:
\hat{\alpha}_{KRD} = \frac{\hat{\tau}_Y}{\hat{\tau}_D}.
where:
- \hat{\tau}_D is the estimated kink in E[D | X].
27.8.4 Robustness Checks
- Assess Covariate Smoothness
- Verify that pre-determined covariates (e.g., age, education) do not exhibit kinks at X = c.
- Check for Manipulation of the Running Variable
- Perform a McCrary test to ensure the density of X is continuous at X = c.
- Placebo Kinks
- Test for spurious kinks at other arbitrary values of X.
- Bandwidth Sensitivity
- Estimate RKD effects with varying bandwidths to check for consistency.