34.8 Treatment Intensity

Two-Stage Least Squares is a powerful method for estimating the average causal effect when treatment intensity varies across units. Rather than simple binary treatment (treated vs. untreated), many empirical applications involve treatments that can take on a range of values. TSLS can identify causal effects in these settings, capturing “a weighted average of per-unit treatment effects along the length of a causal response function” (J. D. Angrist and Imbens 1995, 431).

Common examples of treatment intensity include:

The average causal effect here refers to the conditional expectation of the difference in outcomes between the treated unit (at a given treatment intensity) and what would have happened in the counterfactual scenario (at a different treatment intensity). Importantly:

  • Linearity is not required in the relationships between the dependent variable, treatment intensities, and instruments. TSLS can accommodate nonlinear causal response functions, provided the assumptions of the method hold.

34.8.1 Example: Schooling and Earnings

In their seminal paper, J. D. Angrist and Imbens (1995) estimate the causal effect of years of schooling on earnings, using quarter of birth as an instrumental variable. The intuition is that individuals born in different quarters are subject to different compulsory schooling laws, which affect educational attainment but are plausibly unrelated to unobserved ability or motivation (the typical omitted variables in this context).

The structural outcome equation is:

Y=γ0+γ1X1+ρS+ε

where:

  • Y is the log of earnings (the dependent variable)
  • S is years of schooling (the endogenous regressor)
  • X1 is a vector (or matrix) of exogenous covariates (e.g., demographic characteristics)
  • ρ is the causal return to schooling we wish to estimate
  • ε is the error term, capturing unobserved factors

Because schooling S may be endogenous (e.g., correlated with ε), we model its first-stage relationship with the exogenous variables and instruments:

S=δ0+X1δ1+X2δ2+η

where:

  • X2 represents the instrumental variables (e.g., quarter of birth)
  • δ2 is the coefficient on the instrument
  • η is the first-stage error term

The Two-Stage Procedure

  1. First Stage Regression
    Regress S on X1 and X2 to obtain the predicted (fitted) values ˆS.

ˆS=^δ0+X1^δ1+X2^δ2

  1. Second Stage Regression
    Replace S with ˆS in the structural equation and estimate:

Y=γ0+γ1X1+ρˆS+ν

where ν is the new error term (different from ε because ˆS is constructed to be exogenous).

Under the standard IV assumptions, ρ is a consistent estimator of the causal effect of schooling on earnings.


34.8.2 Causal Interpretation of ρ

For ρ to have a valid causal interpretation, two key assumptions are essential:

  1. SUTVA (Stable Unit Treatment Value Assumption)
    • The potential outcomes of each individual are not affected by the treatment assignments of other units.
    • There are no hidden variations of the treatment; “one year of schooling” means the same treatment type across individuals.
    • While important, SUTVA is often assumed without extensive defense in empirical work, though violations (e.g., spillovers in education settings) should be acknowledged when plausible.
  2. [Local Average Treatment Effect] (LATE)
    • TSLS identifies a weighted average of marginal effects at the points where the instrument induces variation in treatment intensity.
    • Formally, ρ converges in probability to a weighted average of causal increments:

plim ˆρ=jwjE[YjYj1Compliers at level j]

where wj are weights determined by the distribution of the instrument and treatment intensity.

  • This LATE interpretation means that TSLS estimates apply to compliers whose treatment intensity changes in response to the instrument. If there is heterogeneity in treatment effects across units, the interpretation of ρ becomes instrument-dependent and may not generalize to the entire population.

References

Angrist, Joshua D, and Guido W Imbens. 1995. “Two-Stage Least Squares Estimation of Average Causal Effects in Models with Variable Treatment Intensity.” Journal of the American Statistical Association 90 (430): 431–42.
Angrist, Joshua D, and Victor Lavy. 1999. “Using Maimonides’ Rule to Estimate the Effect of Class Size on Scholastic Achievement.” The Quarterly Journal of Economics 114 (2): 533–75.
Lavy, Victor, Joshua D Angrist, and Analia Schlosser. 2006. “New Evidence on the Causal Link Between the Quantity and Quality of Children.”
Permutt, Thomas, and J Richard Hebel. 1989. “Simultaneous-Equation Estimation in a Clinical Trial of the Effect of Smoking on Birth Weight.” Biometrics, 619–22.
Powers, Donald E, and Spencer S Swinton. 1984. “Effects of Self-Study for Coachable Test Item Types.” Journal of Educational Psychology 76 (2): 266.