33.1 Framework

  • DiBern Dummy Treatment

  • Y0i,Y1i potential outcomes

  • Yi=Y0i+(Y1iY0i)Di observed outcome

  • ZiY0i,Y1i Instrumental variables (and also correlate with Di)

Under constant-effects and linear (Y1iY0i are the same for everyone)

Y0i=α+ηiY1iY0i=ρYi=Y0i+Di(Y1iY0i)=α+ηi+Diρ=α+ρDi+ηi

where

  • ηi is individual differences

  • ρ is the difference between treated outcome and untreated outcome. Here we assume they are constant for everyone

However, we have a problem with OLS because Di is correlated with ηi for each unit

But Zi can come to the rescue, the causal estimate can be written as

ρ=Cov(Yi,Zi)Cov(Di,Zi)=Cov(Yi,Zi)/V(Zi)Cov(Di,Zi)/V(Zi)=ReducedformFirststage=E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]E[Di|Zi=0]

Under heterogeneous treatment effect (Y1iY0i are different for everyone) with LATE framework

Yi(d,z) denotes the potential outcome for unit i with treatment Di=d and instrument Zi=z

Observed treatment status

Di=D0i+Zi(D1iD0i)

where

  • D1i is treatment status of unit i when zi=1

  • D0i is treatment status of unit i when zi=0

  • D1iD0i is the causal effect of Zi on Di

Assumptions

  • Independence: The instrument is randomly assigned (i.e., independent of potential outcomes and potential treatments)

    • [{Yi(d,z);d,z},D1i,D0i]ΠZi

    • This assumption let the first-stage equation be the average causal effect of Zi on Di

    E[Di|Zi=1]E[Di|Zi=0]=E[D1i|Zi=1]E[D0i|Zi=0]=E[D1iD0i]

    • This assumption also is sufficient for a causal interpretation of the reduced form, where we see the effect of the instrument on the outcome.

E[Yi|Zi=1]E[Yi|Zi=0]=E[Yi(D1i,Zi=1)Yi(D0i,Zi=0)]

  • Exclusion (i.e., existence of instruments (G. W. Imbens and Angrist 1994)

    • The treatment Di fully mediates the effect of Zi on Yi

    Y1i=Yi(1,1)=Yi(1,0)Y0i=Yi(0,1)=Yi(0,0)

    • With this assumption, the observed outcome Yi can be thought of as (assume Y1i,Y0i already satisfy the independence assumption)

    Yi=Yi(0,Zi)+[Yi(1,Zi)Yi(0,Zi)]Di=Y0i+(Y1iY0i)Di

  • Monotonicity: D1i>D0ii

    • With this assumption, we have E[D1iD0i]=P[D1i>D0i]

    • This assumption lets us assume that there is a first stage, in which we examine the proportion of the population that Di is driven by Zi

    • This assumption is used to solve to problem of the shifts between participation status back to non-participation status.

      • Alternatively, one can solve the same problem by assuming constant (homogeneous) treatment effect (G. W. Imbens and Angrist 1994), but this is rather restrictive.

      • A third solution is the assumption that there exists a value of the instrument, where the probability of participation conditional on that value is 0 J. Angrist and Imbens (1991).

With these three assumptions, we have the LATE theorem (J. D. Angrist and Pischke 2009, 4.4.1)

E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]E[Di|Zi=0]=E[Y1iY0i|D1i>D0i]

LATE assumptions allow us to go back to the types of subjects we have in Causal Inference

  • Switchers:

    • Compliers: D1i>D0i
  • Non-switchers:

    • Always-takers: D1i=D0i=1

    • Never-takers: D1i=D0i=0

Instrumental Variables can’t say anything about non-switchers because treatment status Di has no effects on them (similar to fixed effects models).

When all groups are the same, we come back to the constant-effects world.

Treatment effects on the treated is a weighted average of always-takers and compliers.

In the special case of IV in randomized trials, we have a compliance problem (when compliance is voluntary), where those in the treated will not always take the treatment (i.e., might be selection bias).

  • Intention-to-treat analysis is valid, but contaminated by non-compliance

  • IV in this case (Zi = random assignment to the treatment; Di = whether the unit actually received/took the treatment) can solve this problem.

  • Under certain assumptions (i.e., SUTVA, random assignment, exclusion restriction, no defiers, and monotinicity), this analysis can give causal interpreation of LATE because it’s the average causal effect for the compliers only.

    • Without these assumptions, it’s a ratio of intention-to-treat.
  • Without always-takers in this case, LATE = Treatment effects on the treated

See proof Bloom (1984) and examples Bloom et al. (1997) and Sherman and Berk (1984)

E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]=Intention-to-treat effectCompliance rate=E[Y1iY0i|Di=1]

References

Angrist, Joshua D, and Guido W Imbens. 1995. “Two-Stage Least Squares Estimation of Average Causal Effects in Models with Variable Treatment Intensity.” Journal of the American Statistical Association 90 (430): 431–42.
Angrist, Joshua D, and Jörn-Steffen Pischke. 2009. Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton university press.
Angrist, Joshua, and Guido Imbens. 1991. “Sources of Identifying Information in Evaluation Models.” National Bureau of Economic Research Cambridge, Mass., USA.
Bloom, Howard S. 1984. “Accounting for No-Shows in Experimental Evaluation Designs.” Evaluation Review 8 (2): 225–46.
Bloom, Howard S, Larry L Orr, Stephen H Bell, George Cave, Fred Doolittle, Winston Lin, and Johannes M Bos. 1997. “The Benefits and Costs of JTPA Title II-a Programs: Key Findings from the National Job Training Partnership Act Study.” Journal of Human Resources, 549–76.
Imbens, Guido W, and Joshua D Angrist. 1994. “Identification and Estimation of Local Average Treatment Effects.” Econometrica 62 (2): 467–75.
Sherman, Lawrence W, and Richard Alan Berk. 1984. The Minneapolis Domestic Violence Experiment. Vol. 1. Police Foundation Washington, DC.