34.2 Framework for Instrumental Variables

We consider a binary treatment framework where:

  • DiBernoulli(p) is a dummy treatment variable.

  • (Y0i,Y1i) are the potential outcomes under control and treatment.

  • The observed outcome is: Yi=Y0i+(Y1iY0i)Di.

  • We introduce an instrumental variable Zi satisfying: Zi(Y0i,Y1i,D0i,D1i).

    • This means Zi is independent of potential outcomes and potential treatment status.
    • Zi must also be correlated with Di to satisfy the relevance condition.

34.2.1 Constant-Treatment-Effect Model

Under the constant treatment effect assumption (i.e., the treatment effect is the same for all individuals),

Y0i=α+ηi,Y1iY0i=ρ,Yi=Y0i+Di(Y1iY0i)=α+ηi+Diρ=α+ρDi+ηi.

where:

  • ηi captures individual-level heterogeneity.
  • ρ is the constant treatment effect.

The problem with OLS estimation is that Di may be correlated with ηi, leading to endogeneity bias.

34.2.2 Instrumental Variable Solution

A valid instrument Zi allows us to estimate the causal effect ρ via:

ρ=Cov(Yi,Zi)Cov(Di,Zi)=Cov(Yi,Zi)/V(Zi)Cov(Di,Zi)/V(Zi)=Reduced form estimateFirst-stage estimate=E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]E[Di|Zi=0].

This ratio measures the treatment effect only if Zi is a valid instrument.

34.2.3 Heterogeneous Treatment Effects and the LATE Framework

In a more general framework where treatment effects vary across individuals,

  • Define potential outcomes as: Yi(d,z)=outcome for unit i given Di=d,Zi=z.

  • Define treatment status based on Zi: Di=D0i+Zi(D1iD0i).

    where:

    • D1i is the treatment status when Zi=1.
    • D0i is the treatment status when Zi=0.
    • D1iD0i is the causal effect of Zi on Di.

34.2.4 Assumptions for LATE Identification

34.2.4.1 Independence (Instrument Randomization)

The instrument must be as good as randomly assigned:

[{Yi(d,z);d,z},D1i,D0i]Zi.

This ensures that Zi is uncorrelated with potential outcomes and potential treatment status.

This assumption let the first-stage equation be the average causal effect of Zi on Di

E[Di|Zi=1]E[Di|Zi=0]=E[D1i|Zi=1]E[D0i|Zi=0]=E[D1iD0i]

This assumption also is sufficient for a causal interpretation of the reduced form, where we see the effect of the instrument Zi on the outcome Yi:

E[Yi|Zi=1]E[Yi|Zi=0]=E[Yi(D1i,Zi=1)Yi(D0i,Zi=0)]

34.2.4.2 Exclusion Restriction

This is also known as the existence of the instrument assumption (G. W. Imbens and Angrist 1994). The instrument should only affect Yi through Di (i.e., the treatment Di fully mediates the effect of Zi on Yi):

Y1i=Yi(1,1)=Yi(1,0)Y0i=Yi(0,1)=Yi(0,0)

Under this assumption (and assume Y1i,Y0i already satisfy the independence assumption), the observed outcome Yi can be rewritten as:

Yi=Yi(0,Zi)+[Yi(1,Zi)Yi(0,Zi)]Di=Y0i+(Y1iY0i)Di.

This assumption let us go from reduced-form causal effects to treatment effects (J. D. Angrist and Imbens 1995).

34.2.4.3 Monotonicity (No Defiers)

We assume that Zi affects Di in a monotonic way:

D1iD0i,i.

  • This assumption lets us assume that there is a first stage, in which we examine the proportion of the population that Di is driven by Zi. It implies that Zi only moves individuals toward treatment, but never away. This rules out “defiers” (i.e., individuals who would have taken the treatment when not assigned but refuse when assigned).
  • This assumption is used to solve to problem of the shifts between participation status back to non-participation status.
    • Alternatively, one can solve the same problem by assuming constant (homogeneous) treatment effect (G. W. Imbens and Angrist 1994), but this is rather restrictive.

    • A third solution is the assumption that there exists a value of the instrument, where the probability of participation conditional on that value is 0 J. Angrist and Imbens (1991).

Under monotonicity,

E[D1iD0i]=P[D1i>D0i].

34.2.5 Local Average Treatment Effect Theorem

Given Independence, Exclusion, and Monotonicity, we obtain the LATE result (J. D. Angrist and Pischke 2009, 4.4.1):

E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]E[Di|Zi=0]=E[Y1iY0i|D1i>D0i].

This states that the IV estimator recovers the causal effect only for compliers—units whose treatment status changes due to Zi.

IV only identifies treatment effects for switchers (compliers):

Switcher Type Compliance Type Definition
Switchers Compliers D1i>D0i (take treatment if Zi=1, not if Zi=0)
Non-switchers Always-Takers D1i=D0i=1 (always take treatment)
Non-switchers Never-Takers D1i=D0i=0 (never take treatment)
  • IV estimates nothing for always-takers and never-takers since their treatment status is unaffected by Zi (Similar to the fixed-effects models).

34.2.6 IV in Randomized Trials (Noncompliance)

  • In randomized trials, if compliance is imperfect (i.e., compliance is voluntary), where individuals in the treatment group will not always take the treatment (e.g., selection bias), intention-to-treat (ITT) estimates are valid but contaminated by noncompliance.
  • IV estimation using random assignment (Zi) as an instrument for actual treatment received (Di) recovers the LATE.

E[Yi|Zi=1]E[Yi|Zi=0]E[Di|Zi=1]=Intent-to-Treat EffectCompliance Rate=E[Y1iY0i|Di=1].

Under full compliance, LATE = Treatment Effect on the Treated (TOT).


References

Angrist, Joshua D, and Guido W Imbens. 1995. “Two-Stage Least Squares Estimation of Average Causal Effects in Models with Variable Treatment Intensity.” Journal of the American Statistical Association 90 (430): 431–42.
Angrist, Joshua D, and Jörn-Steffen Pischke. 2009. Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton university press.
Angrist, Joshua, and Guido Imbens. 1991. “Sources of Identifying Information in Evaluation Models.” National Bureau of Economic Research; National Bureau of Economic Research Cambridge, Mass., USA.
Imbens, Guido W, and Joshua D Angrist. 1994. “Identification and Estimation of Local Average Treatment Effects.” Econometrica 62 (2): 467–75.