33.1 Framework
Di∼Bern Dummy Treatment
Y0i,Y1i potential outcomes
Yi=Y0i+(Y1i−Y0i)Di observed outcome
Zi⊥Y0i,Y1i Instrumental variables (and also correlate with Di)
Under constant-effects and linear (Y1i−Y0i are the same for everyone)
Y0i=α+ηiY1i−Y0i=ρYi=Y0i+Di(Y1i−Y0i)=α+ηi+Diρ=α+ρDi+ηi
where
ηi is individual differences
ρ is the difference between treated outcome and untreated outcome. Here we assume they are constant for everyone
However, we have a problem with OLS because Di is correlated with ηi for each unit
But Zi can come to the rescue, the causal estimate can be written as
ρ=Cov(Yi,Zi)Cov(Di,Zi)=Cov(Yi,Zi)/V(Zi)Cov(Di,Zi)/V(Zi)=ReducedformFirst−stage=E[Yi|Zi=1]−E[Yi|Zi=0]E[Di|Zi=1]−E[Di|Zi=0]
Under heterogeneous treatment effect (Y1i−Y0i are different for everyone) with LATE framework
Yi(d,z) denotes the potential outcome for unit i with treatment Di=d and instrument Zi=z
Observed treatment status
Di=D0i+Zi(D1i−D0i)
where
D1i is treatment status of unit i when zi=1
D0i is treatment status of unit i when zi=0
D1i−D0i is the causal effect of Zi on Di
Assumptions
Independence: The instrument is randomly assigned (i.e., independent of potential outcomes and potential treatments)
[{Yi(d,z);∀d,z},D1i,D0i]ΠZi
This assumption let the first-stage equation be the average causal effect of Zi on Di
E[Di|Zi=1]−E[Di|Zi=0]=E[D1i|Zi=1]−E[D0i|Zi=0]=E[D1i−D0i]
- This assumption also is sufficient for a causal interpretation of the reduced form, where we see the effect of the instrument on the outcome.
E[Yi|Zi=1]−E[Yi|Zi=0]=E[Yi(D1i,Zi=1)−Yi(D0i,Zi=0)]
Exclusion (i.e., existence of instruments (G. W. Imbens and Angrist 1994)
- The treatment Di fully mediates the effect of Zi on Yi
Y1i=Yi(1,1)=Yi(1,0)Y0i=Yi(0,1)=Yi(0,0)
- With this assumption, the observed outcome Yi can be thought of as (assume Y1i,Y0i already satisfy the independence assumption)
Yi=Yi(0,Zi)+[Yi(1,Zi)−Yi(0,Zi)]Di=Y0i+(Y1i−Y0i)Di
- This assumption let us go from reduced-form causal effects to treatment effects (J. D. Angrist and Imbens 1995)
Monotonicity: D1i>D0i∀i
With this assumption, we have E[D1i−D0i]=P[D1i>D0i]
This assumption lets us assume that there is a first stage, in which we examine the proportion of the population that Di is driven by Zi
This assumption is used to solve to problem of the shifts between participation status back to non-participation status.
Alternatively, one can solve the same problem by assuming constant (homogeneous) treatment effect (G. W. Imbens and Angrist 1994), but this is rather restrictive.
A third solution is the assumption that there exists a value of the instrument, where the probability of participation conditional on that value is 0 J. Angrist and Imbens (1991).
With these three assumptions, we have the LATE theorem (J. D. Angrist and Pischke 2009, 4.4.1)
E[Yi|Zi=1]−E[Yi|Zi=0]E[Di|Zi=1]−E[Di|Zi=0]=E[Y1i−Y0i|D1i>D0i]
LATE assumptions allow us to go back to the types of subjects we have in Causal Inference
Switchers:
- Compliers: D1i>D0i
Non-switchers:
Always-takers: D1i=D0i=1
Never-takers: D1i=D0i=0
Instrumental Variables can’t say anything about non-switchers because treatment status Di has no effects on them (similar to fixed effects models).
When all groups are the same, we come back to the constant-effects world.
Treatment effects on the treated is a weighted average of always-takers and compliers.
In the special case of IV in randomized trials, we have a compliance problem (when compliance is voluntary), where those in the treated will not always take the treatment (i.e., might be selection bias).
Intention-to-treat analysis is valid, but contaminated by non-compliance
IV in this case (Zi = random assignment to the treatment; Di = whether the unit actually received/took the treatment) can solve this problem.
Under certain assumptions (i.e., SUTVA, random assignment, exclusion restriction, no defiers, and monotinicity), this analysis can give causal interpreation of LATE because it’s the average causal effect for the compliers only.
- Without these assumptions, it’s a ratio of intention-to-treat.
Without always-takers in this case, LATE = Treatment effects on the treated
See proof Bloom (1984) and examples Bloom et al. (1997) and Sherman and Berk (1984)
E[Yi|Zi=1]−E[Yi|Zi=0]E[Di|Zi=1]=Intention-to-treat effectCompliance rate=E[Y1i−Y0i|Di=1]