22.2 Semi-random Experiment
Chicago Open Enrollment Program (Cullen, Jacob, and Levitt 2005)
Students can apply to “choice” schools
Many schools are oversubscribed (Demand > Supply)
Resolve scarcity via random lotteries
Non-random enrollment, we only have random lottery which mean the above
Let
δj=E(Yi|Enrollij=1;Applyij=1)−E(Yi|Enrollij=0;Applyij=1)
and
θj=E(Yi|Winij=1;Applyij=1)−E(Yi|Winij=0;Applyij=1)
Hence, we can clearly see that δj≠θj because you can only enroll, but you cannot ensure that you will win. Thus, intention to treat is different from treatment effect.
Non-random enrollment, we only have random lottery which means we can only estimate θj
To recover the true treatment effect, we can use
δj=E(Yi|Wij=1;Aij=1)−E(Yi|Wij=0;Aij=1)P(Enrollij=1|Wij=1;Aij=1)−P(Enrollij=1|Wij=0;Aij=1)
where
δj = treatment effect
W = Whether students win the lottery
A = Whether student apply for the lottery
i = application
j = school
Say that we have
10 win
Number students | Type | Selection effect | Treatment effect | Total effect |
---|---|---|---|---|
1 | Always Takers | +0.2 | +1 | +1.2 |
2 | Compliers | 0 | +1 | +1 |
7 | Never Takers | -0.1 | 0 | -0.1 |
10 lose
Number students | Type | Selection effect | Treatment effect | Total effect |
---|---|---|---|---|
1 | Always Takers | +0.2 | +1 | +1.2 |
2 | Compliers | 0 | 0 | 0 |
7 | Never Takers | -0.1 | 0 | -0.1 |
Intent to treatment = Average effect of who you give option to choose
E(Yi|Wij=1;Aij=1)=1∗(1.2)+2∗(1)+7∗(−0.1)10=0.25
E(Yi|Wij=0;Aij=1)=1∗(1.2)+2∗(0)+7∗(−0.1)10=0.05
Hence,
Intent to treatment=0.25−0.05=0.2Treatment effect=1
P(Enrollij=1|Wij=1;Aij=1)=1+210=0.3P(Enrollij=1|Wij=0;Aij=1)=110=0.1
Treatment effect=0.20.3−0.1=1
After knowing how to recover the treatment effect, we turn our attention to the main model
Yia=δWia+λLia+eia
where
W = whether a student wins a lottery
L = whether student enrolls in the lottery
δ = intent to treat
Hence,
Conditional on lottery, the δ is valid
But without lottery, your δ is not random
Winning and losing are only identified within lottery
Each lottery has multiple entries. Thus, we can have within estimator
We can also include other control variables (Xiθ)
Yia=δ1Wia+λ1Lia+Xiθ+uia
E(δ)=E(δ1)E(λ)≠E(λ1)because choosing a lottery is not random
Including (Xiθ) just shifts around control variables (i.e., reweighting of lottery), which would not affect your treatment effect E(δ)