34.4 Asymptotic Properties of the IV Estimator
IV estimation provides consistent and asymptotically normal estimates of structural parameters under a specific set of assumptions. Understanding the asymptotic properties of the IV estimator requires clarity on the identification conditions and the large-sample behavior of the estimator.
Consider the linear structural model:
Y=Xβ+u
Where:
Y is the dependent variable (n×1)
X is a matrix of endogenous regressors (n×k)
u is the error term
β is the parameter vector of interest (k×1)
Suppose we have a matrix of instruments Z (n×m), with m≥k.
The IV estimator of β is:
ˆβIV=(Z′X)−1Z′Y
Alternatively, when using 2SLS, this is equivalent to:
ˆβ2SLS=(X′PZX)−1X′PZY
Where:
- PZ=Z(Z′Z)−1Z′ is the projection matrix onto the column space of Z.
34.4.1 Consistency
For ˆβIV to be consistent, the following conditions must hold as n→∞:
- Instrument Exogeneity
E[Z′u]=0
Instruments must be uncorrelated with the structural error term.
Guarantees instrument validity.
- Instrument Relevance
rank(E[Z′X])=k
Instruments must be correlated with the endogenous regressors.
Ensures identification of β.
If this fails, the model is underidentified, and ˆβIV does not converge to the true β.
- Random Sampling (IID Observations)
- {(Yi,Xi,Zi)}ni=1 are independent and identically distributed (i.i.d.).
- In more general settings, stationarity and mixing conditions can relax this.
- Finite Moments
- E[||Z||2]<∞ and E[||u||2]<∞
- Ensures Law of Large Numbers applies to sample moments.
If these conditions are satisfied: ˆβIVp→β This means the IV estimator is consistent.
34.4.2 Asymptotic Normality
In addition to consistency conditions, we require:
- Homoskedasticity (Optional but Simplifying)
E[uu′|Z]=σ2I
Simplifies variance estimation.
If violated, heteroskedasticity-robust variance estimators must be used.
- Central Limit Theorem Conditions
- Sample moments must satisfy a CLT: √n(1nn∑i=1Ziui)d→N(0,Ω) Where Ω=E[ZiZ′iu2i].
Under the above conditions: √n(ˆβIV−β)d→N(0,V)
Where the asymptotic variance-covariance matrix V is: V=(QZX)−1QZuu(Q′ZX)−1 With:
QZX=E[ZiX′i]
QZuu=E[ZiZ′iu2i]
34.4.3 Asymptotic Efficiency
- Optimal Instrument Choice
- Among all IV estimators, 2SLS is efficient when the instrument matrix Z contains all relevant information.
- Generalized Method of Moments (GMM) can deliver efficiency gains in the presence of heteroskedasticity, by optimally weighting the moment conditions.
GMM Estimator
ˆβGMM=argmin
Where W is an optimal weighting matrix, typically:
W = \Omega^{-1}
Result
- If Z is overidentified (m > k), GMM can be more efficient than 2SLS.
- When instruments are exactly identified (m = k), IV, 2SLS, and GMM coincide.
Summary Table of Conditions
Condition | Requirement | Purpose |
---|---|---|
Instrument Exogeneity | \mathbb{E}[Z'u] = 0 | Instrument validity |
Instrument Relevance | \mathrm{rank}(\mathbb{E}[Z'X]) = k | Model identification |
Random Sampling | IID (or stationary and mixing) | LLN and CLT applicability |
Finite Second Moments | \mathbb{E}[||Z||^2] < \infty, etc. | LLN and CLT applicability |
Homoskedasticity (optional) | \mathbb{E}[u u' | Z] = \sigma^2 I | Simplifies variance formulas |
Optimal Weighting | W = \Omega^{-1} in GMM | Asymptotic efficiency |