34.4 Asymptotic Properties of the IV Estimator

IV estimation provides consistent and asymptotically normal estimates of structural parameters under a specific set of assumptions. Understanding the asymptotic properties of the IV estimator requires clarity on the identification conditions and the large-sample behavior of the estimator.

Consider the linear structural model:

Y=Xβ+u

Where:

  • Y is the dependent variable (n×1)

  • X is a matrix of endogenous regressors (n×k)

  • u is the error term

  • β is the parameter vector of interest (k×1)

Suppose we have a matrix of instruments Z (n×m), with mk.

The IV estimator of β is:

ˆβIV=(ZX)1ZY

Alternatively, when using 2SLS, this is equivalent to:

ˆβ2SLS=(XPZX)1XPZY

Where:

  • PZ=Z(ZZ)1Z is the projection matrix onto the column space of Z.

34.4.1 Consistency

For ˆβIV to be consistent, the following conditions must hold as n:

  1. Instrument Exogeneity

E[Zu]=0

  • Instruments must be uncorrelated with the structural error term.

  • Guarantees instrument validity.

  1. Instrument Relevance

rank(E[ZX])=k

  • Instruments must be correlated with the endogenous regressors.

  • Ensures identification of β.

  • If this fails, the model is underidentified, and ˆβIV does not converge to the true β.

  1. Random Sampling (IID Observations)
  • {(Yi,Xi,Zi)}ni=1 are independent and identically distributed (i.i.d.).
  • In more general settings, stationarity and mixing conditions can relax this.
  1. Finite Moments
  • E[||Z||2]< and E[||u||2]<
  • Ensures Law of Large Numbers applies to sample moments.

If these conditions are satisfied: ˆβIVpβ This means the IV estimator is consistent.

34.4.2 Asymptotic Normality

In addition to consistency conditions, we require:

  1. Homoskedasticity (Optional but Simplifying)

E[uu|Z]=σ2I

  • Simplifies variance estimation.

  • If violated, heteroskedasticity-robust variance estimators must be used.

  1. Central Limit Theorem Conditions
  • Sample moments must satisfy a CLT: n(1nni=1Ziui)dN(0,Ω) Where Ω=E[ZiZiu2i].

Under the above conditions: n(ˆβIVβ)dN(0,V)

Where the asymptotic variance-covariance matrix V is: V=(QZX)1QZuu(QZX)1 With:

  • QZX=E[ZiXi]

  • QZuu=E[ZiZiu2i]

34.4.3 Asymptotic Efficiency

  1. Optimal Instrument Choice
  • Among all IV estimators, 2SLS is efficient when the instrument matrix Z contains all relevant information.
  • Generalized Method of Moments (GMM) can deliver efficiency gains in the presence of heteroskedasticity, by optimally weighting the moment conditions.

GMM Estimator

ˆβGMM=argmin

Where W is an optimal weighting matrix, typically:

W = \Omega^{-1}

Result

  • If Z is overidentified (m > k), GMM can be more efficient than 2SLS.
  • When instruments are exactly identified (m = k), IV, 2SLS, and GMM coincide.

Summary Table of Conditions

Condition Requirement Purpose
Instrument Exogeneity \mathbb{E}[Z'u] = 0 Instrument validity
Instrument Relevance \mathrm{rank}(\mathbb{E}[Z'X]) = k Model identification
Random Sampling IID (or stationary and mixing) LLN and CLT applicability
Finite Second Moments \mathbb{E}[||Z||^2] < \infty, etc. LLN and CLT applicability
Homoskedasticity (optional) \mathbb{E}[u u' | Z] = \sigma^2 I Simplifies variance formulas
Optimal Weighting W = \Omega^{-1} in GMM Asymptotic efficiency