Chapter 22 Additional Topics Related to Likelihood
Akaike’s Information Criteria: AIC=−2ℓ(ˆθ)+2k.
Schwarz’s Bayesian Information Criterion: BIC=−2ℓ(ˆθ)+klnn
If REML likelihoods are used, compared models must have the same model for the response mean. Different models for the mean would yield different error contrasts and different datasets for computation of maximized REML likelihoods.
Asymptotic normality of ˆθ: for sufficiently large n, ˆθ⋅∼N(θ,I−1(θ)) where I(θ)=[−E{∂2ℓ(θ)∂θi∂θj}] is called Fisher Information matrix.
- This implies (ˆθ−θ)′[^Var(ˆθ)]−1(ˆθ−θ)⋅∼(ˆθ−θ)d→z′z∼χ2k.
Wald Tests: Suppose for large n that ˆθ⋅∼N(θ,^Var(ˆθ)), then a confidence interval for c′θ that has confidence level approximately equal to 1−α is c′ˆθ±z1−α/2√c′^Var(ˆθ)c. Likewise, if C is a q×k matrix of rank q, a test of H0:Cθ=d can be based on the test statistic (Cˆθ−d)′[C^Var(ˆθ)C′]−1(Cˆθ−d)∼χ2q
Multivarite Delta Method: Suppose a g is a function from Rk to Rm, i.e. g=[g1(θ),…,gm(θ)]′. Let D≡∂g∂θ be the derivative matrix. By Taylor’s theorem, g(ˆθ)≈g(θ)+D′(ˆθ−θ) which implies E(g(ˆθ))≈g(θ)+D′E(ˆθ−θ)=g(θ) and Var(g(ˆθ))≈Var[g(θ)+D′(ˆθ−θ)]=D′Var(ˆθ)D. Therefore, g(ˆθ)⋅∼N(g(θ),D′Var(ˆθ)D).
Likelihood Ratio: Define Λ as Reduced Model Maximial likelihoodFull Model Maximized Likelihood. −2ln(Λ) is known as the likelihood ratio test statisti and −2ln(Λ)∼χ2kf−kr under the null hypothesis.
Profile Likelihood Confidence: {θ1:ℓ(θ1,ˆθ2(θ1))≥ℓ(ˆθ)−12χ2k1,1−α}.