Chapter 19 Maximum Likelihood Estimation for the General Linear Model

  • Likelihood function: \(L(\theta\mid y) = f(y\mid \theta)\) is a real-valued function of \(\theta\) for a given value of \(y\).

  • Maximum likelihood estimatior: \(\hat\theta = \arg\max_{\theta} L(\theta\mid y)\)

  • Score function: \(\frac{\partial \ell(\theta\mid y)}{\partial \theta}\)

  • Score equation: \(\frac{\partial \ell(\theta\mid y)}{\partial \theta} = 0\)

    • Example: \[ \begin{bmatrix} \hat\beta \\ n^{-1}(y-X\hat\beta)'(y-X\hat\beta)\end{bmatrix} \text{ is an MLE of }\theta = \begin{bmatrix} \beta \\ \theta \end{bmatrix} \] However, the MLE of \(\sigma^2\) is not an unbiased estimator, \(E(SSE/n) = \frac{n-r}{n}\sigma^2\)
  • Profiled Log Likelihood: example \(\ell^*(\gamma\mid y) = \ell(\begin{bmatrix} \hat\beta_{\Sigma} \\\ \gamma \end{bmatrix} \mid y)\) where we suppose \(y = X\beta + \epsilon\), \(\epsilon \sim N(0, \Sigma)\), \(\Sigma = \sigma^2\begin{bmatrix} 1 & \rho & \rho^2 \\ \rho & 1 & \rho \\\ \rho^2 & rho & 1 \end{bmatrix}\), \(\gamma = \begin{bmatrix} \sigma^2 \\ \rho \end{bmatrix}\), \(\sigma^2 > 0\), and \(\rho \in (-1, 1)\).

    We can use numerical methods to find \(\hat\gamma\) , a maximizer of \(\ell^*(\gamma\mid y)\).

Example

Researchers were interested in comparing the dry weight of maize seedlings from two different genotypes. For each genotype, nine seeds were planted in each of four trays. The eight trays in total were randomly positioned in a growth chamber. Three weeks after the emergence of the first seedling, emerged seedlings were harvested from each tray and individually weighed after drying to obtain one dry weight for each seedling. Although nine seeds were planted in each tray, fewer than nine seedlings emerged in many of the trays.

Let \(y_{ijk}\) be the dry weight of the \(k\)th seedling in the \(j\)th tray for genotype \(i\). Suppose \[ y_{ijk} = \mu_i + t_{ij} + e_{ijk} \] where \(\mu_1\) and \(\mu_2\) are unknown constants, \(t_{ij} \sim N(0, \sigma_t^2)\), \(e_{ijk} \sim N(0, \sigma_e^2)\), and all random terms are independent.

library(nlme)
lme(SeedlingWeight ̃Genotype,random= ̃1|Tray,method="ML", data=d)

Linear mixed-effects model fit by maximum likelihood
  Data: d
  Log-likelihood: -126.3709   # l(thetahat|y)
  Fixed: SeedlingWeight  ̃ Genotype 
(Intercept)   Genotype2
  15.301832   -3.567017
Random effects:
 Formula:  ̃1 | Tray
        (Intercept) Residual
StdDev:    2.932294  1.88247     # sigma_that, sigma_ehat
Number of Observations: 56
Number of Groups: 8


library(lme4)
lmer(SeedlingWeight ̃Genotype+(1|Tray),REML=F,data=d)

Linear mixed model fit by maximum likelihood [’lmerMod’]
Formula: SeedlingWeight  ̃ Genotype + (1 | Tray)
   Data: d
      AIC       BIC    logLik  deviance 
 260.7418  268.8432 -126.3709  252.7418
Random effects:
 Groups   Name        Std.Dev.
 Tray   (Intercept)   2.932
 Residual             1.882
Number of obs: 56, groups: Tray, 8
Fixed Effects:
(Intercept)    Genotype2
     15.302       -3.567