Chapter 27 A Generalized Linear Model for Binomial Response Data

For all i=1,,n, yiBinomial(mi,πi), where mi is known number of trials for observation i, πi=exp(xiβ)1+exp(xiβ), and y1,,yn are independent. The Binomial log likelihood is (βy)=ni=1[yixiβmilog(1+exp(xiβ))]+constant We can compare the fit of a logitstic regression model known as saturated model. The MLE of πi under the logistic regression model is ˆπi=exp(xiˆβ)1+exp(xiˆβ), and the MLE of πi under saturated model is yi/mi. Then the likelihood ratio statistic for testing the logistic regression model as the reduced model VS. the saturated model as the full model is 2ni=1[yilog(yi/miˆπi)+(miyi)log(1yi/mi1ˆπi)] which is called the Deviance Statistics, the Residual Deviance or just the Deviance.

A Lack-of-fit Test: when n is large, and/or m1,mn, are each suitablely large, the Deviance Statistic is approximately χ2np if the logistic regression model is correct.

Deviance Residual: disign(yi/miˆπi)2[yilog(yimiˆπi)+(miyi)log(miyimimiˆπi)] The residual deviance statistic is the sum of the squared deviance residuals (ni=1d2i).

Pearson’s Chi-Square Statistic: Another lack of fit statistic that is approximately χ2np under the null is Pearson’s Chi-Square Statistic: X2=ni=1(yiˆE(yi)^Var(yi))2=ni=1(yimiˆπimiˆπi(1ˆπi))2. The term ri=yimiˆπimiˆπi(1ˆπi) is known as Pearson residual.

Residual Diagnostics: For large mi values, both di and ri should be approximately distributed as standard normal random variables if the logistic regression model is correct.

o=g1m(cbind(tumor, total-tumor)~dose, 
      family=binomial(link=logit), data=d)
summary(o)

Overdispersion: in the GLM framework, its often the case that Var(yi) is the function of E(yi). That is the case for logistic regression where Var(yi)=miπ(1πi)=miπi(miπi)2/mi=E(yi)[E(yi)]2/mi. If the variability of our response is greater than we should expect based on our estimates of the mean, we say that there is overdispersion.

Quasi-likelihood Inference: in the binomial case, we make all the same assumptions as before except that we assume Var(yi)=ϕmiπi(1πi) for some unknown dispersion parameter ϕ>1. The dispersion parameter can be estimated by ˆϕ=ni=1d2i/(np) or ˆϕ=ni=1r2i/(np).

  • The estimated variance of ˆβ is multiplied by ˆϕ.
  • For Wald type inferences, the standard normal null distribution is replaced by t with np degrees of freedom.
  • Any test statistic T that was assumed χ2q under H0 is replaced with T/(qˆϕ) and compared to an F distribution with q and np degrees of freedom.
oq=g1m(cbind(tumor, total-tumor)~dosef, 
       family=quasibinomial(link=logit),data=d)
summary(oq)