24.1 Completely Randomized Design

A Completely Randomized Design (CRD) is the simplest type of experimental design, where experimental units are randomly assigned to treatments.

Consider a treatment factor A with a \ge 2 treatment levels. Each experimental unit is randomly assigned to one of these levels. The number of units in each group can be:

  • Balanced: All groups have equal sample sizes n.
  • Unbalanced: Groups have different sample sizes ni (for i=1,...,a).

The total sample size is given by:

N = \sum_{i=1}^{a} n_i

The number of possible assignments of units to treatments is:

k = \frac{N!}{n_1! n_2! \cdots n_a!}

Each assignment has an equal probability of being selected: 1/k. The response of each experimental unit is denoted as Yij, where:

  • i indexes the treatment group.
  • j indexes the individual unit within treatment i.
Treatment Response Table

Treatment        1                  2                  \dots    a
Observations     Y_{11}             Y_{21}             \dots    Y_{a1}
                 Y_{12}             Y_{22}             \dots    Y_{a2}
                 \vdots             \vdots                      \vdots
Sample Mean      \bar{Y}_{1.}       \bar{Y}_{2.}       \dots    \bar{Y}_{a.}
Sample SD        s_1                s_2                \dots    s_a

Where:

\bar{Y}_{i.} = \frac{1}{n_i} \sum_{j=1}^{n_i} Y_{ij}

s_i^2 = \frac{1}{n_i - 1} \sum_{j=1}^{n_i} (Y_{ij} - \bar{Y}_{i.})^2

The grand mean is:

\bar{Y}_{..} = \frac{1}{N} \sum_{i} \sum_{j} Y_{ij}
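
A minimal R sketch of these summary statistics, using a hypothetical data frame `dat` with response `y` and treatment factor `trt` (this toy data set is reused in later sketches):

```r
# Hypothetical data: a = 3 treatments with unequal group sizes
set.seed(1)
dat <- data.frame(
  trt = factor(rep(c("T1", "T2", "T3"), times = c(4, 5, 6))),
  y   = rnorm(15, mean = rep(c(10, 12, 9), times = c(4, 5, 6)), sd = 2)
)

# Group sizes n_i, treatment means Ybar_i., and SDs s_i
tapply(dat$y, dat$trt, length)  # n_i
tapply(dat$y, dat$trt, mean)    # treatment sample means
tapply(dat$y, dat$trt, sd)      # treatment sample SDs

# Grand mean Ybar_.. = (1/N) * sum of all observations
mean(dat$y)
```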


24.1.1 Single-Factor Fixed Effects ANOVA

Also known as One-Way ANOVA or ANOVA Type I Model.

The total variability in the response variable Yij can be decomposed as follows:

Y_{ij} - \bar{Y}_{..} = Y_{ij} - \bar{Y}_{..} + \bar{Y}_{i.} - \bar{Y}_{i.} = (\bar{Y}_{i.} - \bar{Y}_{..}) + (Y_{ij} - \bar{Y}_{i.})

where:

  • The first term represents between-treatment variability (deviation of treatment means from the grand mean).
  • The second term represents within-treatment variability (deviation of observations from their treatment mean).

Thus, we partition the total sum of squares (SSTO) as:

\sum_i \sum_j (Y_{ij} - \bar{Y}_{..})^2 = \sum_i n_i (\bar{Y}_{i.} - \bar{Y}_{..})^2 + \sum_i \sum_j (Y_{ij} - \bar{Y}_{i.})^2

Or equivalently:

SSTO=SSTR+SSE

Where:

  • SSTO (Total SS): Total variability in the data.
  • SSTR (Treatment SS): Variability due to differences between treatment means.
  • SSE (Error SS): Variability within treatments (unexplained variance).

Degrees of freedom (d.f.):

(N - 1) = (a - 1) + (N - a)

where one degree of freedom is lost for the total corrected SSTO because of the estimation of the grand mean (\sum_i \sum_j (Y_{ij} - \bar{Y}_{..}) = 0), and one for SSTR (\sum_i n_i (\bar{Y}_{i.} - \bar{Y}_{..}) = 0).

Mean squares:

MSTR = \frac{SSTR}{a-1}, \quad MSE = \frac{SSE}{N-a}

ANOVA Table

Source of Variation          SS                                                  df       MS
Between Treatments           SSTR = \sum_i n_i (\bar{Y}_{i.} - \bar{Y}_{..})^2    a - 1    SSTR/(a-1)
Error (within treatments)    SSE = \sum_i \sum_j (Y_{ij} - \bar{Y}_{i.})^2        N - a    SSE/(N-a)
Total (corrected)            SSTO = \sum_i \sum_j (Y_{ij} - \bar{Y}_{..})^2       N - 1

For a linear model interpretation of ANOVA, we have either

  1. Cell Means Model
  2. Treatment Effect (Factor Effects Model)

24.1.1.1 Cell Means Model

The cell means model describes the response as:

Y_{ij} = \mu_i + \epsilon_{ij}

where:

  • Y_{ij}: Response for unit j in treatment i.
  • \mu_i: Fixed population mean for treatment i.
  • \epsilon_{ij} \sim N(0, \sigma^2): Independent errors.
  • E(Y_{ij}) = \mu_i, \quad var(Y_{ij}) = \sigma^2.

All observations are assumed to have equal variance across treatments.


Example: ANOVA with a=3 Treatments

Consider a case with three treatments (a=3), where each treatment has two replicates (n1=n2=n3=2). The response vector can be expressed in matrix form as:

\begin{pmatrix} Y_{11} \\ Y_{12} \\ Y_{21} \\ Y_{22} \\ Y_{31} \\ Y_{32} \end{pmatrix} = \begin{pmatrix} 1 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \mu_1 \\ \mu_2 \\ \mu_3 \end{pmatrix} + \begin{pmatrix} \epsilon_{11} \\ \epsilon_{12} \\ \epsilon_{21} \\ \epsilon_{22} \\ \epsilon_{31} \\ \epsilon_{32} \end{pmatrix}, \qquad \mathbf{y} = \mathbf{X}\beta + \epsilon

where:

  • X_{k,ij} = 1 if the k-th treatment is applied to unit (i, j).
  • X_{k,ij} = 0 otherwise.

Note: There is no intercept term in this model.

The least squares estimator for β is given by:

\mathbf{b} = \begin{pmatrix} \hat{\mu}_1 \\ \hat{\mu}_2 \\ \hat{\mu}_3 \end{pmatrix} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{y} = \begin{pmatrix} n_1 & 0 & 0 \\ 0 & n_2 & 0 \\ 0 & 0 & n_3 \end{pmatrix}^{-1} \begin{pmatrix} Y_{1.} \\ Y_{2.} \\ Y_{3.} \end{pmatrix} = \begin{pmatrix} \bar{Y}_{1.} \\ \bar{Y}_{2.} \\ \bar{Y}_{3.} \end{pmatrix}

Thus, the estimated treatment means are:

\hat{\mu}_i = \bar{Y}_{i.}, \quad i = 1, 2, 3

This estimator \mathbf{b} = [\bar{Y}_{1.}, \bar{Y}_{2.}, \bar{Y}_{3.}]' is the best linear unbiased estimator (BLUE) of \beta (i.e., E(\mathbf{b}) = \beta).
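
A sketch of the cell means fit in R (intercept removed), reusing the hypothetical `dat`; the coefficients are exactly the treatment sample means:

```r
# Cell means parameterization: drop the intercept so each coefficient is mu_i
fit_cell <- lm(y ~ trt - 1, data = dat)
coef(fit_cell)                  # equals tapply(dat$y, dat$trt, mean)
model.matrix(fit_cell)[1:3, ]   # indicator columns, one per treatment
```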

Since \mathbf{b} \sim N(\beta, \sigma^2 (\mathbf{X}'\mathbf{X})^{-1}), the variance of the estimated treatment means is:

var(\mathbf{b}) = \sigma^2 (\mathbf{X}'\mathbf{X})^{-1} = \sigma^2 \begin{pmatrix} 1/n_1 & 0 & 0 \\ 0 & 1/n_2 & 0 \\ 0 & 0 & 1/n_3 \end{pmatrix}

Thus, the variance of each estimated treatment mean is:

var(b_i) = var(\hat{\mu}_i) = \frac{\sigma^2}{n_i}, \quad i = 1, 2, 3

The mean squared error (MSE) is given by:

MSE = \frac{1}{N-a} \sum_{i=1}^{a} \sum_{j=1}^{n_i} (Y_{ij} - \bar{Y}_{i.})^2 = \frac{1}{N-a} \sum_{i=1}^{a} (n_i - 1) \underbrace{\left[\frac{1}{n_i - 1} \sum_{j=1}^{n_i} (Y_{ij} - \bar{Y}_{i.})^2\right]}_{= s_i^2} = \frac{1}{N-a} \sum_{i=1}^{a} (n_i - 1) s_i^2.

where s2i is the sample variance within the i-th treatment group.

Since E(s_i^2) = \sigma^2, we get:

E(MSE) = \frac{1}{N-a} \sum_i (n_i - 1)\sigma^2 = \sigma^2

Thus, MSE is an unbiased estimator of σ2, regardless of whether the treatment means are equal.


The expected mean square for treatments (MSTR) is:

E(MSTR) = \sigma^2 + \frac{\sum_i n_i (\mu_i - \mu_.)^2}{a - 1}

where:

\mu_. = \frac{\sum_{i=1}^{a} n_i \mu_i}{\sum_{i=1}^{a} n_i}

If all treatment means are equal (\mu_1 = \mu_2 = \dots = \mu_a = \mu_.), then:

E(MSTR) = \sigma^2


F-Test for Equality of Treatment Means

We test the null hypothesis:

H_0: \mu_1 = \mu_2 = \dots = \mu_a

against the alternative:

H_a: \text{at least one } \mu_i \text{ differs}

The test statistic is:

F = \frac{MSTR}{MSE}

  • Large values of F suggest rejecting H0 (since MSTR will be larger than MSE when Ha is true).
  • Values of F near 1 suggest that we fail to reject H0.

Since SSTR/\sigma^2 and SSE/\sigma^2 are independent chi-square random variables, the ratio of mean squares (each sum of squares scaled by its degrees of freedom) follows, under H_0:

F \sim F_{a-1, N-a}

Decision Rule:

  • If F \le F_{(a-1, N-a; 1-\alpha)}, fail to reject H_0.
  • If F > F_{(a-1, N-a; 1-\alpha)}, reject H_0.

If there are only two treatments (a=2), the ANOVA F-test reduces to the two-sample t-test:

F = t^2

where:

t = \frac{\bar{Y}_{1.} - \bar{Y}_{2.}}{\sqrt{MSE\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}}
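
A sketch of the one-way F-test in R, reusing the hypothetical `dat`, plus a check that F = t^2 when only two treatments are compared:

```r
# One-way fixed effects ANOVA F-test (a = 3 treatments)
fit <- aov(y ~ trt, data = dat)
summary(fit)   # SSTR, SSE, MSTR, MSE, F, and p-value

# With only two treatments, F equals the square of the pooled two-sample t statistic
dat2 <- subset(dat, trt %in% c("T1", "T2"))
dat2$trt <- droplevels(dat2$trt)
t_stat <- t.test(y ~ trt, data = dat2, var.equal = TRUE)$statistic
c(F = summary(aov(y ~ trt, data = dat2))[[1]]$"F value"[1],
  t_squared = t_stat^2)
```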


24.1.1.2 Treatment Effects (Factor Effects)

Besides cell means model, we have another way to formalize one-way ANOVA:

Y_{ij} = \mu + \tau_i + \epsilon_{ij}

where:

  • Y_{ij} is the j-th response for the i-th treatment.
  • \tau_i is the i-th treatment effect.
  • \mu is the constant component common to all observations.
  • \epsilon_{ij} are independent random errors, assumed to be normally distributed: \epsilon_{ij} \sim N(0, \sigma^2).

For example, if we have a=3 treatments and n1=n2=n3=2 observations per treatment, the model representation is:

\begin{pmatrix} Y_{11} \\ Y_{12} \\ Y_{21} \\ Y_{22} \\ Y_{31} \\ Y_{32} \end{pmatrix} = \begin{pmatrix} 1 & 1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 1 & 0 & 1 & 0 \\ 1 & 0 & 1 & 0 \\ 1 & 0 & 0 & 1 \\ 1 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} \mu \\ \tau_1 \\ \tau_2 \\ \tau_3 \end{pmatrix} + \begin{pmatrix} \epsilon_{11} \\ \epsilon_{12} \\ \epsilon_{21} \\ \epsilon_{22} \\ \epsilon_{31} \\ \epsilon_{32} \end{pmatrix}, \qquad \mathbf{y} = \mathbf{X}\beta + \epsilon

However, the matrix:

\mathbf{X}'\mathbf{X} = \begin{pmatrix} \sum_i n_i & n_1 & n_2 & n_3 \\ n_1 & n_1 & 0 & 0 \\ n_2 & 0 & n_2 & 0 \\ n_3 & 0 & 0 & n_3 \end{pmatrix}

is singular, meaning \mathbf{X}'\mathbf{X} is not invertible. This results in an infinite number of possible solutions for \mathbf{b}.

To resolve this, we impose restrictions on the parameters to ensure that X has full rank. Regardless of the restriction used, the expected value remains:

E(Y_{ij}) = \mu + \tau_i = \mu_i = \text{mean response for the } i\text{-th treatment}


24.1.1.2.1 Restriction on Sum of Treatment Effects

One common restriction is:

\sum_{i=1}^{a} \tau_i = 0

which implies that:

\mu = \frac{1}{a} \sum_{i=1}^{a} (\mu + \tau_i)

meaning that μ represents the grand mean (the overall mean response across treatments).

Each treatment effect can then be expressed as:

\tau_i = \mu_i - \mu = \text{treatment mean} - \text{grand mean}

Since \sum_i \tau_i = 0, we can solve for \tau_a as:

\tau_a = -(\tau_1 + \tau_2 + \dots + \tau_{a-1})

Thus, the mean for the a-th treatment is:

\mu_a = \mu + \tau_a = \mu - (\tau_1 + \tau_2 + \dots + \tau_{a-1})

This reduces the number of parameters from a+1 to just a, meaning we estimate:

\mu, \tau_1, \tau_2, \dots, \tau_{a-1}

Rewriting Equation (24.2):

\begin{pmatrix} Y_{11} \\ Y_{12} \\ Y_{21} \\ Y_{22} \\ Y_{31} \\ Y_{32} \end{pmatrix} = \begin{pmatrix} 1 & 1 & 0 \\ 1 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & 0 & 1 \\ 1 & -1 & -1 \\ 1 & -1 & -1 \end{pmatrix} \begin{pmatrix} \mu \\ \tau_1 \\ \tau_2 \end{pmatrix} + \begin{pmatrix} \epsilon_{11} \\ \epsilon_{12} \\ \epsilon_{21} \\ \epsilon_{22} \\ \epsilon_{31} \\ \epsilon_{32} \end{pmatrix}, \qquad \mathbf{y} = \mathbf{X}\beta + \epsilon

where \beta = [\mu, \tau_1, \tau_2]'.
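
A sketch of the sum-to-zero parameterization in R via `contr.sum`, reusing the hypothetical `dat`:

```r
# Sum-to-zero ("effects") coding: tau_1 + ... + tau_a = 0
fit_sum <- lm(y ~ trt, data = dat, contrasts = list(trt = contr.sum))
coef(fit_sum)            # intercept = mu (unweighted mean of treatment means), then tau_1, tau_2
-sum(coef(fit_sum)[-1])  # tau_3 recovered from the sum-to-zero restriction
```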


24.1.1.2.2 Restriction on the First τ

In R, the default parameterization in lm() for a one-way ANOVA model sets τ1=0. This effectively chooses the first treatment (or group) as a baseline or reference, making its treatment effect τ1 equal to zero.

Consider the last example with three treatments, each having two observations, n1=n2=n3=2. Under the restriction τ1=0, the treatment means can be expressed as:

\mu_1 = \mu + \tau_1 = \mu + 0 = \mu, \quad \mu_2 = \mu + \tau_2, \quad \mu_3 = \mu + \tau_3.

Hence, μ becomes the mean response for the first treatment.

We write the observations in vector form:

\mathbf{y} = \begin{pmatrix} Y_{11} \\ Y_{12} \\ Y_{21} \\ Y_{22} \\ Y_{31} \\ Y_{32} \end{pmatrix} = \underbrace{\begin{pmatrix} 1 & 0 & 0 \\ 1 & 0 & 0 \\ 1 & 1 & 0 \\ 1 & 1 & 0 \\ 1 & 0 & 1 \\ 1 & 0 & 1 \end{pmatrix}}_{\mathbf{X}} \begin{pmatrix} \mu \\ \tau_2 \\ \tau_3 \end{pmatrix} + \begin{pmatrix} \epsilon_{11} \\ \epsilon_{12} \\ \epsilon_{21} \\ \epsilon_{22} \\ \epsilon_{31} \\ \epsilon_{32} \end{pmatrix} = \mathbf{X}\beta + \epsilon,

where

\beta = \begin{pmatrix} \mu \\ \tau_2 \\ \tau_3 \end{pmatrix}.

The OLS estimator is:

\mathbf{b} = \begin{pmatrix} \hat{\mu} \\ \hat{\tau}_2 \\ \hat{\tau}_3 \end{pmatrix} = (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{y}.

In our specific case with equal sample sizes (n_1 = n_2 = n_3 = 2), the (\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{y} calculation yields:

\mathbf{b} = \begin{pmatrix} \sum_i n_i & n_2 & n_3 \\ n_2 & n_2 & 0 \\ n_3 & 0 & n_3 \end{pmatrix}^{-1} \begin{pmatrix} Y_{..} \\ Y_{2.} \\ Y_{3.} \end{pmatrix} = \begin{pmatrix} \bar{Y}_{1.} \\ \bar{Y}_{2.} - \bar{Y}_{1.} \\ \bar{Y}_{3.} - \bar{Y}_{1.} \end{pmatrix}

where \bar{Y}_{1.}, \bar{Y}_{2.}, and \bar{Y}_{3.} are the sample means for treatments 1, 2, and 3, respectively.

Taking the expectation of b confirms:

E(\mathbf{b}) = \beta = \begin{pmatrix} \mu \\ \tau_2 \\ \tau_3 \end{pmatrix} = \begin{pmatrix} \mu_1 \\ \mu_2 - \mu_1 \\ \mu_3 - \mu_1 \end{pmatrix}.

Recall that:

var(\mathbf{b}) = \sigma^2 (\mathbf{X}'\mathbf{X})^{-1}.

Hence,

var(\hat{\mu}) = var(\bar{Y}_{1.}) = \frac{\sigma^2}{n_1}, \quad var(\hat{\tau}_2) = var(\bar{Y}_{2.} - \bar{Y}_{1.}) = \frac{\sigma^2}{n_2} + \frac{\sigma^2}{n_1}, \quad var(\hat{\tau}_3) = var(\bar{Y}_{3.} - \bar{Y}_{1.}) = \frac{\sigma^2}{n_3} + \frac{\sigma^2}{n_1}.
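
A sketch of R's default reference-level (treatment) coding, reusing the hypothetical `dat`:

```r
# R default ("treatment") coding: the first factor level is the baseline, tau_1 = 0
fit_ref <- lm(y ~ trt, data = dat)
coef(fit_ref)
# (Intercept)  = Ybar_1.                        (mean of the reference treatment)
# trtT2, trtT3 = Ybar_2. - Ybar_1., Ybar_3. - Ybar_1.
```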


24.1.1.3 Equivalence of Parameterizations

Despite having different ways of writing the model, all three parameterizations yield the same ANOVA table:

  1. Model 1: Y_{ij} = \mu_i + \epsilon_{ij}.
  2. Model 2: Y_{ij} = \mu + \tau_i + \epsilon_{ij}, where \sum_i \tau_i = 0.
  3. Model 3: Y_{ij} = \mu + \tau_i + \epsilon_{ij}, where \tau_1 = 0.

All three lead to the same fitted values, because

\hat{\mathbf{Y}} = \mathbf{X}(\mathbf{X}'\mathbf{X})^{-1}\mathbf{X}'\mathbf{Y} = \mathbf{P}\mathbf{Y} = \mathbf{X}\mathbf{b}.


24.1.1.4 ANOVA Table

The generic form of the ANOVA table is:

Source of Variation          SS                                                                                                         df       MS                     F
Between Treatments           SSTR = \sum_i n_i (\bar{Y}_{i.} - \bar{Y}_{..})^2 = \mathbf{Y}'(\mathbf{P} - \mathbf{P}_1)\mathbf{Y}       a - 1    \frac{SSTR}{a-1}       \frac{MSTR}{MSE}
Error (within treatments)    SSE = \sum_i \sum_j (Y_{ij} - \bar{Y}_{i.})^2 = \mathbf{e}'\mathbf{e}                                      N - a    \frac{SSE}{N-a}
Total (corrected)            SSTO = \sum_i \sum_j (Y_{ij} - \bar{Y}_{..})^2 = \mathbf{Y}'\mathbf{Y} - \mathbf{Y}'\mathbf{P}_1\mathbf{Y}  N - 1

where \mathbf{P}_1 = \frac{1}{N}\mathbf{J}, N = \sum_i n_i, and \mathbf{J} is the N \times N matrix of ones.

The F-statistic has (a-1, N-a) degrees of freedom and its numeric value is unchanged under any of the three parameterizations. The slight difference lies in how we state the null hypothesis:

H_0: \mu_1 = \mu_2 = \dots = \mu_a, \qquad H_0: \mu + \tau_1 = \mu + \tau_2 = \dots = \mu + \tau_a, \qquad H_0: \tau_1 = \tau_2 = \dots = \tau_a.

The F-test here serves as a preliminary analysis, to see whether there is any difference among the treatment means. For more in-depth analysis, we consider different tests of treatment effects.

24.1.1.5 Testing of Treatment Effects

24.1.1.5.1 Single Treatment Mean

For a single treatment group, the sample mean serves as an estimate of the population mean:

\hat{\mu}_i = \bar{Y}_{i.}

where:

  • E(\bar{Y}_{i.}) = \mu_i, indicating unbiasedness.
  • var(\bar{Y}_{i.}) = \frac{\sigma^2}{n_i}, estimated by s^2(\bar{Y}_{i.}) = \frac{MSE}{n_i}.

Since the standardized test statistic

T = \frac{\bar{Y}_{i.} - \mu_i}{s(\bar{Y}_{i.})}

follows a t-distribution with N-a degrees of freedom (t_{N-a}), a (1-\alpha)100\% confidence interval for \mu_i is:

\bar{Y}_{i.} \pm t_{1-\alpha/2; N-a} \; s(\bar{Y}_{i.})

To test whether \mu_i is equal to some constant c, we set up the hypothesis:

H_0: \mu_i = c \qquad H_a: \mu_i \neq c

The test statistic:

T = \frac{\bar{Y}_{i.} - c}{s(\bar{Y}_{i.})} \sim t_{N-a}

Under H_0, we reject H_0 at the \alpha level if:

|T| > t_{1-\alpha/2; N-a}

24.1.1.5.2 Differences Between Treatment Means

The difference between two treatment means, also called a pairwise comparison, is given by:

D = \mu_i - \mu_{i'}

which is estimated by:

\hat{D} = \bar{Y}_{i.} - \bar{Y}_{i'.}

This estimate is unbiased since:

E(\hat{D}) = \mu_i - \mu_{i'}

Since \bar{Y}_{i.} and \bar{Y}_{i'.} are independent, the variance of \hat{D} is:

var(\hat{D}) = var(\bar{Y}_{i.}) + var(\bar{Y}_{i'.}) = \sigma^2 \left(\frac{1}{n_i} + \frac{1}{n_{i'}}\right)

which is estimated by:

s^2(\hat{D}) = MSE\left(\frac{1}{n_i} + \frac{1}{n_{i'}}\right)

Using the same inference structure as the single treatment mean:

\frac{\hat{D} - D}{s(\hat{D})} \sim t_{N-a}

A (1-\alpha)100\% confidence interval for D is:

\hat{D} \pm t_{1-\alpha/2; N-a} \; s(\hat{D})

For hypothesis testing:

H_0: \mu_i = \mu_{i'} \qquad H_a: \mu_i \neq \mu_{i'}

we use the test statistic:

T = \frac{\hat{D}}{s(\hat{D})} \sim t_{N-a}

We reject H_0 at the \alpha level if:

|T| > t_{1-\alpha/2; N-a}

24.1.1.5.3 Contrast Among Treatment Means

To generalize the comparison of two means, we introduce contrasts.

A contrast is a linear combination of treatment means:

L = \sum_{i=1}^{a} c_i \mu_i

where the coefficients c_i are non-random constants that satisfy the constraint:

\sum_{i=1}^{a} c_i = 0

This ensures that contrasts focus on relative comparisons rather than absolute magnitudes.

An unbiased estimator of L is given by:

\hat{L} = \sum_{i=1}^{a} c_i \bar{Y}_{i.}

Since expectation is a linear operator:

E(\hat{L}) = \sum_{i=1}^{a} c_i E(\bar{Y}_{i.}) = \sum_{i=1}^{a} c_i \mu_i = L

Thus, ˆL is an unbiased estimator of L.

Since the sample means ˉYi. are independent, the variance of ˆL is:

var(\hat{L}) = var\left(\sum_{i=1}^{a} c_i \bar{Y}_{i.}\right) = \sum_{i=1}^{a} c_i^2 \, var(\bar{Y}_{i.}) = \sum_{i=1}^{a} c_i^2 \frac{\sigma^2}{n_i} = \sigma^2 \sum_{i=1}^{a} \frac{c_i^2}{n_i}

Since \sigma^2 is unknown, we estimate it using the mean squared error:

s^2(\hat{L}) = MSE \sum_{i=1}^{a} \frac{c_i^2}{n_i}

Since \hat{L} is a linear combination of independent normal random variables, it follows a normal distribution:

\hat{L} \sim N\left(L, \; \sigma^2 \sum_{i=1}^{a} \frac{c_i^2}{n_i}\right)

Since SSE/\sigma^2 \sim \chi^2_{N-a} and MSE = SSE/(N-a), we use the t-distribution:

\frac{\hat{L} - L}{s(\hat{L})} \sim t_{N-a}

Thus, a (1α)100% confidence interval for L is:

\hat{L} \pm t_{1-\alpha/2; N-a} \; s(\hat{L})

To test whether a specific contrast equals zero:

H_0: L = 0 \quad \text{(no difference in the contrast)} \qquad H_a: L \neq 0 \quad \text{(significant contrast)}

We use the test statistic:

T = \frac{\hat{L}}{s(\hat{L})} \sim t_{N-a}

We reject H_0 at the \alpha level if:

|T| > t_{1-\alpha/2; N-a}
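
A by-hand sketch of estimating and testing a contrast, reusing the hypothetical `dat` and a hypothetical contrast L = \mu_1 - (\mu_2 + \mu_3)/2:

```r
# Contrast L = mu_1 - (mu_2 + mu_3)/2; coefficients sum to zero
cvec <- c(1, -0.5, -0.5)
ni   <- tapply(dat$y, dat$trt, length)
ybar <- tapply(dat$y, dat$trt, mean)
mse  <- summary(aov(y ~ trt, data = dat))[[1]]$"Mean Sq"[2]  # residual mean square
df_e <- sum(ni) - length(ni)                                  # N - a

L_hat <- sum(cvec * ybar)
se_L  <- sqrt(mse * sum(cvec^2 / ni))
t_obs <- L_hat / se_L

c(estimate = L_hat,
  lower    = L_hat - qt(0.975, df_e) * se_L,
  upper    = L_hat + qt(0.975, df_e) * se_L,
  p_value  = 2 * pt(-abs(t_obs), df_e))
```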

24.1.1.5.4 Linear Combination of Treatment Means

A linear combination of treatment means extends the idea of a contrast:

L = \sum_{i=1}^{a} c_i \mu_i

Unlike contrasts, there are no restrictions on the coefficients ci (i.e., they do not need to sum to zero).

Since tests on a single treatment mean, pairwise differences, and contrasts are all special cases of this general form, we can express the hypothesis test as:

H_0: \sum_{i=1}^{a} c_i \mu_i = c \qquad H_a: \sum_{i=1}^{a} c_i \mu_i \neq c

The test statistic follows a t-distribution:

T = \frac{\hat{L} - c}{s(\hat{L})} \sim t_{N-a}

Since squaring a t-distributed variable results in an F-distributed variable,

F = T^2 \sim F_{1, N-a}

This means that all such tests can be viewed as single-degree-of-freedom F-tests, since the numerator degrees of freedom is always 1.


Multiple Contrasts

When testing k \ge 2 contrasts simultaneously, the test statistics T_1, T_2, \dots, T_k follow a multivariate t-distribution, since they are dependent (as they are based on the same data).

Limitations of Multiple Comparisons

  1. Inflation of Type I Error:
    The confidence coefficient (1-\alpha) applies to a single estimate, not a series of estimates. Similarly, the Type I error rate \alpha applies to an individual test, not a collection of tests.

    Example: If three t-tests are performed at α=0.05, and if they were independent (which they are not), then:

    (1 - 0.05)^3 = 0.857

    meaning the overall Type I error rate would be approximately 0.143, not 0.05.

  2. Data Snooping Concern:
    The significance level α is valid only if the test was planned before examining the data.

    • Often, an experiment suggests relationships to investigate.
    • Exploring effects based on observed data is known as data snooping.

To address these issues, we use Multiple Comparison Procedures, such as:

  • Tukey – for all pairwise comparisons of treatment means.
  • Scheffé – for all possible contrasts.
  • Bonferroni – for a fixed number of planned comparisons.

24.1.1.5.4.1 Tukey

Used for all pairwise comparisons of treatment means:

D = \mu_i - \mu_{i'}

Hypothesis test:

H_0: \mu_i - \mu_{i'} = 0 \qquad H_a: \mu_i - \mu_{i'} \neq 0

Properties:

  • When sample sizes are equal (n1=n2=...=na), the family confidence coefficient is exactly (1α).
  • When sample sizes are unequal, the method is conservative (i.e., the actual significance level is less than α).

The Tukey test is based on the studentized range:

w = \max_i(Y_i) - \min_i(Y_i)

If Y_1, ..., Y_r are observations from a normal distribution with mean \mu and variance \sigma^2, then the statistic:

q(r, v) = \frac{w}{s}

follows the studentized range distribution, which requires a special table.

Notes:

  • When testing only a subset of pairwise comparisons, the confidence coefficient exceeds (1-\alpha), making the test more conservative.

  • Tukey’s method can be used for data snooping, as long as the investigated effects are pairwise comparisons.
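
A minimal sketch of Tukey's procedure in R, reusing the hypothetical `dat` from earlier:

```r
# Tukey HSD: simultaneous CIs and adjusted p-values for all pairwise differences
fit <- aov(y ~ trt, data = dat)
TukeyHSD(fit, conf.level = 0.95)
```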


24.1.1.5.4.2 Scheffé

Scheffé’s method is used for testing all possible contrasts:

L = \sum_{i=1}^{a} c_i \mu_i, \quad \text{where} \quad \sum_{i=1}^{a} c_i = 0

Hypothesis test:

\begin{aligned} &H_0: L = 0 \\ &H_a: L \neq 0 \end{aligned}

Properties:

  • Valid for any set of contrasts, making it the most general multiple comparison procedure.
  • The family confidence level is exactly (1-\alpha), regardless of sample sizes.

Simultaneous Confidence Intervals:

\hat{L} \pm S s(\hat{L})

where:

  • \hat{L} = \sum c_i \bar{Y}_{i.}
  • s^2(\hat{L}) = MSE \sum \frac{c_i^2}{n_i}
  • S^2 = (a-1) f_{1-\alpha; a-1, N-a}

Test Statistic:

F = \frac{\hat{L}^2}{(a-1) s^2(\hat{L})}

We reject H_0 if:

F > f_{1-\alpha; a-1, N-a}

Notes:

  • Finite Family Correction: Since we never test all possible contrasts in practice, the actual confidence coefficient is greater than (1-\alpha). Thus, some researchers use a higher \alpha (e.g., a 90% confidence level instead of 95%).
  • Scheffé is useful for data snooping, since it applies to any contrast.
  • If only pairwise comparisons are needed, Tukey’s method gives narrower confidence intervals than Scheffé.

24.1.1.5.4.3 Bonferroni

The Bonferroni correction is applicable regardless of whether sample sizes are equal or unequal. It is particularly useful when a small number of planned comparisons are of interest.

A (1-\alpha)100\% simultaneous confidence interval for a set of g comparisons is:

\hat{L} \pm B s(\hat{L})

where:

B = t_{1-\alpha/(2g), N-a}

and g is the number of comparisons in the family.

To test:

\begin{aligned} &H_0: L = 0 \\ &H_a: L \neq 0 \end{aligned}

we use the test statistic:

T = \frac{\hat{L}}{s(\hat{L})}

Reject H_0 if:

|T| > t_{1-\alpha/(2g),N-a}

Notes:

  • If all pairwise comparisons are needed, Tukey’s method is superior, as it provides narrower confidence intervals.
  • Bonferroni is better than Scheffé when the number of contrasts is similar to or smaller than the number of treatment levels.
  • Practical recommendation: Compute Tukey, Scheffé, and Bonferroni and use the method with the smallest confidence intervals.
  • Bonferroni cannot be used for data snooping, as it assumes the comparisons were planned before examining the data.
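
A sketch of Bonferroni-adjusted pairwise tests in base R, reusing the hypothetical `dat`; the multiplier B is shown for a hypothetical family of g = 3 planned comparisons:

```r
# Bonferroni-adjusted pairwise comparisons using the pooled SD (i.e., sqrt(MSE))
pairwise.t.test(dat$y, dat$trt, p.adjust.method = "bonferroni", pool.sd = TRUE)

# Bonferroni CI multiplier B = t_{1 - alpha/(2g), N-a} for g planned comparisons
g <- 3
qt(1 - 0.05 / (2 * g), df = nrow(dat) - nlevels(dat$trt))
```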

24.1.1.5.4.4 Fisher’s Least Significant Difference

The Fisher LSD method does not control the family-wise error rate (refer to 16.3), meaning it does not correct for multiple comparisons. However, it can be useful for exploratory analysis when a preliminary ANOVA is significant.

The hypothesis test for comparing two treatment means:

H_0: \mu_i = \mu_j

uses the t-statistic:

t = \frac{\bar{Y}_i - \bar{Y}_j}{\sqrt{MSE \left(\frac{1}{n_i} + \frac{1}{n_j}\right)}}

where:

  • \bar{Y}_i and \bar{Y}_j are the sample means for treatments i and j.

  • MSE is the mean squared error from ANOVA.

  • n_i, n_j are the sample sizes for groups i and j.

Notes:

  • The LSD method does not adjust for multiple comparisons, which increases the Type I error rate.
  • It is only valid if the overall ANOVA is significant (i.e., the global null hypothesis of no treatment effect is rejected).
  • Tukey and Bonferroni methods are preferred when many comparisons are made.

24.1.1.5.4.5 Newman-Keuls

The Newman-Keuls procedure is a stepwise multiple comparison test similar to Tukey’s method but less rigorous.

Key Issues:

  • Unlike Tukey, Newman-Keuls does not control the family-wise error rate.
  • It has less power than ANOVA.
  • It is rarely recommended in modern statistical practice.
  • Using the Newman-Keuls test is therefore not recommended.
24.1.1.5.4.6 Summary of Multiple Comparison Procedures
  • Tukey: all pairwise comparisons; controls the family-wise error rate. Best used for comparing all treatment means. Strengths: exact confidence level when sample sizes are equal; more powerful than Scheffé for pairwise tests. Weaknesses: conservative if sample sizes are unequal.
  • Scheffé: all possible contrasts; controls the family-wise error rate. Best used for exploratory analysis, especially when interested in any contrast. Strengths: valid for any contrast; can be used for data snooping. Weaknesses: confidence intervals wider than Tukey for pairwise comparisons.
  • Bonferroni: a fixed number of planned comparisons; controls the family-wise error rate. Best used for a small number of pre-specified tests. Strengths: simple and flexible; better than Scheffé for few comparisons. Weaknesses: less powerful than Tukey for many pairwise tests; cannot be used for data snooping.
  • Fisher’s LSD: pairwise comparisons; does not control the family-wise error rate. Best used for exploratory comparisons after a significant ANOVA. Strengths: most powerful for pairwise comparisons when ANOVA is significant. Weaknesses: inflates the Type I error rate; not valid without a significant ANOVA.
  • Newman-Keuls: pairwise comparisons; does not control the family-wise error rate. Less power than ANOVA; generally not recommended.
24.1.1.5.4.7 Dunnett’s Test

In some experiments, instead of comparing all treatment groups against each other, we are specifically interested in comparing each treatment to a control. This is common in clinical trials or A/B testing, where one group serves as a baseline.

Dunnett’s test is designed for experiments with a groups, where:

  • One group is the control (e.g., placebo or standard treatment).

  • The remaining a-1 groups are treatment groups.

Thus, we perform a-1 pairwise comparisons:

D_i = \mu_i - \mu_c, \quad i = 1, \dots, a-1

where \mu_c is the mean of the control group.

Dunnett’s Test vs. Other Methods

  • Unlike Tukey’s method (which compares all pairs), Dunnett’s method only compares treatments to the control.
  • Dunnett’s test controls the family-wise error rate, making it more powerful than Bonferroni for this scenario.
  • If the goal is to compare treatments against each other as well, Tukey’s method is preferable.
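
A sketch of Dunnett's comparisons, assuming the multcomp package is available and that the first level of `trt` in the hypothetical `dat` plays the role of the control:

```r
# Dunnett comparisons: each treatment vs. the control (first factor level)
library(multcomp)

fit  <- aov(y ~ trt, data = dat)
dunn <- glht(fit, linfct = mcp(trt = "Dunnett"))
summary(dunn)   # adjusted p-values for the a - 1 treatment-vs-control tests
confint(dunn)   # simultaneous confidence intervals
```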

24.1.2 Single Factor Random Effects ANOVA

Also known as an ANOVA Type II model, the single factor random effects model assumes that treatments are randomly selected from a larger population. Thus, inference extends beyond the observed treatments to the entire population of treatments.


24.1.2.1 Random Cell Means Model

The model is given by:

Y_{ij} = \mu_i + \epsilon_{ij}

where:

  • \mu_i \sim N(\mu, \sigma^2_{\mu}), independent across treatments.
  • \epsilon_{ij} \sim N(0, \sigma^2), independent across observations.
  • \mu_i and \epsilon_{ij} are mutually independent for i = 1, \dots, a and j = 1, \dots, n.

When all treatment sample sizes are equal:

\begin{aligned} E(Y_{ij}) &= E(\mu_i) = \mu \\ var(Y_{ij}) &= var(\mu_i) + var(\epsilon_{ij}) = \sigma^2_{\mu} + \sigma^2 \end{aligned}


24.1.2.1.1 Covariance Structure

Since Y_{ij} are not independent, we calculate their covariances:

  1. Same treatment group (i fixed, j \neq j'):

\begin{aligned} cov(Y_{ij}, Y_{ij'}) &= E(Y_{ij} Y_{ij'}) - E(Y_{ij}) E(Y_{ij'}) \\ &= E(\mu_i^2 + \mu_i \epsilon_{ij'} + \mu_i \epsilon_{ij} + \epsilon_{ij} \epsilon_{ij'}) - \mu^2 \\ &= \sigma^2_{\mu} + \mu^2 - \mu^2 \\ &= \sigma^2_{\mu} \end{aligned}

  1. Different treatment groups (i \neq i'):

\begin{aligned} cov(Y_{ij}, Y_{i'j'}) &= E(\mu_i \mu_{i'} + \mu_i \epsilon_{i'j'} + \mu_{i'} \epsilon_{ij} + \epsilon_{ij} \epsilon_{i'j'}) - \mu^2 \\ &= \mu^2 - \mu^2 = 0 \end{aligned}

Thus:

  • All observations have the same variance: var(Y_{ij}) = \sigma^2_{\mu} + \sigma^2.
  • Observations from the same treatment have covariance: \sigma^2_{\mu}.
  • Observations from different treatments are uncorrelated.

The intraclass correlation between two responses from the same treatment:

\rho(Y_{ij}, Y_{ij'}) = \frac{\sigma^2_{\mu}}{\sigma^2_{\mu} + \sigma^2}, \quad j \neq j'


24.1.2.1.2 Inference for Random Effects Model

The Intraclass Correlation Coefficient:

\frac{\sigma^2_{\mu}}{\sigma^2 + \sigma^2_{\mu}}

measures the proportion of total variability in Y_{ij} that is accounted for by treatment differences.

To test whether treatments contribute significantly to variance:

\begin{aligned} &H_0: \sigma_{\mu}^2 = 0 \quad \text{(No treatment effect, all $\mu_i = \mu$)} \\ &H_a: \sigma_{\mu}^2 \neq 0 \end{aligned}

Under H_0, an ANOVA F-test is used:

F = \frac{MSTR}{MSE}

where:

  • MSTR (Mean Square for Treatments) captures variation between treatments.
  • MSE (Mean Square Error) captures variation within treatments.

If H_0 is true, then:

F \sim F_{(a-1, a(n-1))}

Reject H_0 if:

F > f_{(1-\alpha; a-1, a(n-1))}


24.1.2.1.3 Comparison: Fixed Effects vs. Random Effects Models

Although ANOVA calculations are the same for fixed and random effects models, the interpretation of results differs.

Random Effects Model Fixed Effects Model
E(MSE) = \sigma^2 E(MSE) = \sigma^2
E(MSTR) = \sigma^2 + n \sigma^2_{\mu} E(MSTR) = \sigma^2 + \frac{ \sum_i n_i (\mu_i - \mu)^2}{a-1}
  • If \sigma^2_{\mu} = 0, then E(MSTR) = E(MSE), implying no treatment effect.
  • Otherwise, E(MSTR) > E(MSE), suggesting significant treatment variation.

When sample sizes are not equal, the F-test remains valid, but the degrees of freedom change to:

F \sim F_{(a-1, N-a)}


24.1.2.1.4 Estimation of \mu

An unbiased estimator of E(Y_{ij}) = \mu is the grand mean:

\hat{\mu} = \bar{Y}_{..} = \frac{1}{a n} \sum_{i=1}^{a} \sum_{j=1}^{n} Y_{ij}

The variance of this estimator is:

\begin{aligned} var(\bar{Y}_{..}) &= var\left(\frac{1}{a} \sum_{i=1}^{a} \bar{Y}_{i.} \right) \\ &= \frac{1}{a^2} \sum_{i=1}^{a} var(\bar{Y}_{i.}) \\ &= \frac{1}{a^2} \sum_{i=1}^{a} \left(\sigma^2_\mu + \frac{\sigma^2}{n} \right) \\ &= \frac{n \sigma^2_{\mu} + \sigma^2}{a n} \end{aligned}

An unbiased estimator of this variance is:

s^2(\bar{Y}_{..}) = \frac{MSTR}{a n}

Since:

\frac{\bar{Y}_{..} - \mu}{s(\bar{Y}_{..})} \sim t_{a-1}

A (1-\alpha)100\% confidence interval for \mu is:

\bar{Y}_{..} \pm t_{1-\alpha/2; a-1} s(\bar{Y}_{..})


24.1.2.1.5 Estimation of Intraclass Correlation Coefficient \frac{\sigma^2_\mu}{\sigma^2_{\mu}+\sigma^2}

In both random and fixed effects models, MSTR and MSE are independent.

When sample sizes are equal (n_i = n for all i), the test statistic:

\frac{\frac{MSTR}{n\sigma^2_\mu + \sigma^2}}{\frac{MSE}{\sigma^2}} \sim F_{a-1, a(n-1)}

A (1-\alpha)100\% confidence interval for \frac{\sigma^2_\mu}{\sigma^2_\mu + \sigma^2} follows from:

P\left(f_{\alpha/2; a-1, a(n-1)} \leq \frac{\frac{MSTR}{n\sigma^2_\mu + \sigma^2}}{\frac{MSE}{\sigma^2}} \leq f_{1-\alpha/2; a-1, a(n-1)} \right) = 1 - \alpha

Defining:

\begin{aligned} L &= \frac{1}{n} \left( \frac{MSTR}{MSE} \times \frac{1}{f_{1-\alpha/2; a-1, a(n-1)}} - 1 \right) \\ U &= \frac{1}{n} \left( \frac{MSTR}{MSE} \times \frac{1}{f_{\alpha/2; a-1, a(n-1)}} - 1 \right) \end{aligned}

The lower and upper confidence limits for \frac{\sigma^2_\mu}{\sigma^2_\mu + \sigma^2} are:

\begin{aligned} L^* &= \frac{L}{1+L} \\ U^* &= \frac{U}{1+U} \end{aligned}

If L^* is negative, we customarily set it to 0.
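
A small R helper implementing these limits for hypothetical mean squares (balanced design assumed):

```r
# Approximate (1 - alpha) CI for the intraclass correlation in a balanced
# one-way random effects design, following the L and U formulas above
icc_ci <- function(MSTR, MSE, a, n, alpha = 0.05) {
  Fobs <- MSTR / MSE
  L <- (Fobs / qf(1 - alpha / 2, a - 1, a * (n - 1)) - 1) / n
  U <- (Fobs / qf(alpha / 2,     a - 1, a * (n - 1)) - 1) / n
  c(lower = max(L / (1 + L), 0), upper = U / (1 + U))
}

icc_ci(MSTR = 30, MSE = 5, a = 4, n = 6)  # hypothetical mean squares
```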


24.1.2.1.6 Estimation of \sigma^2

Since:

\frac{a(n-1) MSE}{\sigma^2} \sim \chi^2_{a(n-1)}

A (1-\alpha)100\% confidence interval for \sigma^2 is:

\frac{a(n-1) MSE}{\chi^2_{1-\alpha/2; a(n-1)}} \leq \sigma^2 \leq \frac{a(n-1) MSE}{\chi^2_{\alpha/2; a(n-1)}}

If sample sizes are unequal, the same formula applies, but the degrees of freedom change to:

df = N - a


24.1.2.1.7 Estimation of \sigma^2_\mu

From the expectations:

E(MSE) = \sigma^2, \quad E(MSTR) = \sigma^2 + n\sigma^2_\mu

we solve for \sigma^2_{\mu}:

\sigma^2_{\mu} = \frac{E(MSTR) - E(MSE)}{n}

An unbiased estimator of \sigma^2_\mu is:

s^2_\mu = \frac{MSTR - MSE}{n}

If s^2_\mu < 0, we set s^2_\mu = 0 (since variances cannot be negative).

If sample sizes are unequal, we replace n with an effective sample size n':

s^2_\mu = \frac{MSTR - MSE}{n'}

where:

n' = \frac{1}{a-1} \left(\sum_i n_i - \frac{\sum_i n_i^2}{\sum_i n_i} \right)


There are no exact confidence intervals for \sigma^2_\mu, but we can approximate them using the Satterthwaite procedure.

24.1.2.1.7.1 Satterthwaite Approximation

A linear combination of expected mean squares:

\sigma^2_\mu = \frac{1}{n} E(MSTR) + \left(-\frac{1}{n}\right) E(MSE)

For a general linear combination:

S = d_1 E(MS_1) + \dots + d_h E(MS_h)

where d_i are coefficients, an unbiased estimator of S is:

\hat{S} = d_1 MS_1 + \dots + d_h MS_h

Let df_i be the degrees of freedom associated with each mean square MS_i. The Satterthwaite approximation states:

\frac{(df) \hat{S}}{S} \sim \chi^2_{df}

where the degrees of freedom are approximated as:

df = \frac{(d_1 MS_1 + \dots + d_h MS_h)^2}{\sum_{i=1}^{h} \frac{(d_i MS_i)^2}{df_i}}


Applying the Satterthwaite method to the single factor random effects model:

\frac{(df) s^2_\mu}{\chi^2_{1-\alpha/2; df}} \leq \sigma^2_\mu \leq \frac{(df) s^2_\mu}{\chi^2_{\alpha/2; df}}

where the approximate degrees of freedom are:

df = \frac{(s^2_\mu)^2}{\frac{(MSTR/n)^2}{a-1} + \frac{(MSE/n)^2}{a(n-1)}}
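
A small R helper implementing the Satterthwaite interval for \sigma^2_\mu with hypothetical mean squares (balanced design assumed):

```r
# Satterthwaite approximate CI for sigma^2_mu in a balanced design
satterthwaite_ci <- function(MSTR, MSE, a, n, alpha = 0.05) {
  s2_mu <- (MSTR - MSE) / n
  d     <- c(1 / n, -1 / n)                    # coefficients on (MSTR, MSE)
  ms    <- c(MSTR, MSE)
  dfs   <- c(a - 1, a * (n - 1))
  df    <- sum(d * ms)^2 / sum((d * ms)^2 / dfs)
  c(estimate = s2_mu,
    lower = df * s2_mu / qchisq(1 - alpha / 2, df),
    upper = df * s2_mu / qchisq(alpha / 2, df))
}

satterthwaite_ci(MSTR = 30, MSE = 5, a = 4, n = 6)  # hypothetical mean squares
```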


24.1.2.2 Random Treatment Effects Model

In a random effects model, treatment levels are considered random samples from a larger population of possible treatments. The model accounts for variability across all potential treatments, not just those observed in the study.

We define the random treatment effect as:

\tau_i = \mu_i - E(\mu_i) = \mu_i - \mu

where \tau_i represents the deviation of treatment mean \mu_i from the overall mean \mu.

Thus, we rewrite treatment means as:

\mu_i = \mu + \tau_i

Substituting this into the response model:

Y_{ij} = \mu + \tau_i + \epsilon_{ij}

where:

  • \mu = common mean across all observations.
  • \tau_i \sim N(0, \sigma^2_\tau), random treatment effects, assumed independent.
  • \epsilon_{ij} \sim N(0, \sigma^2), random error terms, also independent.
  • \tau_{i} and \epsilon_{ij} are mutually independent for i = 1, \dots, a and j = 1, \dots, n.
  • We consider only balanced single-factor ANOVA (equal sample sizes across treatments).

24.1.2.3 Diagnostic Measures for Model Assumptions

Checking assumptions is crucial for valid inference. Common issues include:

Issue Diagnostic Tools
Non-constant error variance (heteroscedasticity) Residual plots, Levene’s test, Hartley’s test
Non-independence of errors Residual plots, Durbin-Watson test (for autocorrelation)
Outliers Boxplots, residual plots, regression influence measures (e.g., Cook’s distance)
Non-normality of errors Histogram, Q-Q plot, Shapiro-Wilk test, Anderson-Darling test
Omitted variable bias Residual plots, checking for unaccounted sources of variation

24.1.2.4 Remedial Measures

If diagnostic checks indicate violations of assumptions, possible solutions include:

  • Transformations of the response (e.g., Box-Cox) to stabilize variance or improve normality.
  • Weighted least squares when error variances are unequal.
  • Nonparametric alternatives (e.g., the Kruskal-Wallis rank test) when normality is doubtful.


24.1.2.5 Key Notes on Robustness

  • Fixed effects ANOVA is relatively robust to:
    • Non-normality, particularly when sample sizes are moderate to large.
    • Unequal variances when sample sizes are roughly equal.
    • F-test and multiple comparisons remain valid under mild violations.
  • Random effects ANOVA is sensitive to:
    • Lack of independence, which severely affects both fixed and random effects models.
    • Unequal variances, particularly when estimating variance components.

24.1.3 Two-Factor Fixed Effects ANOVA

A multi-factor experiment offers several advantages:

  • Higher efficiency – More precise estimates with fewer observations.
  • Increased information – Allows for testing interactions between factors.
  • Greater validity – Reduces confounding by controlling additional sources of variation.

Balanced Two-Factor ANOVA: Assumptions

  • Equal sample sizes for all treatment combinations.
  • All treatment means are of equal importance (no weighting).
  • Factors are categorical and chosen purposefully.

We assume:

  • Factor A has a levels and Factor B has b levels.
  • All a \times b factor level combinations are included.
  • Each treatment combination has n replications.
  • The total number of observations:
    N = abn

24.1.3.1 Cell Means Model

The response is modeled as:

Y_{ijk} = \mu_{ij} + \epsilon_{ijk}

where:

  • \mu_{ij} are fixed parameters (cell means).
  • i = 1, \dots, a represents levels of Factor A.
  • j = 1, \dots, b represents levels of Factor B.
  • \epsilon_{ijk} \sim \text{independent } N(0, \sigma^2) for all i, j, k.

Expected values and variance:

\begin{aligned} E(Y_{ijk}) &= \mu_{ij} \\ var(Y_{ijk}) &= var(\epsilon_{ijk}) = \sigma^2 \end{aligned}

Thus:

Y_{ijk} \sim \text{independent } N(\mu_{ij}, \sigma^2)

This can be expressed in matrix notation:

\mathbf{Y} = \mathbf{X} \beta + \epsilon

where:

\begin{aligned} E(\mathbf{Y}) &= \mathbf{X} \beta \\ var(\mathbf{Y}) &= \sigma^2 \mathbf{I} \end{aligned}


24.1.3.1.1 Interaction Effects

Interaction measures whether the effect of one factor depends on the level of the other factor. It is defined as:

(\alpha \beta)_{ij} = \mu_{ij} - (\mu_{..} + \alpha_i + \beta_j)

where:

  • Grand mean:
    \mu_{..} = \frac{1}{ab} \sum_i \sum_j \mu_{ij}
  • Main effect for Factor A (average effect of level i):
    \alpha_i = \mu_{i.} - \mu_{..}
  • Main effect for Factor B (average effect of level j):
    \beta_j = \mu_{.j} - \mu_{..}
  • Interaction effect:
    (\alpha \beta)_{ij} = \mu_{ij} - \mu_{i.} - \mu_{.j} + \mu_{..}

To determine whether interactions exist:

  1. Check if all \mu_{ij} can be written as sums \mu_{..} + \alpha_i + \beta_j
    (i.e., check if interaction terms are zero).
  2. Compare mean differences across levels of Factor B at each level of Factor A.
  3. Compare mean differences across levels of Factor A at each level of Factor B.
  4. Graphical method:
    • Plot treatment means for each level of Factor B.
    • If lines are not parallel, an interaction exists.
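
For the graphical check in step 4, a minimal sketch using base R's interaction.plot with a hypothetical two-factor data set `d2`:

```r
# Interaction plot: roughly parallel profiles suggest no interaction
# (hypothetical factors A and B with n = 4 replicates per cell)
set.seed(2)
d2 <- expand.grid(A = factor(1:3), B = factor(1:2), rep = 1:4)
d2$y <- 5 + as.numeric(d2$A) + 2 * (d2$B == "2") +
        1.5 * (d2$A == "3" & d2$B == "2") + rnorm(nrow(d2))

with(d2, interaction.plot(x.factor = A, trace.factor = B, response = y,
                          xlab = "Factor A", ylab = "Mean response"))
```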

The interaction terms satisfy:

For each level of Factor B:

\sum_i (\alpha \beta)_{ij} = \sum_i \left(\mu_{ij} - \mu_{..} - \alpha_i - \beta_j \right)

Expanding:

\begin{aligned} \sum_i (\alpha \beta)_{ij} &= \sum_i \mu_{ij} - a \mu_{..} - \sum_i \alpha_i - a \beta_j \\ &= a \mu_{.j} - a \mu_{..} - \sum_i (\mu_{i.} - \mu_{..}) - a (\mu_{.j} - \mu_{..}) \\ &= a \mu_{.j} - a \mu_{..} - a \mu_{..}+ a \mu_{..} - a (\mu_{.j} - \mu_{..}) \\ &= 0 \end{aligned}

Similarly:

\sum_j (\alpha \beta)_{ij} = 0, \quad i = 1, \dots, a

and:

\sum_i \sum_j (\alpha \beta)_{ij} = 0, \quad \sum_i \alpha_i = 0, \quad \sum_j \beta_j = 0


24.1.3.2 Factor Effects Model

In the Factor Effects Model, we express the response as:

\begin{aligned} \mu_{ij} &= \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} \\ Y_{ijk} &= \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk} \end{aligned}

where:

  • \mu_{..} is the grand mean.
  • \alpha_i are main effects for Factor A, subject to:
    \sum_i \alpha_i = 0
  • \beta_j are main effects for Factor B, subject to:
    \sum_j \beta_j = 0
  • (\alpha \beta)_{ij} are interaction effects, subject to:
    \sum_i (\alpha \beta)_{ij} = 0, \quad j = 1, \dots, b
    \sum_j (\alpha \beta)_{ij} = 0, \quad i = 1, \dots, a
  • \epsilon_{ijk} \sim \text{independent } N(0, \sigma^2) for k = 1, \dots, n.

Thus, we have:

\begin{aligned} E(Y_{ijk}) &= \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} \\ var(Y_{ijk}) &= \sigma^2 \\ Y_{ijk} &\sim N (\mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij}, \sigma^2) \end{aligned}


24.1.3.3 Parameter Counting and Restrictions

The Cell Means Model has ab parameters corresponding to each combination of factor levels.
In the Factor Effects Model, the imposed constraints reduce the number of estimable parameters:

Parameter Count
\mu_{..} 1
\alpha_i (Main effects for A) a-1 (due to constraint \sum_i \alpha_i = 0)
\beta_j (Main effects for B) b-1 (due to constraint \sum_j \beta_j = 0)
(\alpha \beta)_{ij} (Interaction effects) (a-1)(b-1) (due to two constraints)

Thus, the total number of parameters:

1 + (a-1) + (b-1) + (a-1)(b-1) = ab

which matches the number of parameters in the Cell Means Model.


To uniquely estimate parameters, we apply constraints:

\begin{aligned} \alpha_a &= -(\alpha_1 + \alpha_2 + \dots + \alpha_{a-1}) \\ \beta_b &= -(\beta_1 + \beta_2 + \dots + \beta_{b-1}) \\ (\alpha \beta)_{ib} &= -(\alpha \beta)_{i1} - (\alpha \beta)_{i2} - \dots - (\alpha \beta)_{i,b-1}, \quad i = 1, \dots, a \\ (\alpha \beta)_{aj} &= -(\alpha \beta)_{1j} - (\alpha \beta)_{2j} - \dots - (\alpha \beta)_{a-1,j}, \quad j = 1, \dots, b \end{aligned}

The model can be fitted using least squares or maximum likelihood estimation.


24.1.3.3.1 Cell Means Model Estimation

Minimizing:

Q = \sum_i \sum_j \sum_k (Y_{ijk} - \mu_{ij})^2

yields estimators:

\begin{aligned} \hat{\mu}_{ij} &= \bar{Y}_{ij} \\ \hat{Y}_{ijk} &= \bar{Y}_{ij} \\ e_{ijk} &= Y_{ijk} - \hat{Y}_{ijk} = Y_{ijk} - \bar{Y}_{ij} \end{aligned}

where e_{ijk} \sim \text{independent } N(0, \sigma^2).


24.1.3.3.2 Factor Effects Model Estimation

Minimizing:

Q = \sum_i \sum_j \sum_k (Y_{ijk} - \mu_{..} - \alpha_i - \beta_j - (\alpha \beta)_{ij})^2

subject to the constraints:

\begin{aligned} \sum_i \alpha_i &= 0 \\ \sum_j \beta_j &= 0 \\ \sum_i (\alpha \beta)_{ij} &= 0, \quad j = 1, \dots, b \\ \sum_j (\alpha \beta)_{ij} &= 0, \quad i = 1, \dots, a \end{aligned}

yields estimators:

\begin{aligned} \hat{\mu}_{..} &= \bar{Y}_{...} \\ \hat{\alpha}_i &= \bar{Y}_{i..} - \bar{Y}_{...} \\ \hat{\beta}_j &= \bar{Y}_{.j.} - \bar{Y}_{...} \\ (\hat{\alpha \beta})_{ij} &= \bar{Y}_{ij.} - \bar{Y}_{i..} - \bar{Y}_{.j.} + \bar{Y}_{...} \end{aligned}


The fitted values are:

\hat{Y}_{ijk} = \bar{Y}_{...} + (\bar{Y}_{i..} - \bar{Y}_{...}) + (\bar{Y}_{.j.} - \bar{Y}_{...}) + (\bar{Y}_{ij.} - \bar{Y}_{i..} - \bar{Y}_{.j.} + \bar{Y}_{...})

which simplifies to:

\hat{Y}_{ijk} = \bar{Y}_{ij.}

The residuals are:

e_{ijk} = Y_{ijk} - \bar{Y}_{ij.}

and follow:

e_{ijk} \sim \text{independent } N(0, \sigma^2)


The variances of the estimated effects are:

\begin{aligned} s^2_{\hat{\mu}_{..}} &= \frac{MSE}{nab} \\ s^2_{\hat{\alpha}_i} &= MSE \left(\frac{1}{nb} - \frac{1}{nab} \right) \\ s^2_{\hat{\beta}_j} &= MSE \left(\frac{1}{na} - \frac{1}{nab} \right) \\ s^2_{(\hat{\alpha\beta})_{ij}} &= MSE \left(\frac{1}{n} - \frac{1}{na} - \frac{1}{nb} + \frac{1}{nab} \right) \end{aligned}


24.1.3.3.3 Partitioning the Total Sum of Squares

The total deviation of an observation from the overall mean can be decomposed as:

Y_{ijk} - \bar{Y}_{...} = (\bar{Y}_{ij.} - \bar{Y}_{...}) + (Y_{ijk} - \bar{Y}_{ij.})

where:

  • Y_{ijk} - \bar{Y}_{...}: Total deviation of an observation.
  • \bar{Y}_{ij.} - \bar{Y}_{...}: Deviation of treatment mean from the overall mean.
  • Y_{ijk} - \bar{Y}_{ij.}: Residual deviation of an observation from the treatment mean.

Summing over all observations:

\sum_i \sum_j \sum_k (Y_{ijk} - \bar{Y}_{...})^2 = n \sum_i \sum_j (\bar{Y}_{ij.} - \bar{Y}_{...})^2 + \sum_i \sum_j \sum_k (Y_{ijk} - \bar{Y}_{ij.})^2

Thus:

SSTO = SSTR + SSE

where:

  • SSTO = Total Sum of Squares (Total variation).
  • SSTR = Treatment Sum of Squares (Variation due to factor effects).
  • SSE = Error Sum of Squares (Residual variation).

Since the cross-product terms are 0, the model naturally partitions the variance.

From the factor effects model:

\bar{Y}_{ij.} - \bar{Y}_{...} = (\bar{Y}_{i..} - \bar{Y}_{...}) + (\bar{Y}_{.j.} - \bar{Y}_{...}) + (\bar{Y}_{ij.} - \bar{Y}_{i..} - \bar{Y}_{.j.} + \bar{Y}_{...})

Squaring and summing:

\begin{aligned} n\sum_i \sum_j (\bar{Y}_{ij.} - \bar{Y}_{...})^2 &= nb\sum_i (\bar{Y}_{i..} - \bar{Y}_{...})^2 + na\sum_j (\bar{Y}_{.j.} - \bar{Y}_{...})^2 \\ &+ n\sum_i \sum_j (\bar{Y}_{ij.} - \bar{Y}_{i..} - \bar{Y}_{.j.} + \bar{Y}_{...})^2 \end{aligned}

Thus, treatment sum of squares can be further partitioned as:

SSTR = SSA + SSB + SSAB

where:

  • SSA: Sum of Squares for Factor A.
  • SSB: Sum of Squares for Factor B.
  • SSAB: Sum of Squares for Interaction.

The interaction term can also be expressed as:

SSAB = SSTO - SSE - SSA - SSB

or equivalently:

SSAB = SSTR - SSA - SSB

where:

  • SSA measures the variability of the estimated factor A level means (\bar{Y}_{i..}). The more variable these means, the larger SSA.
  • SSB measures the variability of the estimated factor B level means (\bar{Y}_{.j.}).
  • SSAB measures the variability in interaction effects.

For Two-Factor ANOVA, the degrees of freedom partitioning follows:

Sum of Squares Degrees of Freedom (df)
SSTO (Total) N - 1 = abn - 1
SSTR (Treatments) ab - 1
SSE (Error) N - ab = ab(n - 1)
SSA (Factor A) a - 1
SSB (Factor B) b - 1
SSAB (Interaction) (a-1)(b-1)

Since:

SSTR = SSA + SSB + SSAB

the treatment degrees of freedom also partition as:

ab - 1 = (a - 1) + (b - 1) + (a - 1)(b - 1)

  • df_{SSA} = a - 1
    (One degree of freedom lost due to the constraint \sum (\bar{Y}_{i..} - \bar{Y}_{...}) = 0).
  • df_{SSB} = b - 1
    (One degree of freedom lost due to the constraint \sum (\bar{Y}_{.j.} - \bar{Y}_{...}) = 0).
  • df_{SSAB} = (a - 1)(b - 1)
    (Due to interaction constraints).

The Mean Squares are obtained by dividing Sum of Squares by the corresponding degrees of freedom:

\begin{aligned} MSA &= \frac{SSA}{a - 1} \\ MSB &= \frac{SSB}{b - 1} \\ MSAB &= \frac{SSAB}{(a - 1)(b - 1)} \end{aligned}

The expectations of the mean squares are:

\begin{aligned} E(MSE) &= \sigma^2 \\ E(MSA) &= \sigma^2 + nb \frac{\sum \alpha_i^2}{a - 1} = \sigma^2 + nb \frac{\sum (\mu_{i..} - \mu_{..})^2}{a - 1} \\ E(MSB) &= \sigma^2 + na \frac{\sum \beta_j^2}{b - 1} = \sigma^2 + na \frac{\sum (\mu_{.j.} - \mu_{..})^2}{b - 1} \\ E(MSAB) &= \sigma^2 + n \frac{\sum \sum (\alpha \beta)^2_{ij}}{(a-1)(b-1)} = \sigma^2 + n \frac{\sum (\mu_{ij} - \mu_{i..} - \mu_{.j.} + \mu_{..})^2}{(a - 1)(b - 1)} \end{aligned}

If Factor A has no effect (\mu_{i..} = \mu_{..}), then MSA and MSE have the same expectation.
Similarly, if Factor B has no effect, then MSB and MSE have the same expectation.

Thus, MSA > MSE and MSB > MSE suggest the presence of factor effects.


24.1.3.4 Testing for Interaction

Hypotheses:

\begin{aligned} H_0: \mu_{ij} - \mu_{i..} - \mu_{.j.} + \mu_{..} = 0 &\quad \text{(No interaction)} \\ H_a: \mu_{ij} - \mu_{i..} - \mu_{.j.} + \mu_{..} \neq 0 &\quad \text{(Interaction present)} \end{aligned}

or equivalently:

\begin{aligned} &H_0: \text{All } (\alpha \beta)_{ij} = 0 \\ &H_a: \text{Not all } (\alpha \beta)_{ij} = 0 \end{aligned}

The F-statistic is:

F = \frac{MSAB}{MSE}

Under H_0, F \sim F_{(a-1)(b-1), ab(n-1)}. Reject H_0 if:

F > F_{1-\alpha; (a-1)(b-1), ab(n-1)}


24.1.3.5 Two-Way ANOVA Summary Table

The Two-Way ANOVA table partitions the total variation into its components:

Source of Variation Sum of Squares (SS) Degrees of Freedom (df) Mean Square (MS) F-Statistic
Factor A SSA a-1 MSA = \frac{SSA}{a-1} F_A = \frac{MSA}{MSE}
Factor B SSB b-1 MSB = \frac{SSB}{b-1} F_B = \frac{MSB}{MSE}
Interaction (A × B) SSAB (a-1)(b-1) MSAB = \frac{SSAB}{(a-1)(b-1)} F_{AB} = \frac{MSAB}{MSE}
Error SSE ab(n-1) MSE = \frac{SSE}{ab(n-1)} -
Total (corrected) SSTO abn - 1 - -
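
A minimal sketch of the corresponding fit in R, reusing the hypothetical `d2` from the interaction-plot sketch above; the interaction row is examined first:

```r
# Balanced two-way fixed effects ANOVA: test the A:B interaction first
fit2 <- aov(y ~ A * B, data = d2)
summary(fit2)   # rows: A, B, A:B, Residuals with SS, df, MS, F, and p-values
```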

Interpreting Two-Way ANOVA Results

When conducting a Two-Way ANOVA, always check interaction effects first:

  1. If the interaction (A \times B) is significant:
    • The effect of one factor depends on the level of the other factor.
    • Main effects are not interpretable alone because their impact varies across levels of the second factor.
  2. If the interaction is NOT significant:
    • The factors have independent (additive) effects.
    • Main effects can be tested individually.

Post-Hoc Comparisons

  • If interaction is not significant, proceed with main effect comparisons using the multiple comparison procedures described earlier (e.g., Tukey, Scheffé, or Bonferroni).
  • If interaction is significant, post-hoc tests should examine simple effects (comparisons within each level of a factor).

24.1.3.5.1 Contrasts in Two-Way ANOVA

In Two-Way ANOVA, we can define contrasts to test specific hypotheses:

L = \sum c_i \mu_i, \quad \text{where } \sum c_i = 0

An unbiased estimator of L:

\hat{L} = \sum c_i \bar{Y}_{i..}

with variance:

\sigma^2(\hat{L}) = \frac{\sigma^2}{bn} \sum c_i^2

and variance estimate:

\frac{MSE}{bn} \sum c_i^2


24.1.3.5.1.1 Orthogonal Contrasts in Two-Way ANOVA

For two contrasts:

\begin{aligned} L_1 &= \sum c_i \mu_i, \quad \sum c_i = 0 \\ L_2 &= \sum d_i \mu_i, \quad \sum d_i = 0 \end{aligned}

They are orthogonal if:

\sum \frac{c_i d_i}{n_i} = 0

For balanced designs (n_i = n):

\sum c_i d_i = 0

This ensures that orthogonal contrasts are uncorrelated:

\begin{aligned} cov(\hat{L}_1, \hat{L}_2) &= cov\left(\sum_i c_i \bar{Y}_{i..}, \sum_l d_l \bar{Y}_{l..}\right) \\ &= \sum_i \sum_l c_i d_l cov(\bar{Y}_{i..},\bar{Y}_{l..}) \\ &= \sum_i c_i d_i \frac{\sigma^2}{bn} = 0 \end{aligned}

Thus, orthogonal contrasts allow us to partition the sum of squares further.


24.1.3.5.1.2 Orthogonal Polynomial Contrasts
  • Used when factor levels are equally spaced (e.g., dose levels: 0, 15, 30, 45, 60).
  • Requires equal sample sizes across factor levels.

The Sum of Squares (SS) for a given contrast:

SS_L = \frac{\hat{L}^2}{\sum_{i=1}^a \frac{c^2_i}{bn_i}}

The t-statistic for testing contrasts:

T = \frac{\hat{L}}{\sqrt{MSE \sum_{i=1}^a \frac{c_i^2}{bn_i}}} \sim t

Since:

t^2_{(1-\alpha/2; df)} = F_{(1-\alpha; 1, df)}

we can equivalently test:

\frac{SS_L}{MSE} \sim F_{(1-\alpha;1,df_{MSE})}

All contrasts have df = 1.


24.1.3.6 Unbalanced Two-Way ANOVA

In many practical situations, sample sizes may be unequal across factor combinations, such as in:

  • Observational studies (e.g., real-world data with missing values).
  • Dropouts in designed studies (e.g., clinical trials with subject attrition).
  • Larger sample sizes for inexpensive treatments.
  • Sample sizes chosen to match population proportions.

We assume the standard Two-Way ANOVA model:

Y_{ijk} = \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk}

where sample sizes vary:

\begin{aligned} n_{i.} &= \sum_j n_{ij} \quad \text{(Total for factor level } i) \\ n_{.j} &= \sum_i n_{ij} \quad \text{(Total for factor level } j) \\ n_T &= \sum_i \sum_j n_{ij} \quad \text{(Total sample size)} \end{aligned}

However, for unbalanced designs, a major issue arises:

SSTO \neq SSA + SSB + SSAB + SSE

Unlike the balanced case, the design is non-orthogonal, meaning sum-of-squares partitions do not add up cleanly.


24.1.3.6.1 Indicator Variables for Factor Levels

To handle unbalanced data, we use indicator (dummy) variables as predictors.

For Factor A (i = 1, \dots, a-1):

u_i = \begin{cases} +1 & \text{if observation is from level } i \text{ of Factor A} \\ -1 & \text{if observation is from the reference level (level } a \text{)} \\ 0 & \text{otherwise} \end{cases}

For Factor B (j = 1, \dots, b-1):

v_j = \begin{cases} +1 & \text{if observation is from level } j \text{ of Factor B} \\ -1 & \text{if observation is from the reference level (level } b \text{)} \\ 0 & \text{otherwise} \end{cases}

Rewriting the ANOVA model using indicator variables:

Y = \mu_{..} + \sum_{i=1}^{a-1} \alpha_i u_i + \sum_{j=1}^{b-1} \beta_j v_j + \sum_{i=1}^{a-1} \sum_{j=1}^{b-1}(\alpha \beta)_{ij} u_i v_j + \epsilon

Here, the unknown parameters are:

  • \mu_{..} (grand mean),

  • \alpha_i (main effects for Factor A),

  • \beta_j (main effects for Factor B),

  • (\alpha \beta)_{ij} (interaction effects).


24.1.3.6.2 Hypothesis Testing Using Extra Sum of Squares

For unbalanced designs, we use sequential (type I) or adjusted (type III) sum of squares to test hypotheses.

To test for interaction effects, we test:

\begin{aligned} &H_0: \text{All } (\alpha \beta)_{ij} = 0 \quad \text{(No interaction)} \\ &H_a: \text{Not all } (\alpha \beta)_{ij} = 0 \quad \text{(Interaction present)} \end{aligned}

To test whether Factor B has an effect:

\begin{aligned} &H_0: \beta_1 = \beta_2 = \dots = \beta_b = 0 \\ &H_a: \text{At least one } \beta_j \neq 0 \end{aligned}
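
A sketch contrasting sequential and adjusted tests on an artificially unbalanced version of the hypothetical `d2`; the car package is assumed for Type III sums of squares, and sum-to-zero contrasts are set so the Type III tests are meaningful:

```r
# Unbalanced two-way ANOVA: Type I (sequential) vs. Type III (adjusted) tests
library(car)

d2u <- d2[-c(1, 2, 7), ]                  # drop a few rows to unbalance the design
fit_u <- lm(y ~ A * B, data = d2u,
            contrasts = list(A = contr.sum, B = contr.sum))

anova(fit_u)            # Type I: order-dependent, sequential sums of squares
Anova(fit_u, type = 3)  # Type III: each term adjusted for all other terms
```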


24.1.3.6.3 Factor Mean Analysis and Contrasts

Factor means and contrasts (e.g., pairwise comparisons) work similarly to the balanced case but require adjustments due to unequal sample sizes.

The variance estimate for a contrast:

\sigma^2(\hat{L}) = \frac{\sigma^2}{\sum n_{ij}} \sum c_i^2

is modified to:

\frac{MSE}{\sum n_{ij}} \sum c_i^2

Orthogonal contrasts are harder to define because unequal sample sizes break orthogonality.


24.1.3.6.4 Regression Approach to Unbalanced ANOVA

An alternative is to fit the cell means model as a regression model:

Y_{ij} = \mu_{ij} + \epsilon_{ij}

which allows us to analyze each treatment mean separately.

However, if there are empty cells (some factor combinations have no observations), the regression approach fails, and only partial analyses can be conducted.


24.1.4 Two-Way Random Effects ANOVA

The Two-Way Random Effects ANOVA assumes that both Factor A and Factor B levels are randomly sampled from larger populations.

The model is:

Y_{ijk} = \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk}

where:

  • \mu_{..}: Overall mean (constant).
  • \alpha_i \sim N(0, \sigma^2_{\alpha}) for i = 1, \dots, a (random effects for Factor A, independently distributed).
  • \beta_j \sim N(0, \sigma^2_{\beta}) for j = 1, \dots, b (random effects for Factor B, independently distributed).
  • (\alpha \beta)_{ij} \sim N(0, \sigma^2_{\alpha \beta}) for i = 1, \dots, a, j = 1, \dots, b (random interaction effects, independently distributed).
  • \epsilon_{ijk} \sim N(0, \sigma^2) (random error, independently distributed).

Additionally, all random effects (\alpha_i, \beta_j, (\alpha \beta)_{ij}) and error terms (\epsilon_{ijk}) are mutually independent.


24.1.4.1 Expectation

Taking expectations on both sides:

E(Y_{ijk}) = E(\mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk})

Since all random effects have mean zero:

E(Y_{ijk}) = \mu_{..}

Thus, the mean response across all factor levels is \mu_{..}.


24.1.4.2 Variance

The total variance of observations is the sum of all variance components:

\begin{aligned} var(Y_{ijk}) &= var(\alpha_i) + var(\beta_j) + var((\alpha \beta)_{ij}) + var(\epsilon_{ijk}) \\ &= \sigma^2_{\alpha} + \sigma^2_{\beta} + \sigma^2_{\alpha \beta} + \sigma^2 \end{aligned}

Thus:

Y_{ijk} \sim N(\mu_{..}, \sigma^2_{\alpha} + \sigma^2_{\beta} + \sigma^2_{\alpha \beta} + \sigma^2)


24.1.4.3 Covariance Structure

In random effects models, observations are correlated if they share the same factor levels.

Case 1: Same factor A, different factor B

If i is the same but j \neq j', then:

cov(Y_{ijk}, Y_{ij'k'}) = var(\alpha_i) = \sigma^2_{\alpha}

Case 2: Same factor B, different factor A

If j is the same but i \neq i', then:

cov(Y_{ijk}, Y_{i'jk'}) = var(\beta_j) = \sigma^2_{\beta}

Case 3: Same factor A and B, different replication

If both factor levels are the same (i, j fixed), but different replication (k \neq k'):

cov(Y_{ijk}, Y_{ijk'}) = var(\alpha_i) + var(\beta_j) + var((\alpha \beta)_{ij}) = \sigma^2_{\alpha} + \sigma^2_{\beta} + \sigma^2_{\alpha \beta}

Case 4: Completely different factor levels

If neither factor A nor B is the same (i \neq i', j \neq j'), then:

cov(Y_{ijk}, Y_{i'j'k'}) = 0

since all random effects are independent across different factor levels.


Summary of Variance-Covariance Structure

Case Condition Covariance
Same factor A, different factor B i same, j \neq j' \sigma^2_{\alpha}
Same factor B, different factor A j same, i \neq i' \sigma^2_{\beta}
Same factor levels, different replications i same, j same, k \neq k' \sigma^2_{\alpha} + \sigma^2_{\beta} + \sigma^2_{\alpha \beta}
Different factor levels i \neq i', j \neq j' 0

24.1.5 Two-Way Mixed Effects ANOVA

In a Two-Way Mixed Effects Model, one factor is fixed, while the other is random.
This is often referred to as a mixed effects model or simply a mixed model.

24.1.5.1 Balanced

For a balanced design, the restricted mixed model is:

Y_{ijk} = \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk}

where:

  • \mu_{..}: Overall mean (constant).
  • \alpha_i: Fixed effects for Factor A, subject to the constraint \sum \alpha_i = 0.
  • \beta_j \sim N(0, \sigma^2_\beta) (random effects for Factor B).
  • (\alpha \beta)_{ij} \sim N(0, \frac{a-1}{a} \sigma^2_{\alpha \beta})
    (interaction effects, constrained so that \sum_i (\alpha \beta)_{ij} = 0 for all j). The variance is written with the \frac{a-1}{a} factor for convenience; it makes the expected mean squares simpler.
  • \epsilon_{ijk} \sim N(0, \sigma^2) (random error).
  • \beta_j, (\alpha \beta)_{ij}, \epsilon_{ijk} are pairwise independent.

The restriction on interaction variance (\frac{a-1}{a} \sigma^2_{\alpha \beta}) simplifies the expected mean squares, though some sources assume var((\alpha \beta)_{ij}) = \sigma^2_{\alpha \beta}.


An unrestricted version of the model removes constraints on interaction terms.

Define:

\begin{aligned} \beta_j &= \beta_j^* + (\overline{\alpha \beta})_{.j}^* \\ (\alpha \beta)_{ij} &= (\alpha \beta)_{ij}^* - (\overline{\alpha \beta})_{.j}^* \end{aligned}

where \beta^* and (\alpha \beta)^*_{ij} are unrestricted random effects.

Some consider the restricted model more general, but we use the restricted form for simplicity.


Taking expectations:

E(Y_{ijk}) = \mu_{..} + \alpha_i

The total variance of responses:

var(Y_{ijk}) = \sigma^2_\beta + \frac{a-1}{a} \sigma^2_{\alpha \beta} + \sigma^2


Covariance Structure

Observations sharing the same random factor (B) level are correlated.

Covariances for Different Cases

Condition Covariance
Same i, j, different replications (k \neq k') cov (Y_{ijk}, Y_{ijk'}) = \sigma^2_\beta + \frac{a-1}{a} \sigma^2_{\alpha \beta}
Same j, different i (i \neq i') cov(Y_{ijk}, Y_{i'jk'}) = \sigma^2_\beta - \frac{1}{a} \sigma^2_{\alpha \beta}
Different i and j (i \neq i', j \neq j') cov(Y_{ijk}, Y_{i'j'k'}) = 0

Thus, observations only become independent when they do not share the same random effect.

An advantage of the restricted mixed model is that 2 observations from the same random factor (B) level can be positively or negatively correlated. In the unrestricted model, they can only be positively correlated.


Comparison of Fixed, Random, and Mixed Effects Models

Mean Square Fixed ANOVA (A, B fixed) Random ANOVA (A, B random) Mixed ANOVA (A fixed, B random)
MSA \sigma^2 + n b \frac{\sum \alpha_i^2}{a-1} \sigma^2 + n b \sigma^2_{\alpha} \sigma^2 + n b \frac{\sum_{i = 1}^a \alpha_i^2}{a-1} + n \sigma^2_{\alpha \beta}
MSB \sigma^2 + n a \frac{\sum \beta_j^2}{b-1} \sigma^2 + n a \sigma^2_{\beta} \sigma^2 + n a \sigma^2_{\beta} + n \sigma^2_{\alpha \beta}
MSAB \sigma^2 + n \frac{\sum (\alpha \beta)_{ij}^2}{(a-1)(b-1)} \sigma^2 + n \sigma^2_{\alpha \beta} \sigma^2 + n \sigma^2_{\alpha \beta}
MSE \sigma^2 \sigma^2 \sigma^2

While SS and df are identical across models, the expected mean squares differ, affecting test statistics.


24.1.5.1.1 Hypothesis Testing in Mixed ANOVA

In random ANOVA, we test:

\begin{aligned} H_0: \sigma^2_{\alpha} = 0 \quad vs. \quad H_a: \sigma^2_{\alpha} > 0 \end{aligned}

using:

F = \frac{MSA}{MSAB} \sim F_{a-1, (a-1)(b-1)}

For mixed models, the same test statistic is used for:

H_0: \alpha_i = 0, \quad \forall i

However, for fixed effects models, the test statistic differs.

Test for Effect of Fixed ANOVA (A, B fixed) Random ANOVA (A, B random) Mixed ANOVA (A fixed, B random)
Factor A \frac{MSA}{MSE} \frac{MSA}{MSAB} \frac{MSA}{MSAB}
Factor B \frac{MSB}{MSE} \frac{MSB}{MSAB} \frac{MSB}{MSE}
Interaction (A × B) \frac{MSAB}{MSE} \frac{MSAB}{MSE} \frac{MSAB}{MSE}

24.1.5.1.2 Variance Component Estimation

In random and mixed effects models, we are interested in estimating variance components.

To estimate \sigma^2_\beta:

\frac{E(MSB) - E(MSE)}{na} = \frac{(\sigma^2 + na \sigma^2_\beta) - \sigma^2}{na} = \sigma^2_\beta

which is estimated by:

\hat{\sigma}^2_\beta = \frac{MSB - MSE}{na}
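
In practice, variance components in mixed models are usually estimated by (restricted) maximum likelihood rather than by the method of moments above; a sketch assuming the lme4 package, with a hypothetical data set in which A is fixed and B is a random sample of levels:

```r
# Hypothetical data: A fixed with 3 levels, B random with 6 sampled levels, n = 4
set.seed(3)
dm <- expand.grid(A = factor(1:3), B = factor(1:6), rep = 1:4)
dm$y <- 5 + as.numeric(dm$A) + rnorm(6, sd = 1.5)[dm$B] + rnorm(nrow(dm))

library(lme4)
fit_mix <- lmer(y ~ A + (1 | B) + (1 | A:B), data = dm)  # REML by default
VarCorr(fit_mix)   # estimated sigma^2_beta, sigma^2_alpha-beta, and sigma^2
fixef(fit_mix)     # estimated fixed effects for A (treatment coding)
```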

Confidence intervals for variance components can be approximated using the Satterthwaite procedure described earlier.


24.1.5.1.3 Estimating Fixed Effects in Mixed Models

Fixed effects \alpha_i are estimated by:

\begin{aligned} \hat{\alpha}_i &= \bar{Y}_{i..} - \bar{Y}_{...} \\ \hat{\mu}_{i.} &= \bar{Y}_{...} + (\bar{Y}_{i..} - \bar{Y}_{...}) = \bar{Y}_{i..} \end{aligned}

Their variances:

\begin{aligned} \sigma^2(\hat{\alpha}_i) &= \frac{\sigma^2 + n \sigma^2_{\alpha \beta}}{bn} = \frac{E(MSAB)}{bn} \\ s^2(\hat{\alpha}_i) &= \frac{MSAB}{bn} \end{aligned}


24.1.5.1.4 Contrasts on Fixed Effects

For a contrast:

L = \sum c_i \alpha_i, \quad \text{where } \sum c_i = 0

Estimate:

\hat{L} = \sum c_i \hat{\alpha}_i

Variance:

\sigma^2(\hat{L}) = \sum c^2_i \sigma^2(\hat{\alpha}_i), \quad s^2(\hat{L}) = \frac{MSAB}{bn} \sum c^2_i


24.1.5.2 Unbalanced Two-Way Mixed Effects ANOVA

In an unbalanced two-way mixed model (e.g., a = 2, b = 4), the model remains:

Y_{ijk} = \mu_{..} + \alpha_i + \beta_j + (\alpha \beta)_{ij} + \epsilon_{ijk}

where:

  • \alpha_i: Fixed effects for Factor A.
  • \beta_j \sim N(0, \sigma^2_\beta): Random effects for Factor B.
  • (\alpha \beta)_{ij} \sim N(0, \frac{\sigma^2_{\alpha \beta}}{2}): Interaction effects.
  • \epsilon_{ijk} \sim N(0, \sigma^2): Residual error.

24.1.5.2.1 Variance Components

The variance components are:

\begin{aligned} var(\beta_j) &= \sigma^2_\beta \\ var((\alpha \beta)_{ij}) &= \frac{2-1}{2} \sigma^2_{\alpha \beta} = \frac{\sigma^2_{\alpha \beta}}{2} \\ var(\epsilon_{ijk}) &= \sigma^2 \end{aligned}

24.1.5.2.2 Expectation and Variance

Taking expectations:

E(Y_{ijk}) = \mu_{..} + \alpha_i

Total variance:

var(Y_{ijk}) = \sigma^2_{\beta} + \frac{\sigma^2_{\alpha \beta}}{2} + \sigma^2


24.1.5.2.3 Covariance Structure

Observations sharing Factor B (random effect) are correlated.

Covariances for Different Cases

Condition Covariance
Same i, j, different replications (k \neq k') cov(Y_{ijk}, Y_{ijk'}) = \sigma^2_{\beta} + \frac{\sigma^2_{\alpha \beta}}{2}
Same j, different i (i \neq i') cov (Y_{ijk}, Y_{i'jk'}) = \sigma^2_{\beta} - \frac{\sigma^2_{\alpha \beta}}{2}
Different i and j (i \neq i', j \neq j') cov(Y_{ijk}, Y_{i'j'k'}) = 0

Thus, only observations within the same random factor level share dependence.


24.1.5.2.4 Matrix Representation

Assume:

\mathbf{Y} \sim N(\mathbf{X} \beta, M)

where:

  • \mathbf{X}: Fixed effects design matrix.
  • \beta: Fixed effect coefficients.
  • M: Block diagonal covariance matrix containing variance components.

The density function of \mathbf{Y} is:

f(\mathbf{Y}) = \frac{1}{(2\pi)^{N/2} |M|^{1/2}} \exp \left( -\frac{1}{2} (\mathbf{Y} - \mathbf{X} \beta)' M^{-1} (\mathbf{Y} - \mathbf{X} \beta) \right)

If variance components were known, we could use Generalized Least Squares:

\hat{\beta}_{GLS} = (\mathbf{X}' M^{-1} \mathbf{X})^{-1} \mathbf{X}' M^{-1} \mathbf{Y}

However, since the variance components (\sigma^2, \sigma^2_\beta, \sigma^2_{\alpha \beta}) are unknown, we estimate them by maximizing the log-likelihood:

\ln L = - \frac{N}{2} \ln (2\pi) - \frac{1}{2} \ln |M| - \frac{1}{2} (\mathbf{Y} - \mathbf{X} \beta)' M^{-1} (\mathbf{Y} - \mathbf{X} \beta)

where:

  • |M|: Determinant of the variance-covariance matrix.
  • (\mathbf{Y} - \mathbf{X} \beta)' M^{-1} (\mathbf{Y} - \mathbf{X} \beta): Quadratic form in the likelihood.