Prediction interval (PI) for Y given x

The quantity of interest here is a future observation of $Y$ , $Y_f$ say, when $x$ takes the value $x_f$ , which denotes the value of the explanatory variable at the position where a prediction of $Y$ is required. Note that the observation $(Y_f, x_f)$ was not used to construct the regression model or estimate regression parameters. The expected value $E(Y|x_f)$ can be written in the form $\mathbf{b}_f^T\boldsymbol{\beta}$ . For example, with a simple linear regression, $E(Y) = \alpha+\beta x$ , we can write $E(Y|x_f)=\alpha+\beta x_f = \mathbf{b}^T\boldsymbol{\beta}$ , where $\mathbf{b}^T = (1, x_f)$ and $\boldsymbol{\beta}^T = (\alpha,\beta)$ .

A prediction interval for $y_f$ is

$\mathbf{b}^T\boldsymbol{\hat{\beta}}\pm t\left(n-p; \frac{1+c}{2}\right)\sqrt{\frac{RSS}{n-p}(1+\mathbf{b}^T(\mathbf{X}^T\mathbf{X})^{-1}\mathbf{b})}.$

Simple linear regression

Construct a 95% C.I. (confidence interval) of $\beta$ in the model $y_i = \alpha+\beta x_{1i}+\gamma x_{2i}+\epsilon_i$ . We can write this C.I. as

$\hat{\beta} \pm t(n-p; 0.975)\mathrm{s.e.}(\hat{\beta})$

where

$\mathrm{s.e.}(\hat{\beta}) = \sqrt{\frac{RSS}{n-p}\mathbf{b}^T(\mathbf{X}^T\mathbf{X})^{-1}\mathbf{b}}$

and

$\mathbf{b} = \left( \begin{array}{c} 0 \\ 1 \\ 0\\ \end{array} \right)$

R automatically prints the standard error (s.e.) of each individual parameter when it fits a regression model. This makes the construction of C.I.s for each parameter very easy given these estimates.

A parameter estimate is a random variable, since it can take several values, and in probability terms the estimated standard error of a parameter estimate is an estimate of its standard deviation.