We have estimated the population mean of a Normal distribution, assuming known population standard deviation \(\sigma\), using grid approximation in Chapter 8 and posterior simulation in Section 14.1. We will relax the unreasonable assumption of known standard deviation soon. But before doing that, we will briefly investigate one more case where the posterior distribution can be derived analytically, in a form that makes explicit the relative influence of prior and likelihood.
The precision of a Normal distribution is the reciprocal of its variance.
If \(y\) follows a Normal\((\theta, \sigma)\) distribution, the precision in a single measurement of \(y\) is \(1/\sigma^2\).
For a random sample of \(n\) values of the variable \(y\), the sample mean \(\bar{y}\) follows a Normal distribution with standard deviation \(\sigma/\sqrt{n}\) and precision \(n/\sigma^2\). That is, the precision in \(n\) independent data values is \(n\) times the precision in a single value.
For the posterior distribution in the Normal-Normal model, there is an intuitive interpretation of the compromise between prior and likelihood.
The posterior precision is the sum of the prior precision and the data precision.
The posterior mean is a weighted average of the prior mean and the sample mean, with weights proportional to the precisions.
Example 16.1 Assume body temperatures (degrees Fahrenheit) of healthy adults follow a Normal distribution with unknown mean \(\theta\) and known standard deviation \(\sigma=1\). (It’s unrealistic to assume the population standard deviation is known. We’ll consider the case of unknown standard deviation later.) Suppose we wish to estimate \(\theta\), the population mean healthy human body temperature.
Assume the prior distribution of \(\theta\) is Normal with mean 98.6 and standard deviation 0.7.
The sample mean body temperature in a sample of 208 healthy adults is 97.7 degrees F.
Use the Normal-Normal model to identify the posterior distribution of \(\theta\). (Compare to the results of Chapter 8 and Section 14.1.)
Find a 98% posterior credible interval.
Considering each of the following changes in isolation, indicate if (1) the center and (2) the width of the posterior credible interval would be greater than, less than, or equal to the respective values from the previous part.
Credibility is 80%
Sample size is 20
Sample mean is 98.0
Prior mean is 98.0
Prior SD is 0.1
Population SD is \(\sigma = 0.5\)
16.1 Notes
The code below just plugs the numbers into the above Normal-Normal model formulas.