24 More details about CIs

So far, you have learnt to ask a RQ, design a study, classify and summarise the data, and construct some confidence intervals. In this chapter, you will learn more about forming confidence intervals. You will learn to

  • communicate confidence intervals.
  • interpret confidence intervals.

24.1 General comments

The previous chapters discussed forming confidence intervals (CI) for estimating a population proportion, and for estimating a population mean. CIs in other contexts will also be studied (Chaps. 29 to 31). This chapter discusses some principles that apply to CIs in general:

  • statistical validity (Sect. 24.2).
  • writing conclusions (Sect. 24.3).
  • interpreting CIs (Sect. 24.4).

CIs are formed for an unknown population parameter (such as the population proportion p), using the best estimate of the parameter: the sample statistic (such as the sample proportion p^). When the sampling distribution of the statistic has an approximate normal distribution, CIs have the form statistic±(multiplier×standard error), where (multiplier×standard error) is called the margin of error. For an approximate 95% CI, the multiplier is 2 (from the 68--95--99.7 rule), provided the statistical validity conditions are met. To compute CIs other than 95% CIs (such as 99% CIs), and for exact 95% CIs, software is used.

Confidence intervals tell us about the unknown population parameter, based on what we learn from one of the countless possible sample statistics.

24.2 More details about statistical validity

When constructing confidence intervals, statistical validity conditions must be true to ensure the mathematics behind the calculations are sound. For instance, many CIs assume the sampling distribution has a normal distribution (so that, for example, the 68--95--99.7 rule can be used); the statistical validity conditions state the conditions under which the sampling distribution has an approximate normal distribution. If these conditions are not met, the sampling distribution may not be close to an approximate normal distribution, so the 68--95--99.7 rule (on which the CI is based) may not be appropriate, and the CI itself may be inappropriate. Of course, if the statistical validity conditions are close to be satisfied, then the resulting confidence interval will still be reasonably useful.

Besides checking the statistical validity conditions, the internal validity and external validity of the study should be discussed (Fig. 24.1). In addition, CIs also require that the sample size is less than about 10% of the population size; this is almost always the case.

Four types of validities for studies.

FIGURE 24.1: Four types of validities for studies.

24.3 More details about writing conclusions

When reporting a CI, include:

  1. the CI (including units of measurement, if relevant);
  2. the level of confidence for the CI (typically, a 95% CI); and
  3. the value of the statistic (the parameter estimate) and the sample size.

If the CI is an approximate CI (e.g., based on using an approximate multiplier of 2 from the 68--95--99.7 rule), this should also be clear.

Example 24.1 (Writing conclusions) In Sect. 23.6, the mean cadmium level of peanuts was estimated. The conclusion given was:

The sample mean cadmium concentration of peanuts is x¯=0.0768ppm (n=290), with an approximate 95% CI from 0.0714 to 0.0822ppm.

Each of the three elements above are given:

  1. the CI: 0.0714 to 0.0822ppm;
  2. the level of confidence for the CI: 95%; and
  3. the value of the statistic: x¯=0.0768ppm.

In addition, the CI is flagged as an approximate 95% CI.

24.4 More details about interpreting CIs

Interpreting CIs correctly takes care. The correct interpretation (Def. 22.4) of a 95% CI is:

The CI is an interval which contains the unknown parameter 95% of the time (over repeated sampling).

That is, if we repeated the process (of selecting a sample of 290 peanuts and computing the CI for each sample) numerous times, 95% of those confidence intervals formed would contain the value of the parameter. This is the idea shown in the animation in Sect. 22.5.

In practice, this definition is unsatisfying, since we only ever have one sample, not many samples. Furthermore, since the value of the parameter is unknown (after all, the reason for taking a sample was to estimate the value of the parameter), we don't know if the CI from our single sample straddles the population parameter or not.

Two reasonable alternative interpretations for a 95% CI are:

  • The 95% CI gives a range of values of the unknown parameter that could reasonably (with 95% confidence) have produced our observed value of the statistic.
  • There is a 95% chance that our 95% CI straddles the value of the parameter.

These alternatives are adequate and common interpretations.

Frequently, the CI is described as having a 95% chance of containing the population parameter. This is not strictly correct (the CI either does or does not contain the value of the population parameter), but is a common and a brief paraphrase for the correct interpretation above.

I use this analogy: most people say the sun rises in the east. This is incorrect; the sun doesn't rise at all. People say the sun rises in the east as a convenient way to explain that we see the sun each morning in the east as the earth rotates. Similarly, most people interpret a CI as an interval with a certain chance of containing the value of the population parameter, even though it is technically incorrect.

Example 24.2 (Interpreting CIs) In Example 24.1, the approximate 95% CI was from 0.0714 to 0.0822. The correct interpretation is:

If many samples of 290 peanuts were taken, and the approximate 95% CI computed for each one, about 95% of those CIs would contain the population mean.

Our CI may or may not include the value of μ, however. We might say:

This 95% CI (from 0.0714 to 0.0822ppm) has a 95% chance of straddling the actual value of μ.

or

The range of values of μ that could plausibly (with 95% confidence) have produced x¯=0.0768 is between 0.0714 to 0.0822ppm.

In practice, the CI is usually interpreted as:

There is a 95% chance that the population mean level of cadmium in peanuts is between 0.0714 to 0.0822ppm.

This last statement is not strictly correct, but is commonly-used, and sufficient for our use.

24.5 Chapter summary

Confidence intervals (or CIs) tell us about the unknown population parameter, based on what we learn from one the countless possible sample statistics. CIs give an interval in which a parameter is likely to lie over repeated sampling. Since we only ever have one sample, two reasonable alternative interpretations for a 95% CI are:

  • The 95% CI gives a range of values of the unknown parameter that could reasonably (with 95% confidence) have produced our observed value of the statistic.
  • There is a 95% chance that our 95% CI straddles the value of the parameter.

We never know if the CI from our single sample includes the population parameter or not. When reporting a CI, include:

  1. the CI (including units of measurement, if relevant);
  2. the level of confidence for the CI (typically, a 95% CI); and
  3. the value of the statistic (the parameter estimate) and the sample size.

24.6 Quick revision exercises

Are the following statements true or false?

  1. CIs always have 95% confidence.
  2. Statistical validity concern generalisability of the results.
  3. CIs always include the value of the population parameter.
  4. All other things being equal, a 95% CI is wider than a 90% CI.
  5. The 'multiplier times the standard error' is called the margin of error.
  6. We are fairly sure (but not certain) that the CI includes the value of the statistic.
0 of 6 correct

24.7 Exercises

Answers to odd-numbered exercises are given at the end of the book.

Exercise 24.1 Hirst and Stedman (1962) computed a 95% CI to estimate the proportion of trees with apple scab, and found p^=0.314 and s.e.(p^)=0.091. What would be wrong with the following conclusions?

  • An approximate 95% CI for the sample proportion is between 0.223 and 0.405.
  • This CI means we are 95% confident that between 22.3% and 40.5% trees are infected with apple scab.

Exercise 24.2 Fayet-Moore et al. (2017) studied the snacking habits of Australian children. In 2007 (for which n=3 637), the CI for the proportion of children snacking ('an eating occasion that occurred between meals based on time of day'; p. 1) was 0.981±0.003 in 2007. What would be wrong with the following conclusion?

An approximate 95% CI for the sample proportion of snacks (in 2007) is 0.981±0.003.

Exercise 24.3 Guirao et al. (2017) studied how far amputees, following a femoral (leg) implant, could walk in two minutes. After 14 months, the sample of ten amputees walked a mean of 122.5m; the 95% CI was computed as 96.4m to 148.6m. What would be wrong with the following conclusions?

  • Approximately 95% of the amputees walked between 96.4 and 1488.6m in two minutes.
  • The 95% CI for the sample mean distance walked in two minutes was between 96.4 and 1488.6m.

Exercise 24.4 A study of sodium intake in Thailand found the 95% CI for the mean daily sodium intake for subjects with a secondary school education was 3565 to 3903mg. What would be wrong with the following conclusions?

  • This CI means that approximately 95% of the subjects had a daily sodium intake between 3565 to 3903mg.
  • A 95% CI for the sample mean daily sodium intake is between 3565 to 3903mg.

Exercise 24.5 In discussing the weight of adult male Leadbeater's possums, J. L. Williams et al. (2022) state (p. 170):

The average adult male Leadbeater’s possum weighed 137g (95% CI = 135g, 139g), with 90% of weights between 122 and 153g.

Figure 22.5 indicates that a higher value for the confidence level means wider confidence intervals, since wider intervals are needed to be more certain that the interval contains the value of the parameter that produced the value of the statistic.

In light of this, explain why the 90% interval is wider than the 95% interval in the above quote.