25 Making decisions

So far, you have learnt to ask an RQ, design a study, describe and summarise the data, and form confidence intervals. In this chapter, you will learn how decisions are made in science, so you can answer RQs. You will learn to:

  • state the two broad reasons that could explain the difference between the values of statistics and parameters.
  • explain how decisions are made in research.

25.1 Introduction

In Sect. 17.4, a study of three rural communities in Cameroon and their access to water was described (López-Serrano et al. 2022). One purpose of the study was to determine contributors to the incidence of diarrhoea in young children.

In the observed sample, the odds of an incidence of diarrhoea in children in the last two weeks among households without livestock was \(0.176\) (Table 14.3). However, the odds in households with livestock was \(0.548\). In other words, the odds of an incidence of diarrhoea in young children was more than three times as large in households with livestock as in households without (i.e., \(\text{OR} = 0.548/0.176 = 3.11\)). That is, the odds are not the same for the two groups in the sample; however, RQs ask about the population.

Of course, the sample studied is just one of countless possible samples that could have been chosen to study. Some other samples may even have produced the opposite result: a greater incidence of diarrhoea in households without livestock. Different samples comprise different households, and the sample results depend on which households happen to be in the studied sample. This leads to one of the most important observations about sampling.

Studying a sample leads to the following observations:

  • Every sample is likely to be different.
  • We observe just one of the many possible samples.
  • Every sample is likely to yield a different value for the sample statistic.
  • We observe just one of the many possible values for the statistic.

Since many samples are possible, many values of the statistic are possible too: the possible values of the statistic vary (this is called sampling variation) and have a distribution (this is called a sampling distribution).

We observe just one of the many possible samples, and in the observed sample the odds of an incidence of diarrhoea in young children are not the same for the two groups. What does this imply about the population odds in the two groups?

Two reasons could explain why the sample odds in the two groups are not the same:

  1. The two population odds are the same, but the two sample odds are not the same due to sampling variation. That is, we just happen to have---by chance---one of those samples where the two sample odds are not the same, even though they are the same in the population.
  2. The two population odds are different, and the different sample odds simply reflect the situation in the population.

These two possible explanations ('statistical hypotheses') have special names:

  1. The first explanation is the null hypothesis, denoted \(H_0\): the population odds are the same, and the sample odds are not equal simply due to sampling variation.

  2. The second explanation is the alternative hypothesis, denoted \(H_1\): the population odds are not the same, which is reflected in the sample odds.

How do we decide which of these explanations is supported by the data? What is the decision-making process?

The usual approach to decision-making in science begins by assuming the null hypothesis (the sampling-variation explanation) is true. Then the data are examined to see if persuasive evidence exists to change our mind (and support the alternative hypothesis). Conclusions drawn about the population from the sample can never be certain, since the sample studied is just one of many possible samples that could have been taken (and every sample is likely to be different).

The onus is on the data to refute the null hypothesis. That is, the null hypothesis is retained unless persuasive evidence suggests otherwise.

25.2 Making decisions: drawing cards

In research, decisions need to be made about parameters, based on just one of the many possible values of the statistic. Sensible decisions can be made (and are made) about parameters based on statistics. To do so, though, the process of how decisions are made needs to be articulated, which is the purpose of this chapter.

To begin, suppose I produce a pack of cards and shuffle it well, and I am interested in the event 'the number of red cards when I draw \(15\) cards from the pack (with replacement)'. ('With replacement' means that after I draw a card and record the card drawn, I place the card back into the pack, and reshuffle before drawing the next card.) The pack of cards can be considered the population (where the proportion of red cards is \(p = 0.5\)). Suppose I draw a sample of \(15\) cards from the pack. Define \(\hat{p}\) as the proportion of red cards in this sample.

Suppose my sample of \(15\) cards produces \(\hat{p} = 1\); that is, all \(n = 15\) cards are red cards. What should you conclude? How likely is it that this would happen by chance from a fair pack (see the animation below)? Is this evidence that the pack of cards is somehow unfair, poorly shuffled, or manipulated?

Getting \(15\) red cards out of \(15\) (i.e., \(\hat{p} = 1\)) from a well-shuffled pack seems very unlikely; you would probably conclude that the pack is somehow unfair. Importantly, how did you reach that decision? Your unconscious decision-making process may have been this:

  1. You assumed, quite reasonably, that I used a standard, well-shuffled pack of cards, where half the cards are red and half the cards are black. That is, you assumed the population proportion of red cards really was \(p = 0.5\).
  2. Based on that assumption, you expected about half the cards in the sample of \(15\) to be red (i.e., expect \(\hat{p}\) to be about \(0.5\)). You wouldn't necessarily expect exactly half red and half black (because of sampling variation), but you'd probably expect something close to that. That is, you would expect that \(\hat{p}\) would be close to \(0.5\).
  3. You then observed that all \(15\) cards were red. That is, \(\hat{p} = 1\).
  4. You were expecting \(\hat{p} = 0.5\) approximately, but instead observed \(\hat{p} = 1\). What you observed ('all red cards') was not at all like what you were expecting ('about half red cards'); the sample contradicts what you were expecting from a fair pack. This suggests your assumption of a fair pack is probably wrong.

Of course, getting \(15\) red cards in a row is possible; it's just very unlikely. For this reason, you would probably conclude that there is persuasive evidence that the pack is not fair.

You probably didn't consciously go through this process. This process of decision-making is similar to the process used in research.
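To get a feel for just how unlikely 'all \(15\) cards red' is from a fair pack, the sketch below simulates the experiment. (The code is an illustrative addition, not part of the original text; it assumes a fair pack with \(p = 0.5\) and draws with replacement.)

```python
import numpy as np

rng = np.random.default_rng(1)   # random generator; seeded so the sketch is reproducible

n_cards = 15        # cards drawn per hand, with replacement
p_red = 0.5         # proportion of red cards in a fair pack
n_hands = 100_000   # number of simulated hands

# Number of red cards in each simulated hand of 15, assuming a fair pack.
reds = rng.binomial(n_cards, p_red, size=n_hands)

prop_all_red = np.mean(reds == n_cards)
print(f"Proportion of simulated hands with all 15 cards red: {prop_all_red:.5f}")
# Since 0.5**15 is about 0.00003, only a handful of the 100 000 hands
# (if any) should produce all 15 red cards.
```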

25.3 Making decisions: the process

Based on the ideas in the last section, a formal process of decision-making in research can be described.

  1. Make an assumption. Make an assumption about the value of the parameter. By doing so, we assume that sampling variation explains any discrepancy between the value of the observed statistic and this assumed value of the parameter.
  2. Define the expectations of the statistic. Based on the assumption made about the parameter, describe what values of the statistic might reasonably be observed from all the possible samples from the population (due to sampling variation).
  3. Make the observations. Take a sample (one of the many possible samples), and compute the observed sample statistic from these data.
  4. Make a decision. If the observed sample statistic:
    • is unlikely to have been observed by chance, the statistic (i.e., the evidence) contradicts the assumption about the parameter: the assumption is probably (but not certainly) wrong.
    • could easily be explained by chance, the statistic (i.e., the evidence) is consistent with the assumption about the parameter, and the assumption may be (but is not certainly) correct.

This is one way to describe the process of decision-making in science (Fig. 25.2).

FIGURE 25.1: A way to make decisions.

FIGURE 25.2: A way to make decisions.

This approach is similar to how we unconsciously make decisions every day. For example, suppose I ask my son to brush his teeth (Budgett et al. 2013). Later, I wish to decide if he really did.

  1. Assumption: I assume my son brushed his teeth (because I asked him to).
  2. Expectation: Based on my assumption, I would expect to find his toothbrush is damp.
  3. Observation: When I check later, I observe a dry toothbrush.
  4. Decision: The evidence contradicts what I expected to find based on my assumption, so my assumption is probably false: he probably didn't brush his teeth.

I may have made the wrong decision: he may have brushed his teeth, then dried his toothbrush with a hair dryer. However, based on the evidence, it is more likely that he has not brushed his teeth.

The situation may have ended differently: when I check later, suppose I observe a damp toothbrush. Then, the evidence seems consistent with what I expected if he brushed his teeth (my assumption), so my assumption is probably true; he probably did brush his teeth. Again, I may be wrong: he may have rinsed his toothbrush under a tap. Nonetheless, I don't have evidence that he didn't brush his teeth.

Similar logic underlies most decision-making in science.

Example 25.1 (The decision-making process) Consider the cards example from Sect. 25.2 again. The formal process might look like this:

  1. Assumption: Assume the pack is a fair, well-shuffled pack of cards: the population proportion of red cards is \(p = 0.5\) (the value of the parameter).
  2. Expectation: Based on this assumption, roughly (but not necessarily exactly) equal proportions of red and black cards would be expected in a sample of \(15\) cards. The sample proportion of red cards \(\hat{p}\) (the value of the statistic) is expected to be close to, but maybe not exactly, \(0.5\).
  3. Observation: Suppose I then deal \(15\) cards, and all \(15\) are red cards: then \(\hat{p} = 1\).
  4. Decision: \(15\) red cards from \(15\) cards seems unlikely to occur if the pack is fair and well-shuffled. The data seem inconsistent with what I was expecting based on the assumption (Fig. 25.3), suggesting the assumption is probably false.

Of course, getting \(15\) red cards out of \(15\) is not impossible, so my conclusion may be wrong. However, the evidence strongly suggests that the pack is not fair.
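As an added back-of-the-envelope check (not part of the original example), assuming independent draws with replacement from a fair pack, the chance of all \(15\) cards being red is

\[
\Pr(\hat{p} = 1) = 0.5^{15} = \frac{1}{32\,768} \approx 0.00003,
\]

about three hands in every \(100\,000\). This is why 'all red' is such persuasive evidence against the fair-pack assumption.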

FIGURE 25.3: A way to make decisions for the cards example.

25.4 Making decisions: the steps

Let's think about each step in the decision-making process (Fig. 25.2) individually.

  • Making an assumption about the parameter (Sect. 25.4.1);
  • Describing the expectations of the statistic (Sect. 25.4.2);
  • Making the sample observations (Sect. 25.4.3); and
  • Making a decision (Sect. 25.4.4).

25.4.1 Making an assumption about the parameter

The initial assumption is that sampling variation explains any discrepancy between the parameter and the statistic. This assumption about the parameter is called the null hypothesis, denoted \(H_0\). The null hypothesis is always the 'sampling variation' explanation; that is, 'no change, no difference, no relationship' in the population (depending on the context and RQ). Any change, difference or relationship in the sample is due only to sampling variation.

Using this idea, a reasonable assumption can be made about the parameter. For example, when comparing the means of two groups, we would initially assume no difference between the population means: any difference between the sample means would be attributed to sampling variation.

Example 25.2 (Assumptions about the population) Many dental associations recommend brushing teeth for two minutes. I. D. M. Macgregor and Rugg-Gunn (1979) recorded the tooth-brushing time for \(85\) uninstructed schoolchildren from England to assess compliance with the guidelines.

We initially assume the population mean tooth-brushing time is two minutes (\(H_0\): \(\mu = 2\)). If the sample mean is not two minutes, the null hypothesis explains this discrepancy by sampling variation. A sample can then be obtained to determine if the sample mean is consistent with, or contradicts, this assumption.

25.4.2 Describing the expectations of the statistic

Based on the assumed value for the parameter, we then determine what values of the statistic to expect from all the possible samples we could select (of which we only select one). Since many samples are possible, and every sample is likely to be different (sampling variation), the observed value of the statistic depends on which one of the countless possible samples we obtain. In other words, we describe the sampling distribution.

Think about the cards in Sect. 25.2. Assuming a fair pack, then half the cards are red in the population (the pack of cards), so the population proportion is assumed to be \(p = 0.5\). In a sample of \(15\) cards, what values could be reasonably expected for the sample proportion \(\hat{p}\) of red cards (the statistic)? If samples of size \(15\) were repeatedly taken from a fair pack with \(p = 0.5\), the sample proportion of red cards would vary from hand to hand, of course. But how would \(\hat{p}\) vary from sample to sample?

As in Chaps. 22 and 23, we could simulate (using a computer) the process of dealing \(15\) cards many hundreds of times from a fair pack (i.e., \(p = 0.5\)), and observe how the sample proportion \(\hat{p}\) varies. To begin, suppose we simulated only ten hands of \(n = 15\) cards each; the animation below shows the sample proportion of red cards from each of the ten repetitions.

Suppose we repeated this for hundreds of hands of \(15\) cards (rather than the ten above), and for each hand we recorded \(\hat{p}\), the sample proportion of cards that were red. The value of \(\hat{p}\) would vary from sample to sample (sampling variation), and we could record the value of \(\hat{p}\) from each of those hundreds of hands. A histogram of these hundreds of sample proportions could be constructed; the animation below shows a histogram of the sample proportions from \(1\,000\) repetitions of a hand of \(15\) cards. This histogram shows how we might expect the sample proportion \(\hat{p}\) to vary from sample to sample (the sampling distribution), when the population proportion of red cards is \(p = 0.5\).
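The animation itself cannot be reproduced here, but a minimal simulation sketch of the same idea is shown below (an illustrative addition, not the book's own code), assuming a fair pack with \(p = 0.5\): it simulates \(1\,000\) hands of \(15\) cards and tabulates the sample proportions.

```python
import numpy as np

rng = np.random.default_rng(2)   # seeded random generator for reproducibility

n_cards, p_red, n_hands = 15, 0.5, 1_000

# Sample proportion of red cards from each of 1000 simulated hands of 15 cards.
p_hat = rng.binomial(n_cards, p_red, size=n_hands) / n_cards

# A crude text 'histogram': how many hands produced each possible value of p_hat.
values, counts = np.unique(p_hat, return_counts=True)
for v, c in zip(values, counts):
    print(f"p-hat = {v:.3f}: {'*' * (c // 5)} ({c} hands)")
# The counts pile up around 0.5 and thin out towards 0 and 1, giving a roughly
# bell-shaped sampling distribution centred on the population proportion p = 0.5.
```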

Interestingly, the shape of the sampling distribution in the animation above is roughly bell-shaped. This is no coincidence.

25.4.3 Making the sample observations

While many samples are possible, we only observe one of those countless possible samples. From our sample of \(15\) cards, all cards are red (see Sect. 25.2), and so the statistic is \(\hat{p} = 1\) (i.e., \(15\) red cards out of \(15\) cards). Assuming \(p = 0.5\), is this value of \(\hat{p}\) unusual, or not unusual?

From the animation in Sect. 25.4.2, the value \(\hat{p} = 1\) was never observed in any of the \(1\,000\) simulations. Not one of those simulations produced a value of \(\hat{p}\) even close to \(1\).

25.4.4 Making a decision

Observing \(15\) red cards out of \(15\) cards is rare: it never happened once in the \(1\,000\) simulations (see the animation above), so observing \(\hat{p} = 1\) from a fair pack is incredibly unlikely. You could reasonably conclude that finding \(\hat{p} = 1\) would almost never occur if the assumption of a fair pack were true.

But we did find \(\hat{p} = 1\) (i.e., \(15\) red cards in \(15\) cards), so the assumption of a fair pack is probably wrong. That is, there is persuasive evidence that the pack of cards is not fair.

What if we had observed \(11\) red cards in a hand of \(15\) cards, a sample proportion of \(\hat{p} = 11/15 = 0.733\)? The conclusion is not quite so obvious then: the animation above shows that \(\hat{p} = 0.733\) is unlikely, but \(\hat{p} = 0.733\) (and even larger values) did occur in the \(1\,000\) simulations.

What if \(9\) red cards were found in the \(15\) (i.e., \(\hat{p} = 0.6\))? The animation above shows that \(\hat{p} = 0.6\) could reasonably be observed, since many possible samples lead to \(\hat{p} = 0.6\), or even higher. This would not seem unusual at all, and is certainly not persuasive evidence to change our mind. Many of the possible samples produce values of \(\hat{p}\) near \(0.6\).
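To attach rough numbers to these comparisons (again an illustrative addition, assuming a fair pack), the same kind of simulation can estimate how often a fair pack gives \(11\) or more, or \(9\) or more, red cards in a hand of \(15\).

```python
import numpy as np

rng = np.random.default_rng(3)

# Red-card counts from 100 000 simulated hands of 15 cards from a fair pack.
reds = rng.binomial(15, 0.5, size=100_000)

print("Proportion of hands with 11 or more red cards:", np.mean(reds >= 11))  # roughly 0.06
print("Proportion of hands with  9 or more red cards:", np.mean(reds >= 9))   # roughly 0.30
# So p-hat = 0.733 is unlikely (but does happen by chance), while
# p-hat = 0.6 is quite common and gives no reason to doubt a fair pack.
```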

Example 25.3 (Sampling variation) Many dental associations recommend brushing teeth for two minutes. I. D. M. Macgregor and Rugg-Gunn (1979) recorded the tooth-brushing time for \(85\) uninstructed schoolchildren from England to assess compliance with the guidelines.

Of course, every possible sample of \(85\) children will include different children, and so produce a different sample mean brushing time \(\bar{x}\). Even if the population mean toothbrushing time really was two minutes (\(\mu = 2\)), the sample mean probably won't be exactly two minutes, because of sampling variation.

Assume the population mean tooth-brushing time is two minutes (\(\mu = 2\)). If this is true, we then could describe what values of the sample statistic \(\bar{x}\) to expect from all possible samples. Then, after obtaining a sample and computing the sample mean, we could determine if the sample mean seems consistent with the assumption of two minutes, or whether it seems to contradict this assumption.
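A similar simulation sketch could describe the expectation step here (an illustrative addition, not from the study: the standard deviation of \(0.75\) minutes below is a purely hypothetical value, and the brushing times are assumed roughly normal, just for illustration).

```python
import numpy as np

rng = np.random.default_rng(4)

mu_assumed = 2.0     # assumed population mean brushing time in minutes (the null hypothesis)
sigma = 0.75         # hypothetical population standard deviation, for illustration only
n = 85               # sample size in the study
n_samples = 10_000   # number of simulated samples of 85 children

# Sample mean brushing time from each simulated sample of 85 children.
xbars = rng.normal(mu_assumed, sigma, size=(n_samples, n)).mean(axis=1)

lo, hi = np.percentile(xbars, [2.5, 97.5])
print(f"Middle 95% of simulated sample means: {lo:.2f} to {hi:.2f} minutes")
# Under the assumption mu = 2 (and this hypothetical sigma), sample means much
# outside roughly 1.84 to 2.16 minutes would be unusual due to sampling variation alone.
```

Whether the observed sample mean falls inside or well outside this kind of range then informs the decision step.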

25.5 Chapter summary

Making decisions about parameters based on a statistic is difficult: only one of the many possible samples is observed. Since every sample is likely to be different, different values of the sample statistic are possible. In this chapter, though, a process for making decisions has been studied (Fig. 25.2).

Decisions are often made by making an assumption about the parameter, which leads to an expectation of what values of the statistic are reasonably possible. We can then make observations about our sample, and then make a decision about whether the sample data support or contradict the initial assumption.

25.6 Quick review questions

Are the following statements true or false?

  1. Parameters describe populations.
  2. Both \(\bar{x}\) and \(\mu\) are statistics.
  3. The value of a statistic is likely to be the same in every sample.
  4. Sampling variation describes how the value of a statistic varies from sample to sample.
  5. An initial assumption is made about the sample statistic.
  6. The variation in the statistic from sample to sample is called sampling variation.
  7. If the sample results seem inconsistent with what was expected, then the assumption about the population is probably true.
  8. In the sample, we know exactly what to expect.
  9. Hypotheses are made about the population.

25.7 Exercises

Answers to odd-numbered exercises are given at the end of the book.

Exercise 25.1 While playing a die-based game, your opponent rolls a six ten times in a row.

  1. Do you think there is a problem with the die?
  2. Explain how you came to this decision.

Exercise 25.2 In a 2012 advertisement, an Australian pizza company claimed that their \(12\)-inch pizzas were 'real \(12\)-inch pizzas' (P. K. Dunn 2012).

  1. What is a reasonable assumption to make to test this claim?
  2. The claim is supported by a sample of \(125\) pizzas, which gave the sample mean pizza diameter as \(\bar{x} = 11.48\) inches. What are the two reasons why the sample mean is not exactly \(12\) inches?
  3. Does the claim appear to be supported by, or contradicted by, the data? Explain.
  4. Would your conclusion change if the sample mean was \(\bar{x} = 11.25\) inches? Explain.
  5. Does your answer depend on the sample size? For example, is observing a sample mean of \(11.25\) inches from a sample of size \(n = 10\) equivalent to observing a sample mean of \(11.25\) inches from a sample of size \(n = 125\)? Explain.

Exercise 25.3 Since my wife and I have been married, I have been called to jury service four times. The latest notice reads: 'Your name has been selected at random from the electoral roll'.

In the same time, my wife has never been called to jury service. Do you think the selection process really is 'at random'? Explain.

Exercise 25.4 Suppose that \(36\)% of all students at a certain large university are aged over \(30\). A student takes a sample of \(n = 40\) students from the School of Arts to determine if students in that School are somehow different from the general university population in terms of age.

  1. What is the null hypothesis?
  2. If the student researcher finds \(13\) students in the sample aged over \(30\), does this present persuasive evidence to change your mind? Explain.
  3. If the student researcher finds three students in the sample aged over \(30\), does this present persuasive evidence to change your mind? Explain.