## 3.5 Chance and variation in the response

*Natural* (chance) variation refers to variation that
cannot otherwise be explained:
even repeating a study exactly the same way every time
will not always produce the same values of the response variable.
This is called *natural variation*, *chance variation*, or just *chance*.

Natural variation makes
the influence of the explanatory variable
(which we are wanting to study)
hard to detect,
so minimizing chance variation is important.
*Minimise the amount of the chance variation*,
requires using good design principles,
and measuring as many other extraneous variables
that may explain variation in the response variable
as is reasonable.

Chance can impact the values of the response variable in different ways:
each *individual* can produce different values of the response variable
each time the individual repeats the study
(*within*-individuals variation);
each individual in the study can produce different values of the
response variable compared to *other* individuals
(*between*-individuals variation):

- To estimate the amount of variation
*within*individuals: Many observations are needed from each unit of analysis (individual). - To estimate amount of variation
*between*individuals: Many units of analysis (individuals) are needed.

Since *between*-individual variation is usually more variable than the
*within*-individual variation,
using *many* individuals
is usually more important than
using a smaller number of individuals many times.

**Lemma 3.2 (Chance) **Consider the letter-typing study (Example 2.4) again.
What are the advantages and disadvantage of:

- measuring one female 30 times?
- measuring 30 different females once each?
- measuring 10 different females three times each?