1.6 External and internal validity

All studies should be designed to be externally valid (Chap. 2) and internally valid (we will look at this in more detail in the next topic) as far as possible.

A study is externally valid if the results are likely to be generalise to other groups in the population, apart from those studied in the sample.

For a study to be externally valid, it first needs to be internally valid. Using a random sample helps ensure external validity. In addition, the use of inclusion and exclusion criteria helps clarify to whom or what the results may apply outside of the sample being studied.

Definition 1.8 (External validity) Externally validity refers to the ability to generalise the results to other groups in the population, apart from the sample studied.

For a study to be truly externally valid, the sample must be a random sample.

A study is externally valid if the results from the sample studied are likely to apply to the intended population. It does not mean that the results apply more widely than the intended population.

Example 1.9 Suppose the population in a study is Queensland university students. The sample would be the students studied. The study is externally valid if the sample is a random sample from the population of students.

The results will not necessarily apply to Queensland residents, but this has nothing to do with externally validity. External validity concerns how the sample represents the intended population in the RQ, which is Queensland university students. The study is not concerned with all Queensland residents.

Internally validity refers to how reasonable and logical it is to draw connections between the outcome and the comparison/connection: that is, the strength of the inferences made from the study.

High internal validity means that changes in the response variable can confidently be related to changes in the explanatory variable in the group that was studied; the possibility of other explanations for changes in the response variable have been minimised.

Definition 1.9 (Internal validity) Internally valid refers to the strength of the association between the outcome and the comparison/connection.

In a study with high internal validity, the association between the outcome and the comparison/connection can be attributed to that comparison/connection, rather than to other factors.

One of many threats to internal validity might be that the groups being compared are different to begin with (for example, if the group receiving echinacea is younger (on average) than the group receiving no medication).

To check this, the baseline characteristics of the individuals in the groups can be compared: the groups being compared should be as similar as possible, so that any differences in the outcome cannot be attributed to pre-existing difference in the two groups being compared.

Example 1.10 (Baseline characteristics) In a study of treating depression in adults (Danielsson et al. 2014), three treatments were compared: exercise, basic body awareness therapy, or advice.

If any differences between the treatments were found, the researchers need to be confident that the differences were due to the treatment.

For this reason, the three groups were compared to ensure the groups were similar in terms of average ages, percentage of women, taking of anti-depressants, and many other aspects.

An internally valid study requires studies to be carefully designed; this is discussed at length in the next topic. In general, well-designed experimental studies are more likely to be internally valid than observational studies (Fig. 1.8).

Figure 1.8: Well-designed true experiments are more likely to have high internal validity

References

Danielsson, Louise, Ilias Papoulias, Eva-Lisa Petersson, Jane Carlsson, and Margda Waern. 2014. “Exercise or Basic Body Awareness Therapy as Add-on Treatment for Major Depression: A Controlled Study.” Journal of Affective Disorders 168: 98–106.