Chapter 18 Cluster Randomized Controlled Trials

18.1 Learning objectives

By the end of this chapter, you will be able to:

  • Understand when a cluster RCT is preferable to a regular RCT

  • Explain the limitations of a cluster RCT

18.2 What is a cluster RCT?

In some situations, allocation of intervention can be problematic due to pragmatic constraints; for example, for a whole-class intervention, it may be disruptive to split students within each class into intervention and control groups. A potential solution is to use cluster trials, where the allocation to intervention is performed at the group level. We can designate clusters that would naturally occur together, for example, school classes, patients under the care of a particular medical practice or a specific therapist, or perhaps by geographic location (county, district or healthcare provider).

Individual RCT, illustrated on left, assigns participants to control and treatment groups disregarding any clustering. Cluster RCT, illustrated on right, allocates treatment according to which cluster the individual falls in.

Figure 18.1: With regular RCT (on left), individuals are sampled at random from the population for allocation to intervention (blue) or control (white). With cluster RCT (on right) natural clusters in the population (schools, hospitals, cities, etc) are sampled at random and assigned to intervention (blue) or control (white)

Figure 18.1 illustrates the logic.

18.3 Advantages of a cluster design

The first advantage of this approach is logistic. It avoids the practical difficulties of randomizing to intervention within established natural clusters, e.g. if children within a classroom are in different intervention arms, or a therapist is required to keep track of which cases are allocated to one intervention or another. . In addition, if it is necessary to train those administering the intervention, clustering means that fewer individuals need to be trained overall, as those working in each cluster would require training in just one method.

Another important consideration is that clustering avoids “contamination” effects. Contamination in the context of controlled trials means that individuals in one arm of the trial, say the control arm, have been exposed to another arm of the trial, say intervention. For instance, suppose in the example from the last chapter, within the same classroom we had some children who received a phonics intervention, some who received computerized intervention, and others who just had “business as usual”. The children as well as the teachers might feel this is unfair, and indeed they might demand that the control children take part in the other activities. Even if there is no sense of unfairness, a teacher or therapist who is administering two interventions may find it hard to keep them separate.

In pharmaceutical trials, we can control contamination by masking the randomization and using placebos that are physically indistinguishable from the drug, so that neither the participant nor the researcher knows who is receiving the intervention. For behavioural interventions, this is not the case, and if participants know one another, then they may readily discover which arm of the trial they are in. The practitioner administering the intervention will also be aware of who is receiving intervention and who is not, and so may treat them differently. The clustered approach minimizes the risk of contamination, as intervention clusters are less likely to have direct exposure to each other, and practitioners will consistently do the same intervention with everyone in the study.

Finally, we may note that any intervention that is administered to a whole group at a time - e.g. a modification to teacher practice that affects all children in the classroom - requires a clustered approach.

18.4 Disadvantages of a cluster design

The benefits of the clustered design are counteracted by some serious disadvantages. First of all, a clustered trial is far less efficient than a regular trial, because the unit of analysis is the cluster rather than the individual. Accordingly, much larger samples are needed to attain adequate power.

It may not be obvious why clustering should require a different analytic approach, and indeed, the literature does contain trials which have been analysed as if they were regular trials, even though the data are clearly clustered (e.g. pupils in schools A and B receive intervention whereas those in schools C and D are controls). The problem is that clustering introduces confounds. For instance, if we have classrooms as clusters, then all the children in one class are taught by the same teacher. They are also likely to be more similar within a cluster than between clusters on variables such as social background. This reduces variability within a cluster. Previously, we have regarded reduced variability as a good thing, as it helps pick out an intervention effect from a background of noise. But in this case, it is a bad thing, because it is confounded with the intervention effect, and may make it seem larger than it is. For instance, if we just had a study where all the children in school A had intervention 1, and all of those in school B had intervention 2, then if we find children in school A did better than those in school B, this is hard to interpret. It could just be that teaching is better in school A, regardless of the intervention.

Statisticians have developed sophisticated methods of analysis for dealing with the dependency among individuals within the same cluster (Campbell & Walters, 2014). It is worth taking advice from a statistician if you are contemplating doing such a trial. For a detailed account of methods, see Gelman & Hill (2007).

18.5 Check your understanding

The Education Endowment Foundation reported on a cluster RCT of “growth mindset” intervention that was completed in 2014. The project involved 101 schools and 5018 pupils across England, assigned to either intervention or control groups. The detailed rationale for the study was as follows:

Pupils were exposed to the mindset theory over a structured series of classes led by their own teachers. The aim of these sessions was to change the way that pupils think about their intelligence, in particular to build high expectations and resilience and to encourage specific plans and goals that emphasize growth, the development of skill, or the expansion of knowledge. The programme was delivered to pupils through eight sessions. These covered themes including the meaning of intelligence, dealing with mistakes and emotions, understanding the brain and encouraging challenge, effort and persistence, stereotypes, and inspirational people. In addition, teachers worked to make the intervention more effective by embedding the growth mindset approach in their everyday class activities, for example, by repeating the message that making mistakes is an opportunity to learn rather than a negative experience.

Teachers received one day of training that was prepared and delivered by a team of education psychologists from the University of Portsmouth. This training introduced teachers to mindset theory and evidence and provided tips on how to embed the approach in their classrooms/schools (for example, how to communicate incremental beliefs to pupils through feedback and praise). To supplement these suggested changes to everyday practice, teachers were given the materials and training to run an eight-week programme (up to 2.5 hours a week) of weekly lessons and activities with their Year 6 pupils. Specifically, teachers were given a training manual that included comprehensive lesson plans for the eight sessions and a USB stick with additional material to support their interaction with children and their parents. In addition, teachers were granted free access to online videos supplied by the Positive Edge Foundation; these videos were referenced in the lesson plans and teachers were encouraged to use the videos to explain or emphasize particular concepts such as resilience and learning from mistakes. The videos focus on three famous people (Darwin, Einstein, and Wilma Rudolph), describing their lives and how they overcame adversity, in line with the growth mindset message. They included, for example, a video about Charles Darwin’s less-than-promising school performance and a medal-winning athlete who overcame a difficult start in life. Other materials provided by the project team included posters on growth mindset and pupil quizzes. ˙

  • Why was a cluster RCT method used, rather than a standard two-arm RCT?

  • The results from the trial were disappointing: no evidence of any benefit was found for children who received growth mindset training. Do you think it would be worthwhile to find ways to evaluate the intervention using a non-clustered RCT? What would be the advantages/challenges of such an approach?

  • The authors of the report suggested various reasons why the trial results were so disappointing. Can you think of what these reasons might be? Would there be any way of checking the plausibility of such explanations? When you have considered this point, look at the discussion of reasons for null results on pp 42-43 of [the report], which you can find using a web search with “EEF Growth Mindset” as search terms. Did you anticipate the points raised there - and did the report consider the points you thought of? Note also how the researchers used a “implementation and process evaluation” that made it possible to evaluate different possible explanations (p. 29).

References

Campbell, M., & Walters, S. (2014). How to Design, Analyse and Report Cluster Randomised Trials in Medicine and Health Related Research. Wiley.
Gelman, A., & Hill, J. (2007). Data Analysis Using Regression and Multilevel/Hierarchical Models (Analytical Methods for Social Research). Cambridge University Press.