Chapter 4 Phase II/III design

Adaptive Multiple Arm Multiple Stage (MAMS)

Adaptive MAMS are generalizations opf 2-arm group sequential designs:

Multiple treatment arms compared to a common control
Multiple looks at accumulating data
Early stopping for efficacy or futility
Treatments may be dropped at each interim look
Sample size re-estimation permitted at each interim look
Strong control of Family Wise Error Rate (FWER)$

Example: SOCRATES Reduced Trial (Gheorghiade et al. 2015)

FWER = probability of making one or more false claims

\[\begin{array}{||l|l||||} \hline \hline \text { Null Hypotheses } & \text { Type of Incorrect Conclusion } \\ \hline \hline H^{(1,2,3)}: \delta_1=\delta_2=\delta_3=0 & \begin{array}{l} \text { The selected treatment is } \\ \text { declared superior to placebo } \end{array} \\ \hline H^{(1,2)}: \delta_1=\delta_2=0, \delta_3>0 & \begin{array}{l} \text { Treatment } 1 \text { or } 2 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline H^{(1,3)}: \delta_1=\delta_3=0, \delta_2>0 & \begin{array}{l} \text { Treatment } 1 \text { or } 3 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline H^{(2,3)}: \delta_2=\delta_3=0, \delta_1>0 & \begin{array}{l} \text { Treatment } 2 \text { or } 3 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline H^{(1)}: \delta_1=0, \delta_2>0, \delta_3>0 & \begin{array}{l} \text { Treatment } 1 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline H^{(2)}: \delta_2=0, \delta_1>0, \delta_3>0 & \begin{array}{l} \text { Treatment } 2 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline H^{(3)}: \delta_3=0, \delta_1>0, \delta_2>0 & \begin{array}{l} \text { Treatment } 3 \text { is selected and } \\ \text { is declared superior to placebo } \end{array} \\ \hline \end{array}\]

Goal: test $H^{(i)}: \delta_i=0, i=1,2,3$, with strong FWER control

Form the Closed Set of all elementary and intersection hypotheses \[ \begin{gathered} H^{(1)}, H^{(2)}, H^{(3)} \\ H^{(1,2)}=H^{(1)} \cap H^{(2)}, H^{(1,3)}=H^{(1)} \cap H^{(3)}, H^{(2,3)}=H^{(2)} \cap H^{(3)} \\ H^{(1,2,3)}=H^{(1)} \cap H^{(2)} \cap H^{(3)} \end{gathered} \]
To reject any elementary hypothesis at level $\alpha$, must also reject every intersection hypothesis containing it at its local level- $\alpha$

Method 1: Stage-Wise MAMS

Let $\left\{p_j^{(1)}, p_j^{(2)}, p_j^{(3)}\right\}$ be unadjusted $p$-values based only on the incremental data at stages (looks) $j=1,2,3,4$
Bonferroni adjusted p-value for $H^{(123)}$ at stage $j$ \[ p_j^{(123)}=3 \min \left\{p_j^{(1)}, p_j^{(2)}, p_j^{(2)}\right\} \]
Simes adjusted $p$-values for $H^{(123)}$ at stage $j$ \[ p_j^{(1,2,3)}=\min \left\{3 p_j^{(1)}, 1.5 p_j^{(2)}, p_j^{(3)}\right\} \]
Dunnett adjusted p-value at stage $j$ \[ p_j^{(1,2,3)}=P\left(\cup_{i=1}^3 P_i \leq p_i\right) \]

Level-$\alpha$ Test of $H^{(1,2,3)}$: Look 3 Reject if $\lambda_{13} \Phi^{-1}\left(1-p_1^{(1,2,3)}\right)+\lambda_{23} \Phi^{-1}\left(1-p_2^{(1,2,3)}\right)+\lambda_{33} \Phi^{-1}\left(1-p_3^{(1,2,3)}\right) \geq 2.4$

Repeat this process to test \[ H^{(12)}, H^{(1,3)}, H^{(2,3)}, H^{(1)}, H^{(2)}, H^{(3)} \] - Reject $H^{(1)}$ under closed testing if $H^{(123)}, H^{(12)}, H^{(13)}$ and $H^{(1)}$ are all rejected at level $\alpha$ - Reject $H^{(2)}$ under closed testing if $H^{(123)}, H^{(12)}, H^{(23)}$ and $H^{(2)}$ are all rejected at level $\alpha$ - Reject $H^{(3)}$ under closed testing if $H^{(123)}, H^{(13)}, H^{(23)}$ and $H^{(3)}$ are all rejected at level $\alpha$

Summary:

Can drop treatment arms at each stage
Can change the number and spacing of future stages
Can alter the sample size of future stages
Can change the $\alpha$-spending function for future stages

Stage Wise MAMS is flexible, easy to implement and applicable under a range of distributional assumptions

Method 2: Cumulative MAMS

Exploits asymptotic normality correlations structure

Cumulative Wald statistic for each dose $i$ vs placebo, at look $j$ \[ Z_{i j}=\frac{\hat{\delta}_{i j}}{\operatorname{se}\left(\hat{\delta}_{i j}\right)} \]
Construct multiplicity adjusted level- $\alpha$ group sequential efficacy boundaries $\left\{u_j, j=1, \ldots, 4\right\}$ \[ P_0\left(\bigcup_{j=1}^4 \max \left\{Z_{1 j}, Z_{2 j}, Z_{3 j}\right\} \geq u_j\right)=\alpha \]
FWER is automatically controlled if no adaptations
But what if we have the following adaptations?
- Drop dose and re-allocate its remaining subjects
- Re-estimate sample size of future stages
- Change the number and spacing of future stages
- Change error spending function for the future stages
Must recompute boundaries by conditional error rate (CER) method if adapt

Compute CER as if did not drop dose: $CER = P_{\underline{0}}\left\{\max \left(Z_{13}, Z_{23}, Z_{33}\right) \geq 2.7 \text { or } \max \left(Z_{14}, Z_{24}, Z_{34}\right) \geq 2.4 \mid\left(z_{12}, z_{22}, z_{32}\right)\right\}$

Recompute boundaries $\left(b_3^*, b_4^*\right)$ so that $P_0\left\{\max \left(Z_{23}^*, Z_{33}^*\right) \geq b_3^* \text { or } \max \left(Z_{24}^*, Z_{34}^*\right) \geq b_4^* \mid\left(z_{22}, z_{32}\right)\right\}= CER$

Key differences between the two methods:

\[\begin{array}{|l|l|} \hline \hline \text { Stage Wise MAMS } & \text { Cumulative MAMS } \\ \hline \text { Inverse normal p-value Combination } & \text { Cumulative Wald statistic } \\ \text { Track a single statistic } & \text { Track one statistic for each dose } \\ \text { Compute two-arm boundaries } & \text { Compute multi-arm boundaries } \\ \text { Valid for any general setting } & \text { Valid for asymptotically normal setting } \\ \text { If adapt, use pre-specified weights } & \text { If adapt, use CER for FWER control } \\ \hline \end{array}\]

Final Comments:

Generally speaking Cumulative MAMS would be preferred to Stage Wise MAMS because of greater power
But Stage Wise MAMS controls FWER even for non-normal data and small sample sizes; hence might be preferable for rare disease trials
Decision rules for dropping arms play an important role in power comparisons and need further investigation
MAMS methods can be extended to investigating multiple populations and multiple endpoints with FWER
FWER control required on a case by case basis

Graphical testing for group sequential design

Basket trial

A basket trial evaluates a single therapy in multiple diseases or disease subtypes

Heng Zhou’s papers: (Zhou et al. 2019), (Wu et al. 2021), (Jing et al. 2022)

Umberlla trial

An umbrella trial evaluates multiple therapies simultaneously for a single disease

Platform trial

A platform trial evaluates multiple therapies for a single disease in a perpetual manner, with therapies allowed to enter or leave the platform over time

References

Gheorghiade, Mihai, Stephen J Greene, Javed Butler, Gerasimos Filippatos, Carolyn SP Lam, Aldo P Maggioni, Piotr Ponikowski, et al. 2015. “Effect of Vericiguat, a Soluble Guanylate Cyclase Stimulator, on Natriuretic Peptide Levels in Patients with Worsening Chronic Heart Failure and Reduced Ejection Fraction: The SOCRATES-REDUCED Randomized Trial.” Jama 314 (21): 2251–62.

Jing, Naimin, Fang Liu, Heng Zhou, Cong Chen, et al. 2022. “An Optimal Two-Stage Exploratory Basket Trial Design with Aggregated Futility Analysis.” Contemporary Clinical Trials 116: 106741.

Wu, Xiaoqiang, Cai Wu, Fang Liu, Heng Zhou, and Cong Chen. 2021. “A Generalized Framework of Optimal Two-Stage Designs for Exploratory Basket Trials.” Statistics in Biopharmaceutical Research 13 (3): 286–94.

Zhou, Heng, Fang Liu, Cai Wu, Eric H Rubin, Vincent L Giranda, and Cong Chen. 2019. “Optimal Two-Stage Designs for Exploratory Basket Trials.” Contemporary Clinical Trials 85: 105807.