D Some theory on equivalence trials

D.1 Type I error in equivalence trials

Assume we have a continuous outcome with variance \(\sigma^2\) and two treatment groups of size \(n_A\) and \(n_B\), respectively. Now consider the difference \(\bar d\) of the mean outcomes as treatment effect, so \(\bar d \sim \mathop{\mathrm{N}}(\Delta, \sigma^2 \, \lambda^2)\) where the true treatment difference is \(\Delta\) and \(\lambda = \sqrt{1/n_A+1/n_B}\).

Equivalence is established if the \(\gamma \cdot 100\) % confidence interval for the treatment difference is within the pre-specified interval of equivalence \((-\delta, \delta)\), i.e. \[ \bar d \pm z_{(1+\gamma)/2} \, \sigma \, \lambda \subset (-\delta, \delta). \] With \(z = z_{(1+\gamma)/2}\) we obtain the equivalent requirement \[ \bar d \in (-\xi, \xi) \, \mbox{ where } \xi = \delta - z_{(1+\gamma)/2} \, \sigma \, \lambda \]

to establish equivalence. This happens with probability

\[\begin{eqnarray} \tag{D.1} \operatorname{\mathsf{Pr}}(\bar d \in (-\xi, \xi) ) &=& \Phi\left(\frac{\xi - \Delta }{\lambda \, \sigma}\right) - \Phi\left(\frac{-\xi - \Delta }{\lambda \, \sigma}\right) \nonumber \\ &=& \Phi\left(\frac{\delta - \Delta }{\lambda \, \sigma} - z_{(1+\gamma)/2}\right) - \Phi\left(\frac{-\delta - \Delta }{\lambda \, \sigma} + z_{(1+\gamma)/2}\right). \end{eqnarray}\]

Now evaluate (D.1) at \(\Delta = \delta\) to obtain the Type I error rate (more precisely an upper bound on the Type I error rate)

\[\begin{eqnarray} \tag{D.2} \alpha = \Phi(-z_{(1+\gamma)/2}) - \Phi\left(z_{(1+\gamma)/2} - \frac{2 \delta}{\lambda \, \sigma}\right). \end{eqnarray}\]

Likewise, the power \(1 - \beta\) is derived from (D.1) with \(\Delta = 0\):

\[ 1 - \beta = 2 \Phi\left(\frac{\delta}{\lambda \, \sigma} - z_{(1+\gamma)/2}\right) - 1 \] so \[ 1 - \beta/2 = \Phi\left(\frac{\delta}{\lambda \, \sigma} - z_{(1+\gamma)/2}\right). \]

Note that \(1 - \beta/2 = \Phi(z_{1-\beta/2})\), and therefore

\[\begin{eqnarray} \tag{D.3} \frac{\delta}{\lambda \, \sigma} = z_{(1+\gamma)/2} + z_{1-\beta/2}. \end{eqnarray}\]

With (D.2) we obtain

\[\begin{eqnarray} \tag{D.3} \alpha \approx \Phi(-z_{(1+\gamma)/2}) = 1-(1+\gamma)/2 = (1-\gamma)/2 \end{eqnarray}\]

for relatively small \(\beta\) (say \(\beta \leq 30\) %), since the second term in (D.2)

\[ \Phi\left(z_{(1+\gamma)/2} - \frac{2 \delta}{\lambda \, \sigma}\right) = \Phi(z_{(1+\gamma)/2} - 2 \, (z_{(1+\gamma)/2} + z_{1-\beta/2})) = \Phi(-z_{(1+\gamma)/2} - 2 \, z_{1-\beta/2}) \]

is then very close to zero. Note that (D.3) is in contrast to superiority trials, where \(\alpha = 1-\gamma\).

D.2 Sample size calulations in equivalence trials

Assume equal group sizes, i.e. \(n=n_A=n_B\), then \(\lambda=\sqrt{2/n}\).

With (D.3) we then obtain

\[\begin{eqnarray*} n &=& \frac{2 \sigma^2 (z_{(1+\gamma)/2} + z_{1-\beta/2})^2}{\delta^2} \\ &=& \frac{2 \sigma^2 (z_{1-\alpha} + z_{1-\beta/2})^2}{\delta^2} \end{eqnarray*}\]

as the required sample size in each group for Type I error rate \(\alpha\) and power \(1-\beta\).

ACRE Trial Collaborators and others. 2009. “Effect of ‘Collaborative Requesting’ on Consent Rate for Organ Donation: Randomised Controlled Trial (ACRE Trial).” The BMJ 339.
Adamsen, Lis, Morten Quist, Christina Andersen, Tom Møller, Jørn Herrstedt, Dorte Kronborg, Marie T Baadsgaard, et al. 2009. Effect of a multimodal high intensity exercise intervention in cancer patients undergoing chemotherapy: randomised controlled trial.” BMJ 339: 895–99.
Ahmed, Irfan, Jemma Hudson, Karen Innes, Rodolfo Hernández, Katie Gillies, Rebecca Bruce, Victoria Bell, et al. 2023. “Effectiveness of Conservative Management Versus Laparoscopic Cholecystectomy in the Prevention of Recurrent Symptoms and Complications in Adults with Uncomplicated Symptomatic Gallstone Disease (c-GALL Trial): Pragmatic, Multicentre Randomised Controlled Trial.” BMJ, December, e075383. https://doi.org/10.1136/bmj-2023-075383.
Allan, Laurie, Helen Hays, Niels-Henrik Jensen, Bernard Le Polain de Waroux, Michiel Bolt, Royden Donald, and Eija Kalso. 2001. “Randomised Crossover Trial of Transdermal Fentanyl and Sustained Release Oral Morphine for Treating Chronic Non-Cancer Pain.” BMJ 322 (7295): 1154.
Altman, Douglas G. 1991. Practical Statistics for Medical Research. London: Chapman & Hall.
———. 1998. Confidence intervals for the number needed to treat.” BMJ 317: 1309–12.
Altman, Douglas G., and J. Martin Bland. 1994a. Statistics Notes: Diagnostic tests 3: receiver operating characteristic plots.” BMJ 309: 188.
Altman, Douglas G, and J Martin Bland. 1994b. Statistics Notes: Diagnostic tests 1: sensitivity and specificity.” BMJ 308: 1552.
———. 1994c. Statistics Notes: Diagnostic tests 2: predictive values.” BMJ 309: 102.
———. 1996a. Statistics Notes: Measurement error.” BMJ 313: 744.
———. 1996b. Statistics Notes: Measurement error and correlation coefficients.” BMJ 313: 41–42.
———. 1996c. Statistics Notes: Measurement error proportional to the mean.” BMJ 313: 106.
———. 2003. Statistics Notes: Interaction revisited: the difference between two estimates.” BMJ 326 (7382): 219.
———. 2005. Statistics Notes: Standard deviations and standard errors.” BMJ 331: 903.
Altman, Douglas G, Jonathon J Deeks, and David L Sackett. 1998. “Odds Ratios Should Be Avoided When Events Are Common.” BMJ 317 (7168): 1318.
Altman, Douglas G, David Machin, Trevor N Bryant, and Martin J Gardner. 2000. Statistics with Confidence. Second. BMJ Books.
Anderson, J R, K C Cain, and R D Gelber. 1983. “Analysis of Survival by Tumor Response.” Journal of Clinical Oncology 1 (11): 710–19. https://doi.org/10.1200/JCO.1983.1.11.710.
Appel, Lawrence J., Mark Espeland, Paul K. Whelton, Therese Dolecek, Shiriki Kumanyika, William B. Applegate, Walter H. Ettinger, et al. 1995. “Trial of Nonpharmacologic Intervention in the Elderly (TONE).” Annals of Epidemiology 5 (2): 119–29. https://doi.org/10.1016/1047-2797(94)00056-y.
Arroll, Bruce, Natalie Khin, and Ngaire Kerse. 2003. Screening for depression in primary care with two verbally asked questions: cross sectional study.” BMJ 327: 1144–46.
Ballard, Clive, Marisa Margallo-Lana, Edmund Juszczak, Simon Douglas, Alan Swann, Alan Thomas, John O’Brien, et al. 2005. Quetiapine and rivastigmine and cognitive decline in Alzheimer’s disease: randomised double blind placebo controlled trial.” BMJ 330: 1–5.
Bennewith, Olive, Nigel Stocks, David Gunnell, Tim J Peters, Mark O Evans, and Deborah J Sharp. 2002. General practice based intervention to prevent repeat episodes of deliberate self harm: cluster randomised controlled trial.” BMJ 324 (7348).
Berlin, Ivan, Gilles Grangé, Nelly Jacob, and Marie-Laure Tanguy. 2014. Nicotine patches in pregnant smokers: randomised, placebo controlled, multicentre trial of efficacy.” BMJ 348: 1–16.
Bland, J M., and D. G Altman. 1996. “Statistics Notes: Measurement Error Proportional to the Mean.” BMJ 313 (7049): 106–6. https://doi.org/10.1136/bmj.313.7049.106.
Bland, J Martin, and Douglas G Altman. 1986. Statistical methods for assessing the agreement between two methods of clinical measurements.” Lancet 1: 307–10.
———. 1994a. Statistics Notes: Regression towards the mean.” BMJ 308: 1499.
———. 1994b. Statistics Notes: Some examples of regression towards the mean.” BMJ 309: 780.
———. 1999. “Measuring Agreement in Method Comparison Studies.” Statistical Methods in Medical Research 8 (2): 135–60. https://doi.org/10.1177/096228029900800204.
Bland, J Martin, and Sally M Kerry. 1997. Statistics Notes: Trial randomised in clusters.” BMJ 315: 600.
Bland, John Martin. 2009. The tyranny of power: is there a better way to calculate sample size? BMJ 339: 1133–35.
Bland, Martin. 2015. An Introduction to Medical Statistics. Fourth. Oxford University Press.
Blue, Lynda, Elanor Lang, John JV McMurray, Andrew P Davie, Theresa A McDonagh, David R Murdoch, Mark C Petrie, et al. 2001. “Randomised Controlled Trial of Specialist Nurse Intervention in Heart Failure.” BMJ 323 (7315): 715–18.
Briel, Matthias, Philipp Schuetz, Beat Mueller, Jim Young, Ursula Schild, Charly Nusbaumer, Pierre Périat, Heiner C Bucher, and Mirjam Christ-Crain. 2008. “Procalcitonin-Guided Antibiotic Use Vs a Standard Approach for Acute Respiratory Tract Infections in Primary Care.” Archives of Internal Medicine 168 (18): 2000–2007.
Brown, Jeremy, George Pengas, Kate Dawson, Lucy A Brown, and Philip Clatworthy. 2009. Self administered cognitive screening test (TYM) for detection of Alzheimer’s disease: cross sectional study.” BMJ 338: 1–8.
Burgess, Ian F, Christine M Brown, and Peter N Lee. 2005. “Treatment of Head Louse Infestation with 4% Dimeticone Lotion: Randomised Controlled Equivalence Trial.” Bmj 330 (7505): 1423.
Butler, Christopher C, Sharon A Simpson, Kerenza Hood, David Cohen, Tim Pickles, Clio Spanou, Jim McCambridge, et al. 2013. “Training Practitioners to Deliver Opportunistic Multiple Behaviour Change Counselling in Primary Care: A Cluster Randomised Trial.” BMJ 346.
Charig, C R, D R Webb, S R Payne, and J E Wickham. 1986. Comparison of treatment of renal calculi by open surgery, percutaneous nephrolithotomy, and extracorporeal shockwave lithotripsy.” BMJ 292: 879–82.
Cockayne, Sarah, Catherine Hewitt, Kate Hicks, Shalmini Jayakody, Arthur Ricky Kang’ombe, Eugena Stamuli, Gwen Turner, et al. 2011. Cryotherapy versus salicylic acid for the treatment of plantar warts (verrucae): a randomised controlled trial.” BMJ 344: 1–7.
Coelho, Teresa, Wilson Marques, Noel R. Dasgupta, Chi-Chao Chao, Yeşim Parman, Marcondes Cavalcante França, Yuh-Cherng Guo, et al. 2023. “Eplontersen for Hereditary Transthyretin Amyloidosis with Polyneuropathy.” JAMA 330 (15): 1448. https://doi.org/10.1001/jama.2023.18688.
Collett, David. 2003. Modelling Survival Data in Medical Research. Second. Chapman & Hall/CRC.
Colorectal Cancer Collaborative Group. 2000. “Palliative Chemotherapy for Advanced Colorectal Cancer: Systematic Review and Meta-Analysis.” BMJ 321 (7260): 531–35.
Cushny, Arthur R., and A. Roy Peebles. 1905. “The Action of Optical Isomers: II. Hyoscines.” The Journal of Physiology 32 (5–6): 501–10. https://doi.org/10.1113/jphysiol.1905.sp001097.
Deeks, Jonathan J, and Douglas G Altman. 2004. Statistics Notes: Diagnostic tests 4: likelihood ratios.” BMJ 329: 168–69.
Dirnagl, Ulrich. 1996. “Believe It or Not!” https://dirnagl.com/2018/02/09/believe-it-or-not/.
Doyle, Joseph, Marcella Alsan, Nicholas Skelley, Yutong Lu, and John Cawley. 2024. “Effect of an Intensive Food-as-Medicine Program on Health and Health Care Use: A Randomized Clinical Trial.” JAMA Internal Medicine 184 (2): 154. https://doi.org/10.1001/jamainternmed.2023.6670.
Durán-Cantolla, Joaquı́n, Felipe Aizpuru, Jose Marı́a Montserrat, Eugeni Ballester, Joaquı́n Terán-Santos, Jose Ignacio Aguirregomoscorta, Mónica Gonzalez, et al. 2010. “Continuous Positive Airway Pressure as Treatment for Systemic Hypertension in People with Obstructive Sleep Apnoea: Randomised Controlled Trial.” BMJ 341.
Egger, M., J. P. T. Higgins, and G. D. Smith. 2022. Systematic Reviews in Health Research – Meta-Analysis in Context. Third. Hoboken NJ, USA: John Wiley & Sons Ltd.
Fine Olivarius, Niels de, Henning Beck-Nielsen, Anne Helms Andreasen, Mogens Hørder, and Poul A Pedersen. 2001. Randomised controlled trial of structured personal care of type 2 diabetes mellitus.” BMJ 323 (7319).
Fisher, Ronald Aylmer. 1925. Statistical Methods for Research Workers. Edinburgh: Oliver; Boyd.
Francis Jr, Thomas, Robert F Korns, Robert B Voight, Morton Boisen, Fay M Hemphill, John A Napier, and Eva Tolchinsky. 1955. “An Evaluation of the 1954 Poliomyelitis Vaccine Trials.” American Journal of Public Health and the Nation’s Health 45 (5 Pt 2): 1–63.
Frank, B, MG Serpell, J Hughes, JNS Matthews, and Dilip Kapur. 2008. “Comparison of Analgesic Effects and Patient Tolerability of Nabilone and Dihydrocodeine for Chronic Neuropathic Pain: Randomised, Crossover, Double Blind Study.” BMJ 336 (7637): 199–201.
Freedman, Stephen B, Patricia C Parkin, Andrew R Willan, and Suzanne Schuh. 2011. Rapid versus standard intravenous rehydration in paediatric gastroenteritis: pragmatic blinded randomised clinical trial.” BMJ 343: 1–12.
Futier, Emmanuel, Samir Jaber, Matthias Garot, Marie Vignaud, Yves Panis, Karem Slim, Jean-Christophe Lucet, et al. 2022. “Effect of Oral Antimicrobial Prophylaxis on Surgical Site Infection After Elective Colorectal Surgery: Multicentre, Randomised, Double Blind, Placebo Controlled Trial.” BMJ, November, e071476. https://doi.org/10.1136/bmj-2022-071476.
Gail, M, and Richard Simon. 1985. “Testing for Qualitative Interactions Between Treatment Effects and Patient Subsets.” Biometrics 41 (2): 361–72.
Giesecke, J. 2002. Modern Infectious Disease Epidemiology. Second. London: Hodder Arnold.
Glas, Afina S., Jeroen G. Lijmer, Martin H. Prins, Gouke J. Bonsel, and Patrick M. M. Bossuyt. 2003. “The Diagnostic Odds Ratio: A Single Indicator of Test Performance.” Journal of Clinical Epidemiology 56 (11): 1129–35. https://doi.org/10.1016/S0895-4356(03)00177-X.
Hackshaw, AK. 1998. “Lung Cancer and Passive Smoking.” Statistical Methods in Medical Research 7 (2): 119–36.
Heal, Clare F, Petra G Buettner, Robert Cruickshank, David Graham, Sheldon Browning, Jayne Pendergast, Herwig Drobetz, Robert Gluer, and Carl Lisec. 2009. Does single application of topical chloramphenicol to high risk sutured wounds reduce incidence of wound infection after minor surgery? Prospective randomised placebo controlled double blind trial.” BMJ 338: 1–6.
Held, Leonhard, and Daniel Sabanés Bové. 2020. Likelihood and Bayesian Inference - With Applications in Biology and Medicine. Second. Springer.
Hernán, Miguel A., Wei Wang, and David E. Leaf. 2022. “Target Trial Emulation: A Framework for Causal Inference from Observational Data.” JAMA 328 (24): 2446. https://doi.org/10.1001/jama.2022.21383.
Hijazi, Ramzi, David Taylor, and Joanna Richardson. 2009. Effect of topical alkane vapocoolant spray on pain with intravenous cannulation in patients in emergency departments: randomised double blind placebo controlled trial.” BMJ 338: 1–5.
Hill, Austin Bradford. 1937. Principles of Medical Statistics. London: The Lancet.
Hochberg, Y. 1988. “A Sharper Bonferroni Procedure for Multiple Tests of Significance.” Biometrika 75: 800–802.
Holland, Anne E, Ajay Mahal, Catherine J Hill, Annemarie L Lee, Angela T Burge, Narelle S Cox, Rosemary Moore, et al. 2017. Home-based rehabilitation for COPD using minimal resources: a randomised, controlled equivalence trial.” Thorax 72 (1): 57–65.
Hommel, G. 1988. “A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test.” Biometrika 75: 383–86.
Hond, Anne A H de, Ewout W Steyerberg, and Ben van Calster. 2022. “Interpreting Area Under the Receiver Operating Characteristic Curve.” The Lancet Digital Health 4 (12): e853–55. https://doi.org/10.1016/s2589-7500(22)00188-1.
Hopewell, Sally, Isabelle Boutron, An-Wen Chan, Gary S. Collins, Jennifer A. de Beyer, Asbjørn Hróbjartsson, Camilla Hansen Nejstgaard, et al. 2022. “An Update to SPIRIT and CONSORT Reporting Guidelines to Enhance Transparency in Randomized Trials.” Nature Medicine 28 (9): 1740–43. https://doi.org/10.1038/s41591-022-01989-8.
Hosmer, David W, and Stanley Lemeshow. 1989. Applied Logistic Regression. John Wiley.
ICH Expert Working Group. 1999. “Statistical Principles for Clinical Trials.” Stat Med 18: 1905–42.
Ikeda, Masayuki, Takashi Matsunaga, Noritsugu Irabu, and Shohji Yoshida. 2002. Using vital signs to diagnose impaired consciousness: cross sectional observational study.” BMJ 325: 1–5.
Islami, Farhad, Akram Pourshams, Dariush Nasrollahzadeh, Farin Kamangar, Saman Fahimi, Ramin Shakeri, Behnoush Abedi-Ardekani, et al. 2009. Tea drinking habits and oesophageal cancer in a high risk area in northern Iran: population based case-control study.” BMJ 338: 1–6.
James, Janet, Peter Thomas, David Cavan, and David Kerr. 2004. Preventing childhood obesity by reducing consumption of carbonated drinks: cluster randomised controlled trial.” BMJ 328 (7450).
Jen, Hsiao-Hsuan, Wei-Jung Chang, Chen-Yang Hsu, Amy Ming-Fang Yen, Anssi Auvinen, Tony Hsiu-Hsi Chen, and Sam Li-Sheng Chen. 2020. “Sojourn-Time-Corrected Receiver Operating Characteristic Curve (ROC) for Prostate Specific Antigen (PSA) Test in Population-Based Prostate Cancer Screening.” Scientific Reports 10 (1). https://doi.org/10.1038/s41598-020-77668-w.
Johnson, Valen E. 2013. “Revised Standards for Statistical Evidence.” Proceedings of the National Academy of Sciences 110 (48): 19313–17. https://doi.org/10.1073/pnas.1313476110.
Kaur, Gunjeet, P. V. M. Lakshmi, Ashu Rastogi, Anil Bhansali, Sanjay Jain, Yot Teerawattananon, Henna Bano, and Shankar Prinja. 2020a. “Forest Plot of HbA1c 6.5.” Figure 2 in: Diagnostic accuracy of tests for type 2 diabetes and prediabetes: A systematic review and meta-analysis. https://doi.org/10.1371/journal.pone.0242415.g002.
———. 2020b. “Diagnostic Accuracy of Tests for Type 2 Diabetes and Prediabetes: A Systematic Review and Meta-Analysis.” Edited by Stephen L. Atkin. PLOS ONE 15 (11): e0242415. https://doi.org/10.1371/journal.pone.0242415.
Kerry, Sally M, and J Martin Bland. 1998a. Statistics Notes: Analysis of a trial randomised in clusters.” BMJ 316: 54.
———. 1998b. Statistics Notes: Sample size in cluster randomisation.” BMJ 316 (7130): 549.
———. 1998c. Statistics Notes: The intracluster correlation coefficient in cluster randomisation.” BMJ 316: 1455.
Kinmonth, Ann Louise, Alison Woodcock, Simon Griffin, Nicki Spiegal, and Michael J Campbell. 1998. “Randomised Controlled Trial of Patient Centred Care of Diabetes in General Practice: Impact on Current Wellbeing and Future Disease Risk.” BMJ 317 (7167): 1202–8.
Kirkwood, Betty R, and Jonathan A C Sterne. 2003. Essential Medical Statistics. Second. Blackwell Publishing.
Lee, Katherine J, Margarita Moreno‐Betancur, Jessica Kasza, Ian C Marschner, Adrian G Barnett, and John B Carlin. 2019. “Biostatistics: A Fundamental Discipline at the Core of Modern Health Data Science.” Medical Journal of Australia 211 (10): 444. https://doi.org/10.5694/mja2.50372.
Little, Paul, Jane Barnett, Lucy Barnsley, Jean Marjoram, Alex Fitzgerald-Barron, and David Mant. 2002. Comparison of agreement between different measures of blood pressure in primary care and daytime ambulatory blood pressure.” BMJ 325: 1–6.
Lovell, Karina, Debbie Cox, Gillian Haddock, Christopher Jones, David Raines, Rachel Garvey, Chris Roberts, and Sarah Hadley. 2006. “Telephone Administered Cognitive Behaviour Therapy for Treatment of Obsessive Compulsive Disorder: Randomised Controlled Non-Inferiority Trial.” BMJ 333 (7574): 883.
Marion, Joe D., and Andrew D. Althouse. 2023. “The Use of Historical Controls in Clinical Trials.” JAMA 330 (15): 1484. https://doi.org/10.1001/jama.2023.16182.
Matthews, Anthony A, Goodarz Danaei, Nazrul Islam, and Tobias Kurth. 2022. “Target Trial Emulation: Applying Principles of Randomised Trials to Observational Studies.” BMJ, August, e071108. https://doi.org/10.1136/bmj-2022-071108.
Matthews, John N. S. 2006. Introduction to Randomized Controlled Clinical Trials. Second. Chapman & Hall/CRC.
Matthews, John NS, and Douglas G Altman. 1996. Statistics Notes: Interaction 2: compare effect sizes not P values.” BMJ 313 (7060): 808.
Meinertz, Thomas, Martin Kasper Wolfgang Schumacher, and Hanjörg Just. 1988. The German multicenter trial of anisoylated plasminogen streptokinase activator complex versus heparin for acute myocardial infarction.” Am J Card 62 (7): 347–51.
Mirpour, Shahriar, Sara Fathollah, Parvin Mansouri, Bagher Larijani, Mahmood Ghoranneviss, Mohammadreza Mohajeri Tehrani, and Mohammad Reza Amini. 2020. “Cold Atmospheric Plasma as an Effective Method to Treat Diabetic Foot Ulcers: A Randomized Clinical Trial.” Scientific Reports 10 (1). https://doi.org/10.1038/s41598-020-67232-x.
Moher, D, S Hopewell, KF Schulz, V Montori, PC Gøtzsche, PJ Devereaux, D Elbourne, M Egger, and DG Altman. 2010. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials.” BMJ 340.
Nab, Linda, Maarten van Smeden, Ruth H. Keogh, and Rolf H. H. Groenwold. 2021. “Mecor: An r Package for Measurement Error Correction in Linear Regression Models with a Continuous Outcome.” Computer Methods and Programs in Biomedicine 208 (September): 106238. https://doi.org/10.1016/j.cmpb.2021.106238.
Newcombe, R. G. 2013. Confidence Intervals for Proportions and Related Measures of Effect Size. Boca Ration, FL: Chapman & Hall/CRC.
Newcombe, Robert G. 1998a. Improved confidence intervals for the difference between binomial proportions based on paired data.” Stat Med 17 (22): 2635–50.
———. 1998b. Interval estimation for the difference between independent proportions: Comparison of eleven methods.” Stat Med 17 (8): 873–90.
O’Brien, P. C. 1984. “Procedures for Comparing Samples with Multiple Endpoints.” Biometrics 40: 1079–87.
Oakeshott, P., S. M. Kerry, and J. E. Williams. 1994. “Randomized Controlle Trial of the Effect of the Royal College of Radiologists’ Guidelines on General Practitioners’ Referral for Radiographic Examination.” British Journal of General Practice 44: 197–200. https://doi.org/10.1046/j.1365-2125.2001.01382.x.
Oxford Population Health. 1996. “The Million Women Study.”
Pan, Guohua, and Douglas A Wolfe. 1997. “Test for Qualitative Interaction of Clinical Significance.” Statistics in Medicine 16 (14): 1645–52.
Pepe, Margaret S. 2003. The Statistical Evaluation of Medical Tests for Classification and Prediction. Oxford University Press.
Pinkney, Thomas D, Melanie Calvert, David C Bartlett, Adrian Gheorghe, Val Redman, George Dowswell, William Hawkins, et al. 2013. Impact of wound edge protection devices on surgical site infection after laparotomy: multicentre randomised controlled trial (ROSSINI Trial).” BMJ 347.
Polack, Fernando P., Stephen J. Thomas, Nicholas Kitchin, Judith Absalon, Alejandra Gurtman, Stephen Lockhart, John L. Perez, et al. 2020. “Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine.” New England Journal of Medicine 383 (27): 2603–15. https://doi.org/10.1056/nejmoa2034577.
Porto, Ana Maria Feitosa, Isabela Cristina Coutinho, Jailson Barros Correia, and Melania Maria Ramos Amorim. 2011. Effectiveness of antenatal corticosteroids in reducing respiratory disorders in late preterm infants: randomised clinical trial.” BMJ 342.
Puhan, Milo A, Alex Suarez, Christian Lo Cascio, Alfred Zahn, Markus Heitz, and Otto Braendli. 2005. Didgeridoo playing as alternative treatment for obstructive sleep apnoea syndrome: randomised controlled trial.” BMJ 332: 1–5.
Ravaud, P, RM Flipo, I Boutron, C Roy, A Mahmoudi, B Giraudeau, and T Pham. 2009. ARTIST (osteoarthritis intervention standardized) study of standardised consultation versus usual care for patients with osteoarthritis of the knee in primary care in France: pragmatic randomised controlled trial.” BMJ 338.
Redelmeier, Donald A, and Sheldon M Singh. 2001. “Survival in Academy Award–Winning Actors and Actresses.” Annals of Internal Medicine 134 (10): 955–62.
Roldán Nofuentes, José Antonio, Juan de Dios Luna del Castillo, and Miguel Ángel Montero Alonso. 2012. “Global Hypothesis Test to Simultaneously Compare the Predictive Values of Two Binary Diagnostic Tests.” Computational Statistics &Amp; Data Analysis 56 (5): 1161–73. https://doi.org/10.1016/j.csda.2011.06.003.
Rothwell, Peter M. 2005. “Subgroup Analysis in Randomised Controlled Trials: Importance, Indications, and Interpretation.” The Lancet 365 (9454): 176–86.
Sackett, David L, Jonathan J Deeks, and Douglas G Altman. 1996. “Down with Odds Ratios!” BMJ Evidence-Based Medicine 1 (6): 164–66.
Schmid, C. H., T. Stijnen, and I. R. White. 2021. Handbook of Meta-Analysis. Boca Raton FL, USA: CRC Press.
Schulz, KF, DG Altman, and D for the CONSORT Group Moher. 2010. CONSORT 2010 Statement: updated guidelines for reporting parallel group randomised trials.” BMJ 340.
Senn, S. 2002. Cross-over Trials in Clinical Research. Second. Ney York: Wiley.
———. 2021. Statistical Issues in Drug Development. Third. Ney York: Wiley.
Senn, Stephen. 2011. Francis Galton and regression to the mean.” Significance 8 (3): 124–26.
Simes, R. J. 1986. “An Improved Bonferroni Procedure for Multiple Tests of Significance.” Biometrika 73: 751–54.
Spector, T D, R W Keen, N K Arden, N A Morrison, P J Major, T V Nguyen, P J Kelly, et al. 1995. Influence of vitamin D receptor genotype on bone mineral density in postmenopausal women: a twin study in Britain.” BMJ 310: 1357–60.
Stolberg, M. 2006. “Inventing the Randomized Double-Blind Trial: The Nuremberg Salt Test Of1835.” Journal of the Royal Society of Medicine 99 (12): 642–43. https://doi.org/10.1258/jrsm.99.12.642.
Streptomycin in Tuberculosis Trials Committee. 1948. “Stretomycin Treatment of Pulmonary Tuberculosis.” British Medical Journal 2 (October): 769–82.
Sylvestre, Marie-Pierre, Ella Huszti, and James A Hanley. 2006. “Do Oscar Winners Live Longer Than Less Successful Peers? A Reanalysis of the Evidence.” Annals of Internal Medicine 145 (5): 361–63.
Todd, Susan, Anne Whitehead, Nigel Stallard, and John Whitehead. 2001. “Interim Analyses and Sequential Designs in Phase III Studies.” British Journal of Clinical Pharmacology 51 (5): 394–99. https://doi.org/10.1046/j.1365-2125.2001.01382.x.
Turck, Natacha, Laszlo Vutskits, Paola Sanchez-Pena, Xavier Robin, Alexandre Hainard, Marianne Gex-Fabry, Catherine Fouda, et al. 2010. A multiparameter panel method for outcome prediction following aneurysmal subarachnoid hemorrhage.” Intensive Care Med 36: 107–15.
Vadillo-Ortega, Felipe, Otilia Perichart-Perera, Salvador Espino, Marco Antonio Avila-Vergara, Isabel Ibarra, Roberto Ahued, Myrna Godines, Samuel Parry, George Macones, and Jerome F Strauss. 2011. Effect of supplementation during pregnancy with L-arginine and antioxidant vitamins in medical food on pre-eclampsia in high risk population: randomised controlled trial.” BMJ 342.
Van den Aardweg, MTA, CWB Boonacker, MM Rovers, AW Hoes, and AGM Schilder. 2011. Effectiveness of adenoidectomy in children with recurrent upper respiratory tract infections: open randomised controlled trial.” BMJ 343: 1–13.
Vickers, Andrew J, and Douglas G Altman. 2001. Statistics Notes: Analysing controlled trials with baseline and follow up measurements.” BMJ 323 (7321): 1123–24.
Wen, Li Ming, Louise A Baur, Judy M Simpson, Chris Rissel, Karen Wardle, and Victoria M Flood. 2012. Effectiveness of home based early intervention on children’s BMI at age 2: randomised controlled trial.” BMJ 344: 1–11.
Wieand, Steven, Mitchell H. Gail, Brian R. James, and Kathleen L. James. 1989. “A Family of Nonparametric Statistics for Comparing Diagnostic Markers with Paired or Unpaired Data.” Biometrika 76 (3): 585–92.
Zar, Heather J, Mark F Cotton, Stanzi Strauss, Janine Karpakis, Gregory Hussey, H Simon Schaaf, Helena Rabie, and Carl J Lombard. 2007. “Effect of Isoniazid Prophylaxis on Mortality and Incidence of Tuberculosis in Children with HIV: Randomised Controlled Trial.” BMJ 334 (7585): 136.