Methods: Variables (7)

The items from STROBE state that you should report:
- Clearly define all outcomes, exposures, predictors, potential confounders, and effect modifiers. Give diagnostic criteria, if applicable.


Some key items to consider adding:
- The start and stop of any therapies or treatment
- The mean, median, and range for each exposure group
- The theoretical/conceptual rationale for the design of the intervention/ exposure
- The intervention/exposure described with sufficient detail to permit replication
- Description of potential confounders (other than epidemiological variables) and correlates
- For hypothesis-driven studies, the putative causal structure (consider a diagram like a directed acyclic graph)
- Sources of data and methods of assessment for each variable
- Comparability of assessment methods among groups and over time
- The level of organization at which each variable was measured

Explanation

Authors should define all variables considered for and included in the analysis, including outcomes, exposures, predictors, potential confounders and potential effect modifiers. Disease outcomes require adequately detailed description of the diagnostic criteria. This applies to criteria for cases in a case-control study, disease events during follow-up in a cohort study and prevalent disease in a cross-sectional study.

Clear definitions and steps taken to adhere to them are particularly important for any disease condition of primary interest in the study. For some studies, ‘determinant’ or ‘predictor’ may be appropriate terms for exposure variables and outcomes may be called ‘endpoints’. In multivariable models, authors sometimes use ‘dependent variable’ for an outcome and ‘independent variable’ or ‘explanatory variable’ for exposure and confounding variables. The latter is not precise as it does not distinguish exposures from confounders.

If many variables have been measured and included in exploratory analyses in an early discovery phase, consider providing a list with details on each variable in an appendix, additional table or separate publication. Of note, the International Journal of Epidemiology recently launched a new section with ‘cohort profiles’, that includes detailed information on what was measured at different points in time in particular studies. (Ebrahim, 2004; Walker et al., 2004) Finally, we advise that authors declare all ‘candidate variables’ considered for statistical analysis, rather than selectively reporting only those included in the final models (see also item 16a). (Vandenbroucke et al., 2007)


Examples

  • “Only major congenital malformations were included in the analyses. Minor anomalies were excluded according to the exclusion list of European Registration of Congenital Anomalies (EUROCAT). If a child had more than one major congenital malformation of one organ system, those malformations were treated as one outcome in the analyses by organ system (…) In the statistical analyses, factors considered potential confounders were maternal age at delivery and number of previous parities. Factors considered potential effect modifiers were maternal age at reimbursement for antiepileptic medication and maternal age at delivery” (Artama et al., 2006; Vandenbroucke et al., 2007)

Box 2. Matching

In any case-control study, sensible choices need to be made on whether to use matching of controls to cases, and if so, what variables to match on, the precise method of matching to use, and the appropriate method of statistical analysis. Not to match at all may mean that the distribution of some key potential confounders (e.g., age, sex) is radically different between cases and controls. Although this could be adjusted for in the analysis there could be a major loss in statistical efficiency.


The use of matching in case-control studies and its interpretation are fraught with difficulties, especially if matching is attempted on several risk factors, some of which may be linked to the exposure of prime interest (K. J. Rothman et al., 1998b; Szklo & Nieto, 2000). For example, in a case-control study of myocardial infarction and oral contraceptives nested in a large pharmaco-epidemiologic data base, with information about thousands of women who are available as potential controls, investigators may be tempted to choose matched controls who had similar levels of risk factors to each case of myocardial infarction. One objective is to adjust for factors that might influence the prescription of oral contraceptives and thus to control for confounding by indication. However, the result will be a control group that is no longer representative of the oral contraceptive use in the source population: controls will be older than the source population because patients with myocardial infarction tend to be older. This has several implications. A crude analysis of the data will produce odds ratios that are usually biased towards unity if the matching factor is associated with the exposure. The solution is to perform a matched or stratified analysis (see item 12d). In addition, because the matched control group ceases to be representative for the population at large, the exposure distribution among the controls can no longer be used to estimate the population attributable fraction (see Box 7) (Cole & MacMahon, 1971). Also, the effect of the matching factor can no longer be studied, and the search for well-matched controls can be cumbersome – making a design with a non-matched control group preferable because the non-matched controls will be easier to obtain and the control group can be larger. Overmatching is another problem, which may reduce the efficiency of matched case-control studies, and, in some situations, introduce bias. Information is lost and the power of the study is reduced if the matching variable is closely associated with the exposure. Then many individuals in the same matched sets will tend to have identical or similar levels of exposures and therefore not contribute relevant information. Matching will introduce irremediable bias if the matching variable is not a confounder but in the causal pathway between exposure and disease. For example, in vitro fertilization is associated with an increased risk of perinatal death, due to an increase in multiple births and low birth weight infants (Gissler & Hemminki, 1996). Matching on plurality or birth weight will bias results towards the null, and this cannot be remedied in the analysis.


Matching is intuitively appealing, but the complexities involved have led methodologists to advise against routine matching in case-control studies. They recommend instead a careful and judicious consideration of each potential matching factor, recognizing that it could instead be measured and used as an adjustment variable without matching on it. In response, there has been a reduction in the number of matching factors employed, an increasing use of frequency matching, which avoids some of the problems discussed above, and more case-control studies with no matching at all (Gefeller et al., 1998). Matching remains most desirable, or even necessary, when the distributions of the confounder (e.g., age) might differ radically between the unmatched comparison groups (Costanza, 1995; Stürmer & Brenner, 2002; Vandenbroucke et al., 2007)



Field-specific guidance

Anti-microbial stewardship programs (Tacconelli et al., 2016)
- Specify antimicrobial usage according to: type, dosage, duration and route of administration
- Provide information using defined daily dosages (DDDs) and, in addition, other definitions closer to local reality (packages, prescriptions). Provide justification for the measurement presented
- Address antimicrobial combinations
- Explain rationale for grouping of antimicrobials
- Define time at risk for antimicrobial exposure and for resistance development
- Provide definition of resistance, multidrug resistance, including pattern of co-resistance; whether studies performed to identify location or resistance eg, plasmid, chromosome, integron, transposon
- Definition of infection and/or colonisation. If not a validated reference, provide evidence of robustness of the new definition

Genetic association studies (Little et al., 2009)
- Clearly define genetic exposures (genetic variants) using a widely-used nomenclature system. Identify variables likely to be associated with population stratification (confounding by ethnic origin)

Medical Abortion (Creinin & Chen, 2016)
- Exposure: Detail the medications used, including dose(s) and route(s) of administration. If more than one medication is used, state the planned time interval between medications, preferably in hours
- Outcome: Define successful medical abortion (should most commonly be considered as successful expulsion of the intrauterine pregnancy without need for surgical intervention)
- Outcome: Define the types of medical abortion failure (e.g., ongoing pregnancy, incomplete abortion, participant symptoms). Continuing pregnancy should be defined as a viable pregnancy following treatment (to be differentiated from a non-viable [i.e., retained gestational sac]

Neonatal infections (Fitchett et al., 2016)
- State criteria used to define clinically significant organisms for each sample type

Nutritional data (Lachat et al., 2016)
- Clearly define foods, food groups, nutrients, or other food components
- When using dietary patterns or indices, describe the methods to obtain them and their nutritional properties

Seroepidemiologic studies for influenza (Horby et al., 2017)
- Describe illness definitions and methods for ascertaining the presence or absence of clinical illness in subjects
- Describe the potential for immunization (specify vaccine and timing of vaccination in relationship to collection of serum), if applicable, to affect the outcome measures
- Describe any known or potential immunological cross-reactivity that may bias the outcome measures

Response-driven sampling (White et al., 2015)
- State how recruiter–recruit relationship was tracked

Routinely collected health data (Benchimol et al., 2015)
- A complete list of codes and algorithms used to classify exposures, outcomes, confounders, and effect modifiers should be provided. If these cannot be reported, an explanation should be provided

Seroepidemiologic studies for influenza (Horby et al., 2017)
- Describe any known or potential immunological cross* reactivity that may bias the outcome measures

Resources

Do you know of any good guidance or resources related to this item? Suggest them via comments below, Twitter, GitHub, or e-mail.

References

Artama, M., Ritvanen, A., Gissler, M., Isojärvi, J., & Auvinen, A. (2006). Congenital structural anomalies in offspring of women with epilepsy—a population-based cohort study in Finland. International Journal of Epidemiology, 35(2), 280–287. https://doi.org/10.1093/ije/dyi234

Benchimol, E. I., Smeeth, L., Guttmann, A., Harron, K., Moher, D., Petersen, I., Sorensen, H. T., Elm, E. von, Langan, S. M., & Committee, R. W. (2015). The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement. PLOS Medicine, 12(10), e1001885. https://doi.org/10.1371/journal.pmed.1001885

Cole, P., & MacMahon, B. (1971). Attributable risk percent in case-control studies. British Journal of Preventive & Social Medicine, 25(4), 242–244. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC478665/

Costanza, M. C. (1995). Matching. Preventive Medicine, 24(5), 425–433. https://doi.org/10.1006/pmed.1995.1069

Creinin, M. D., & Chen, M. J. (2016). Medical abortion reporting of efficacy: The MARE guidelines. Contraception, 94(2), 97–103. https://doi.org/10.1016/j.contraception.2016.04.013

Ebrahim, S. (2004). Cohorts, infants and children. International Journal of Epidemiology, 33(6), 1165–1166. https://doi.org/10.1093/ije/dyh368

Fitchett, E. J. A., Seale, A. C., Vergnano, S., Sharland, M., Heath, P. T., Saha, S. K., Agarwal, R., Ayede, A. I., Bhutta, Z. A., Black, R., Bojang, K., Campbell, H., Cousens, S., Darmstadt, G. L., Madhi, S. A., Meulen, A. S.-t., Modi, N., Patterson, J., Qazi, S., … Lawn, J. E. (2016). Strengthening the Reporting of Observational Studies in Epidemiology for Newborn Infection (STROBE-NI): An extension of the STROBE statement for neonatal infection research. The Lancet Infectious Diseases, 16(10), e202–e213. https://doi.org/10.1016/S1473-3099(16)30082-2

Gefeller, O., Pfahlberg, A., Brenner, H., & Windeler, J. (1998). An empirical investigation on matching in published case-control studies. European Journal of Epidemiology, 14(4), 321–325. https://doi.org/10.1023/A:1007497104800

Gissler, M., & Hemminki, E. (1996). The danger of overmatching in studies of the perinatal mortality and birthweight of infants born after assisted conception. European Journal of Obstetrics & Gynecology and Reproductive Biology, 69(2), 73–75. https://doi.org/10.1016/0301-2115(95)02517-0

Horby, P. W., Laurie, K. L., Cowling, B. J., Engelhardt, O. G., Sturm-Ramirez, K., Sanchez, J. L., Katz, J. M., Uyeki, T. M., Wood, J., Van Kerkhove, M. D., & the CONSISE Steering Committee. (2017). CONSISE statement on the reporting of Seroepidemiologic Studies for influenza (ROSES-I statement): An extension of the STROBE statement. Influenza and Other Respiratory Viruses, 11(1), 2–14. https://doi.org/10.1111/irv.12411

Lachat, C., Hawwash, D., Ocké, M. C., Berg, C., Forsum, E., Hörnell, A., Larsson, C., Sonestedt, E., Wirfält, E., Åkesson, A., Kolsteren, P., Byrnes, G., De Keyzer, W., Van Camp, J., Cade, J. E., Slimani, N., Cevallos, M., Egger, M., & Huybrechts, I. (2016). Strengthening the Reporting of Observational Studies in Epidemiology—Nutritional Epidemiology (STROBE-nut): An Extension of the STROBE Statement. PLOS Medicine, 13(6), e1002036. https://doi.org/10.1371/journal.pmed.1002036

Little, J., Higgins, J. P. T., Ioannidis, J. P. A., Moher, D., Gagnon, F., Elm, E. von, Khoury, M. J., Cohen, B., Davey-Smith, G., Grimshaw, J., Scheet, P., Gwinn, M., Williamson, R. E., Zou, G. Y., Hutchings, K., Johnson, C. Y., Tait, V., Wiens, M., Golding, J., … Birkett, N. (2009). STrengthening the REporting of Genetic Association Studies (STREGA)— An Extension of the STROBE Statement. PLOS Med, 6(2), e1000022. https://doi.org/10.1371/journal.pmed.1000022

Rothman, K. J., Lash, T. L., & Greenland, S. (1998b). Matching. In Modern Epidemiology (2nd ed., pp. 147–161). LWW.

Shrier, I., & Platt, R. W. (2008). Reducing bias through directed acyclic graphs. BMC Medical Research Methodology, 8(1), 70. https://doi.org/10.1186/1471-2288-8-70

Students 4 Best Evidence. (2018). A beginner’s guide to confounding. In Students 4 Best Evidence. https://www.students4bestevidence.net/blog/2018/10/01/a-beginners-guide-to-confounding/

Stürmer, T., & Brenner, H. (2002). Flexible Matching Strategies to Increase Power and Efficiency to Detect and Estimate Gene-Environment Interactions in Case-Control Studies. American Journal of Epidemiology, 155(7), 593–602. https://doi.org/10.1093/aje/155.7.593

Szklo, M., & Nieto, J. (2000). Communicating results of epidemiologic studies (chapter 9). In Epidemiology, Beyond the Basics (pp. 408–430). Jones; Bartlett.

Tacconelli, E., Cataldo, M. A., Paul, M., Leibovici, L., Kluytmans, J., Schröder, W., Foschi, F., Angelis, G. D., Waure, C. D., Cadeddu, C., Mutters, N. T., Gastmeier, P., & Cookson, B. (2016). STROBE-AMS: Recommendations to optimise reporting of epidemiological studies on antimicrobial resistance and informing improvement in antimicrobial stewardship. BMJ Open, 6(2), e010134. https://doi.org/10.1136/bmjopen-2015-010134

Vandenbroucke, J. P., Elm, E. von, Altman, D. G., Gotzsche, P. C., Mulrow, C. D., Pocock, S. J., Poole, C., Schlesselman, J. J., & Egger, M. (2007). Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and Elaboration. Epidemiology, 18(6), 805–835. https://doi.org/10.1097/EDE.0b013e3181577511

VanderWeele, T. J. (2019). Principles of confounder selection. European Journal of Epidemiology, 34(3), 211–219. https://doi.org/10.1007/s10654-019-00494-6

Walker, M., Whincup, P., & Shaper, A. (2004). The British Regional Heart Study 1975-2004. 33, 1185–1192.

White, R. G., Hakim, A. J., Salganik, M. J., Spiller, M. W., Johnston, L. G., Kerr, L., Kendall, C., Drake, A., Wilson, D., Orroth, K., Egger, M., & Hladik, W. (2015). Strengthening the Reporting of Observational Studies in Epidemiology for respondent-driven sampling studies: "STROBE-RDS" statement. Journal of Clinical Epidemiology, 68(12), 1463–1471. https://doi.org/10.1016/j.jclinepi.2015.04.002