Methods: Participants (6)

The items from STROBE state that you should report:
Cohort study
- Give the eligibility criteria, and the sources and methods of selection of participants. Describe methods of follow up
- For matched studies, give matching criteria and number of exposed and unexposed
Case-control study
- Give the eligibility criteria, and the sources and methods of case ascertainment and control selection. Give the rationale for the choice of cases and controls
- For matched studies, give matching criteria and the number of controls per case
Cross-sectional study
- Give the eligibility criteria, and the sources and methods of selection of participants

Some key items to consider adding:

- Define the unit analysed (person, family, twin pairs, department, school, etc.)
- Report the source of participants/clinical specimens (e.g., if the participants were a subset from a larger study)
- Clearly describe sampling frame and strategy
- Report inclusion and exclusion criteria (psychological, dietary/nutritional, physiological, clinical conditions) etc. especially if they might affect key indicators or surrogate endpoints (e.g., biomarkers)
- Clear deﬁnitions of exposed and nonexposed cohorts. Justify the choice of comparator
- Describe the conditions where subjects may change from one cohort to the other
- Describe whether treatment is restricted to new starts or encompasses all individuals with ongoing treatment
- Describe incentives for participation and recruitment
- Describe follow-up methods and timepoints of assessemnt of serial follow-up
- For matched studies, describe matching criteria and the reasons (epidemiological and clinical) for this criteria
- For matched studies, detail the number of matched individuals per subject (e.g, number of controls per case)

Explanation 6a

Detailed descriptions of the study participants help readers understand the applicability of the results. Investigators usually restrict a study population by defining clinical, demographic and other characteristics of eligible participants. Typical eligibility criteria relate to age, gender, diagnosis and comorbid conditions. Despite their importance, eligibility criteria often are not reported adequately. In a survey of observational stroke research, 17 of 49 reports (35%) did not specify eligibility criteria.(Tooth et al., 2005)

Eligibility criteria may be presented as inclusion and exclusion criteria, although this distinction is not always necessary or useful. Regardless, we advise authors to report all eligibility criteria and also to describe the group from which the study population was selected (eg, the general population of a region or country), and the method of recruitment (eg, referral or self-selection through advertisements).

Knowing details about follow-up procedures, including whether procedures minimized nonresponse and loss to follow-up and whether the procedures were similar for all participants, informs judgments about the validity of results. For example, in a study that used IgM antibodies to detect acute infections, readers needed to know the interval between blood tests for IgM antibodies so that they could judge whether some infections likely were missed because the interval between blood tests was too long. (Metzkor-Cotter et al., 2003) In other studies where follow-up procedures differed between exposed and unexposed groups, readers might recognize substantial bias due to unequal ascertainment of events or differences in nonresponse or loss to follow-up.(Johnson, 1990) Accordingly, we advise that researchers describe the methods used for following participants and whether those methods were the same for all participants, and that they describe the completeness of ascertainment of variables (see also item 14).

In case-control studies, the choice of cases and controls is crucial to interpreting the results, and the method of their selection has major implications for study validity. In general controls should reflect the population from which the cases arose. Various methods are used to sample controls, all with advantages and disadvantages: for cases that arise from a general population, population roster sampling, random digit dialling, neighborhood or friend controls are used. Neighborhood or friend controls may present intrinsic matching on exposure.(K. Rothman, 1998) Controls with other diseases may have advantages over population-based controls, in particular for hospital-based cases, because they better reflect the catchment population of a hospital, have greater comparability of recall and ease of recruitment. However, they can present problems if the exposure of interest affects the risk of developing or being hospitalized for the control condition(s). (Berkson, 1946; Feinstein et al., 1986) To remedy this problem often a mixture of the best defensible control diseases is used.(Jick & Vessey, 1978; Vandenbroucke et al., 2007)

Examples 6a

Cohort Studies
“Participants in the Iowa Women’s Health Study were a random sample of all women ages 55 to 69 years derived from the state of Iowa automobile driver’s license list in 1985, which represented approximately 94% of Iowa women in that age group. (. . .) Follow-up questionnaires were mailed in October 1987 and August 1989 to assess vital status and address changes. (. . .) Incident cancers, except for nonmelanoma skin cancers, were ascertained by the State Health Registry of Iowa (. . .). The Iowa Women’s Health Study cohort was matched to the registry with combinations of first, last, and maiden names, zip code, birthdate, and social security number.”(Canto et al., 2000)

Case-Control Studies
“Cutaneous melanoma cases diagnosed in 1999 and 2000 were ascertained through the Iowa Cancer Registry(. . .). Controls, also identified through the Iowa Cancer Registry, were colorectal cancer patients diagnosed during the same time. Colorectal cancer controls were selected because they are common and have a relatively long survival, and because arsenic exposure has not been conclusively linked to the incidence of colorectal cancer.”(Beane Freeman et al., 2004)

Cross-Sectional Studies
“We retrospectively identified patients with a principal diagnosis of myocardial infarction (code 410) according to the International Classification of Diseases, 9th Revision, Clinical Modification, from codes designating discharge diagnoses, excluding the codes with a fifth digit of 2, which designates a subsequent episode of care (. . .). A random sample of the entire Medicare cohort with myocardial infarction from February 1994 to July 1995 was selected (. . .). To be eligible, patients had to present to the hospital after at least 30 minutes but less than 12 hours of chest pain and had to have ST-segment elevation of at least 1 mm on two contiguous leads on the initial electrocardiogram.”(Canto et al., 2000)

Explanation 6b

Matching is much more common in case-control studies, but occasionally, investigators use matching in cohort studies to make groups comparable at the start of follow-up. Matching in cohort studies presents fewer intricacies than with case-control studies. For example, it is not necessary to take the matching into account for the estimation of the relative risk. (Costanza, 1995) Because matching in cohort studies may increase statistical precision investigators might allow for the matching in their analyses and thus obtain narrower confidence intervals.

In case-control studies matching is done to increase a study’s efficiency by ensuring similarity in the distribution of variables between cases and controls, in particular the distribution of potential confounding variables. (Costanza, 1995; Stürmer & Brenner, 2002) Because matching can be done in various ways, with one or more controls per case, the rationale for the choice of matching variables and the details of the method used should be described. Commonly used forms of matching are frequency matching (also called group matching) and individual matching. In frequency matching, investigators choose controls so that the distribution of matching variables becomes identical or similar to that of cases. Individual matching involves matching one or several controls to each case. Although intuitively appealing and sometimes useful, matching in case-control studies has a number of disadvantages, is not always appropriate, and needs to be taken into account in the analysis (see box 2).

Even apparently simple matching procedures may be poorly reported. For example, authors may state that controls were matched to cases ‘within five years’, or using ‘five year age bands’. Does this mean that, if a case was 54 years old, the respective control needed to be in the five-year age band 50 to 54, or aged 49 to 59, which is within five years of age 54? If a wide (eg, 10-year) age band is chosen, there is a danger of residual confounding by age (see also [box 4][## Box 4. Grouping]), for example because controls may then be younger than cases on average.

Examples 6b

Cohort Studies
“For each patient who initially received a statin, we used propensity-based matching to identify one control who did not receive a statin according to the following protocol. First, propensity scores were calculated for each patient in the entire cohort on the basis of an extensive list of factors potentially related to the use of statins or the risk of sepsis. Second, each statin user was matched to a smaller pool of nonstatin-users by sex, age (plus or minus 1 year), and index date (plus or minus 3 months). Third, we selected the control with the closest propensity score (within 0.2 SD) to each statin user in a 1:1 fashion and discarded the remaining controls.”(Hackam et al., 2006)

Case-Control Studies
“We aimed to select five controls for every case from among individuals in the study population who had no diagnosis of autism or other pervasive developmental disorders (PDD) recorded in their general practice record and who were alive and registered with a participating practice on the date of the PDD diagnosis in the case. Controls were individually matched to cases by year of birth (up to 1 year older or younger), sex, and general practice. For each of 300 cases, five controls could be identified who met all the matching criteria. For the remaining 994, one or more controls was excluded . . .”(Smeeth et al., 2004)

Field-specific guidance

Rheumatology (Zavada et al., 2014)
- Describe whether treatment reﬂects ﬁrst start until ﬁrst stop of therapy or multiple treatment episodes. If the latter, discuss deﬁnition of duration of exposure and any implications for combining treatment intervals

Routinely collected health data (Benchimol et al., 2015)
- The methods of study population selection (such as codes or algorithms used to identify subjects) should be listed in detail. If this is not possible, an explanation should be provided
- Any validation studies of the codes or algorithms used to select the population should be referenced. If validation was conducted for this study and not published elsewhere, detailed methods and results should be provided
- If the study involved linkage of databases, consider use of a ﬂow diagram or other graphical display to demonstrate the data linkage process, including the number of individuals with linked data at each stage

Seroepidemiologic studies for influenza (Horby et al., 2017)
- For case-ascertained transmission studies, describe the method of case ascertainment and criteria for defining a “case”
- For household-or institution-based transmission studies, describe the definition of a household or the institution - For outbreak investigations involving serologic sampling, describe the setting in which the cases were identified, for example, village/residential setting, occupational workplace
- To aid the interpretation of seroepidemiologic studies of novel influenza A virus subtypes, the results from exposed populations should be compared with the results from unexposed populations. Efforts to validate the assay in virologically confirmed cases should be reported

Neonatal infections (Fitchett et al., 2016)
- State age of participants (eg, 0–27 days deﬁ nes neonates; day 0 as day of birth). Disaggregate neonatal data from that of older infants and from stillbirths
- Detail the range of gestational age for participants determined a priori to be included in the research, including a lower limit when applicable.
- Explain the methods used to determine gestational age (e.g., physical examination, last menstrual period, ultrasonography). If ultrasonography is used, detail the type (vaginal and/or abdominal) and consider describing the criteria used for determination of gestational age.

Response-driven sampling (White et al., 2015)
- Describe how participants were trained/instructed to recruit others, number of coupons issued per person, any time limits for referral
- Describe methods of seed selection and state number at start of study and number added later
- State if there was any variation in study procedures during data collection (e.g., changing numbers of coupons per recruiter, interruptions in sampling, or stopping recruitment chains)
- Report wording of personal network size question(s)

Veterinary epidemiology (O’Connor et al., 2016)
- Describe the sources and methods of selection for the owners/managers and for the animals, at each relevant level of organization
- Describe the eligibility criteria for the owners/ managers and for the animals, at each relevant level of organization

Resources

Do you know of any good guidance or resources related to this item? Suggest them via comments below, Twitter, GitHub, or e-mail.

References

Beane Freeman, L. E., Dennis, L. K., Lynch, C. F., Thorne, P. S., & Just, C. L. (2004). Toenail Arsenic Content and Cutaneous Melanoma in Iowa. American Journal of Epidemiology, 160(7), 679–687. https://doi.org/10.1093/aje/kwh267

Benchimol, E. I., Smeeth, L., Guttmann, A., Harron, K., Moher, D., Petersen, I., Sorensen, H. T., Elm, E. von, Langan, S. M., & Committee, R. W. (2015). The REporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement. PLOS Medicine, 12(10), e1001885. https://doi.org/10.1371/journal.pmed.1001885

Berkson, J. (1946). Limitations of the Application of Fourfold Table Analysis to Hospital Data. Biometrics Bulletin, 2(3), 47–53. https://doi.org/10.2307/3002000

Canto, J. G., Allison, J. J., Kiefe, C. I., Fincher, C., Farmer, R., Sekar, P., Person, S., & Weissman, N. W. (2000). Relation of Race and Sex to the Use of Reperfusion Therapy in Medicare Beneficiaries with Acute Myocardial Infarction. New England Journal of Medicine, 342(15), 1094–1100. https://doi.org/10.1056/NEJM200004133421505

Costanza, M. C. (1995). Matching. Preventive Medicine, 24(5), 425–433. https://doi.org/10.1006/pmed.1995.1069

Feinstein, A. R., Walter, S. D., & Horwitz, R. I. (1986). An analysis of Berkson’s bias in case-control studies. Journal of Chronic Diseases, 39(7), 495–504. https://doi.org/10.1016/0021-9681(86)90194-3

Fitchett, E. J. A., Seale, A. C., Vergnano, S., Sharland, M., Heath, P. T., Saha, S. K., Agarwal, R., Ayede, A. I., Bhutta, Z. A., Black, R., Bojang, K., Campbell, H., Cousens, S., Darmstadt, G. L., Madhi, S. A., Meulen, A. S.-t., Modi, N., Patterson, J., Qazi, S., … Lawn, J. E. (2016). Strengthening the Reporting of Observational Studies in Epidemiology for Newborn Infection (STROBE-NI): An extension of the STROBE statement for neonatal infection research. The Lancet Infectious Diseases, 16(10), e202–e213. https://doi.org/10.1016/S1473-3099(16)30082-2

Hackam, D. G., Mamdani, M., Li, P., & Redelmeier, D. A. (2006). Statins and sepsis in patients with cardiovascular disease: A population-based cohort analysis. The Lancet, 367(9508), 413–418. https://doi.org/10.1016/S0140-6736(06)68041-0

Horby, P. W., Laurie, K. L., Cowling, B. J., Engelhardt, O. G., Sturm-Ramirez, K., Sanchez, J. L., Katz, J. M., Uyeki, T. M., Wood, J., Van Kerkhove, M. D., & the CONSISE Steering Committee. (2017). CONSISE statement on the reporting of Seroepidemiologic Studies for influenza (ROSES-I statement): An extension of the STROBE statement. Influenza and Other Respiratory Viruses, 11(1), 2–14. https://doi.org/10.1111/irv.12411

Jick, H., & Vessey, M. P. (1978). CASE-CONTROL STUDIES IN THE EVALUATION OF DRUG-INDUCED ILLNESS. American Journal of Epidemiology, 107(1), 1–7. https://doi.org/10.1093/oxfordjournals.aje.a112502

Johnson, E. S. (1990). Bias on Withdrawing Lost Subjects from the Analysis at the Time of Loss, in Cohort Mortality Studies, and in Follow-Up Methods: Journal of Occupational and Environmental Medicine, 32(3), 250–254. https://doi.org/10.1097/00043764-199003000-00013

Metzkor-Cotter, E., Kletter, Y., Avidor, B., Varon, M., Golan, Y., Ephros, M., & Giladi, M. (2003). Long-Term Serological Analysis and Clinical Follow-Up of Patients with Cat Scratch Disease. Clinical Infectious Diseases, 37(9), 1149–1154. https://doi.org/10.1086/378738

O’Connor, A. M., Sargeant, J. M., Dohoo, I. R., Erb, H. N., Cevallos, M., Egger, M., Ersbøll, A. K., Martin, S. W., Nielsen, L. R., Pearl, D. L., Pfeiffer, D. U., Sanchez, J., Torrence, M. E., Vigre, H., Waldner, C., & Ward, M. P. (2016). Explanation and Elaboration Document for the STROBE-Vet Statement: Strengthening the Reporting of Observational Studies in Epidemiology – Veterinary Extension. Zoonoses and Public Health, 63(8), 662–698. https://doi.org/10.1111/zph.12315

Rothman, K. (1998). Types of Epidemiologic Studies. In Modern epidemiology (2nd ed., pp. 74–75). Lippincott Raven.

Smeeth, L., Cook, C., Fombonne, E., Heavey, L., Rodrigues, L. C., Smith, P. G., & Hall, A. J. (2004). MMR vaccination and pervasive developmental disorders: A case-control study. The Lancet, 364(9438), 963–969. https://doi.org/10.1016/S0140-6736(04)17020-7

Stürmer, T., & Brenner, H. (2002). Flexible Matching Strategies to Increase Power and Efficiency to Detect and Estimate Gene-Environment Interactions in Case-Control Studies. American Journal of Epidemiology, 155(7), 593–602. https://doi.org/10.1093/aje/155.7.593

Tooth, L., Ware, R., Bain, C., Purdie, D. M., & Dobson, A. (2005). Quality of Reporting of Observational Longitudinal Research. American Journal of Epidemiology, 161(3), 280–288. https://doi.org/10.1093/aje/kwi042

Vandenbroucke, J. P., Elm, E. von, Altman, D. G., Gotzsche, P. C., Mulrow, C. D., Pocock, S. J., Poole, C., Schlesselman, J. J., & Egger, M. (2007). Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): Explanation and Elaboration. Epidemiology, 18(6), 805–835. https://doi.org/10.1097/EDE.0b013e3181577511

White, R. G., Hakim, A. J., Salganik, M. J., Spiller, M. W., Johnston, L. G., Kerr, L., Kendall, C., Drake, A., Wilson, D., Orroth, K., Egger, M., & Hladik, W. (2015). Strengthening the Reporting of Observational Studies in Epidemiology for respondent-driven sampling studies: "STROBE-RDS" statement. Journal of Clinical Epidemiology, 68(12), 1463–1471. https://doi.org/10.1016/j.jclinepi.2015.04.002

Zavada, J., Dixon, W. G., & Askling, J. (2014). Launch of a checklist for reporting longitudinal observational drug studies in rheumatology: A EULAR extension of STROBE guidelines based on experience from biologics registries. Annals of the Rheumatic Diseases, 73(3), 628. https://doi.org/http://dx.doi.org/10.1136/annrheumdis-2013-204102