28.4 Case-Crossover Study Design
A case-crossover study is an observational epidemiological method designed primarily to assess the transient effects of acute exposures on the risk of sudden-onset outcomes. Introduced by Maclure (1991), this design utilizes individuals as their own controls, thus inherently adjusting for stable, within-subject confounders.
For example, consider assessing whether vigorous exercise increases the risk of myocardial infarction (MI) (Mittleman et al. 1993):
- Hazard Period: 1-hour preceding MI onset.
- Control Periods: Identical 1-hour windows on other days.
Applying conditional logistic regression, we estimate the odds ratio (OR) indicating how acute exercise influences MI risk.
Implementation in R: Z. Zhang (2016) and season
package
Advantages
- Self-Matching: Controls individual-level confounding, including genetics, demographics, and chronic health status (Maclure 1991).
- Efficiency: Requires fewer subjects due to within-person matching (Mittleman, Maclure, and Robins 1995).
- Applicability: Highly suitable for studying acute triggers (e.g., air pollution, physical exertion, drug use) (Lumley and Levy 2000).
Limitations
- Carryover Effects: Difficulties arise if exposure has prolonged effects (Maclure and Mittleman 2000).
- Time-Varying Confounding: Cannot inherently control for confounders changing within short timeframes unless explicitly modeled (Navidi 1998).
- Selection Bias: Incorrect selection of control periods can bias results (Lumley and Levy 2000).
28.4.1 Mathematical Foundations
The case-crossover design compares exposure status during a “hazard” or “case” period immediately preceding the event to exposure status during “control” periods when the event did not occur (Maclure and Mittleman 2000).
Formally, let:
- Xi,t denote the exposure status (binary or continuous) of individual i at time t.
- Yi,t indicate the occurrence (1) or absence (0) of an acute event at time t.
The probability of an event occurring at time t given exposure is modeled by a conditional logistic regression:
logit(P(Yi,t=1|Xi,t))=αi+βXi,t
Here:
αi is a subject-specific fixed effect that controls for all time-invariant confounding within subjects.
β is the log-odds ratio quantifying the exposure effect.
The conditional logistic regression explicitly considers matched strata (individual cases) and compares the distribution of exposure during the case and control periods within each individual:
L(β)=n∏i=1exp(βXi,case)exp(βXi,case)+∑j∈Ciexp(βXi,j)
- Xi,case represents exposure during the hazard period for case i.
- Xi,j represents exposure during control periods Ci for case i.
28.4.2 Selection of Control Periods
28.4.2.1 Bidirectional
Bidirectional (symmetrical) selection involves choosing control periods before and after the event equally (Navidi 1998):
Control Periods={t−k,t+k},k∈N
This method mitigates biases related to temporal trends in exposure.
28.4.2.2 Unidirectional
Unidirectional selection (either exclusively prior or subsequent to the event) might introduce biases if exposure trends are strong (Lumley and Levy 2000):
Control Periods={t−k},k∈N
28.4.3 Assumptions
- Transient Effect Assumption: Exposure effects are acute and short-lived (Maclure 1991).
- Exchangeability of Hazard and Control Periods: Conditional on individual fixed effects, control periods accurately represent exposure probabilities during hazard periods (Mittleman, Maclure, and Robins 1995).
- No Carryover Effects: Exposure in one period does not affect subsequent periods (Maclure and Mittleman 2000).