Introduction to Regression Methods for Public Health Using R
2023-01-25
Welcome
Thanks for visiting!
This is a FIRST DRAFT and is not yet peer reviewed. The goal is publication as a printed version (through CRC Press) with the online version remaining freely available. I am hopeful the review and editing will be complete by the end of 2023.
Status
- Chapters 1-7: Sent out for peer review
- Chapter 8: Proofing the first draft prior to peer review
- Chapters 9: First draft
If you have any comments or suggestions, feel free to contact me at ramzi.nahhas@wright.edu. Thank you!
Use

Introduction to Regression Methods for Public Health using R by Ramzi W. Nahhas is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Preface
This text is suitable as a second biostatistics course for Master of Public Health students or public health professionals. Almost all public health students take an introductory biostatistics course, providing foundational competencies but perhaps not enough to use more advanced methods without additional training. There are a plethora of textbooks covering topics such as linear regression, logistic regression, and survival analysis. Many texts, however, are aimed at those with a background in mathematical statistics and/or without a focus specifically on public health and/or without a focus on using R. The goal of this text is to bridge that gap – to provide a gentle introduction to regression methods, using R, that covers all the basics and a bit more with examples drawn from public health data. There are excellent texts that cover each of the methods covered herein, as well as R programming, in much greater detail than is provided here (e.g., Faraway 2016; Fox 2015; Fox and Weisberg 2019; Harrell 2015; Klein and Moeschberger 2010; Kleinman and Horton 2014; van Buuren 2018; Weisberg 2013; Wickham and Grolemund 2017; Wickham 2019). My hope is that for any who feel such texts are currently beyond your reach, what you learn here will give you the knowledge and skills to understand and carry out appropriate basic regression analyses and the foundation and confidence to go deeper.
Software information and conventions
The knitr package (Xie 2015) and the bookdown package (Xie 2022) were used to compile this text. The R session information is shown below:
::session_info() xfun
## R version 4.0.5 (2021-03-31)
## Platform: x86_64-w64-mingw32/x64 (64-bit)
## Running under: Windows 10 x64 (build 19044)
##
## Locale:
## LC_COLLATE=English_United States.1252
## LC_CTYPE=English_United States.1252
## LC_MONETARY=English_United States.1252
## LC_NUMERIC=C
## LC_TIME=English_United States.1252
##
## Package version:
## base64enc_0.1.3 bookdown_0.27 bslib_0.4.0
## cachem_1.0.6 cli_3.3.0 compiler_4.0.5
## digest_0.6.29 evaluate_0.15 fastmap_1.1.0
## fs_1.5.2 glue_1.6.2 graphics_4.0.5
## grDevices_4.0.5 highr_0.9 htmltools_0.5.3
## jquerylib_0.1.4 jsonlite_1.8.0 knitr_1.39
## magrittr_2.0.3 memoise_2.0.1 methods_4.0.5
## R6_2.5.1 rappdirs_0.3.3 rlang_1.0.4
## rmarkdown_2.14 rstudioapi_0.13 sass_0.4.2
## stats_4.0.5 stringi_1.7.8 stringr_1.4.0
## tinytex_0.40 tools_4.0.5 utils_4.0.5
## xfun_0.31 yaml_2.3.5
Package names and inline code are formatted in a typewriter font (e.g., knitr
, lm(Y ~ X)
), and function names are followed by parentheses (e.g., lm()
).