About

i2ds

This book is still being written and revised to teach the course Introduction to Data Science (using R, ADILT) at the University of Konstanz in 2024. It currently serves as a scaffold for the curriculum that will be filled with content as we go along.

Contents and audience

This book will eventually contain materials needed to teach a variety of introductory courses on data science for undergraduate students of various disciplines. The materials and examples are designed to engage and motivate students from different fields to apply computational tools to solve challenging problems. Hopefully, students from all backgrounds and levels of experience will welcome the summaries of essential commands and find solving the exercises both enjoyable and enlightening.

Potential courses

The materials covered in the parts and chapters of this book can flexibly arranged to support both basic and more advanced courses and curricula:

  • An introductory course would cover Parts 1 to 3, with selected chapters from Parts 4 and 5. Combining chapters of Parts 1,  2 and 3 could provide a basic introduction to data literacy and reproducible research that is using R for visualizing data, but would not be focusing on particular packages of the tidyverse.
  • Based on student needs and a course’s goals and scope, some of the more specialized chapters (e.g., in Parts 3 to 6) can first be skipped, but used as elements of more advanced curricula later. Potential courses could focus on data visualization (Part 3), on specific data types (Part 5), or on applications (Part 6).

Providing feedback

As this text is still being revised and data science is a dynamic field, it is likely that the current version contains some typos and mistakes. Please email me (as h.neth at uni.kn) to report any errors, possible improvements, or any other feedback or observations that you are willing to share.

Linking and citing

Everyone likes being linked or cited. Feel free to adopt this book or parts of it to your own purposes, but please acknowledge its use in your own work. As the structure of the book’s chapters and sections may change, links should only use the base URL https://bookdown.org/hneth/i2ds/:

  • Neth, H. (2024). i2ds: Introduction to Data Science.
    Social Psychology and Decision Sciences, University of Konstanz, Germany.
    Online textbook (version 0.2.0, July 16, 2024).
    Retrieved from https://bookdown.org/hneth/i2ds/.

The book has been started as a more basic and general version of Data Science for Psychologists (Neth, 2023a). The full reference of this companion book is:

The corresponding R package ds4psy is available at https://CRAN.R-project.org/package=ds4psy.

License

Creative Commons License

Introduction to data science (i2ds) by Hansjörg Neth is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

The HTML-version of this book uses Google Analytics to evaluate the popularity of its different chapters. The website does not collect any personal data of individual users.

Colophon

This book is still being written. Its current version was generated using R version 4.3.2 (2023-10-31) and the following packages:

  • base (4.3.2), bookdown (0.39), bslib (0.7.0), colorspace (2.1.0), datasets (4.3.2), dplyr (1.1.4), ds4psy (1.0.0.9009), FFTrees (2.0.0.9000), forcats (1.0.0), ggplot2 (3.5.1), graphics (4.3.2), grDevices (4.3.2), here (1.0.1), HistData (0.9.1), knitr (1.45), lubridate (1.9.3), methods (4.3.2), palmerpenguins (0.1.1), purrr (1.0.2), RColorBrewer (1.1.3), readr (2.1.5), riskyr (0.4.0.9016), rmarkdown (2.26), shiny (1.8.1.1), shinythemes (1.2.0), shinyWidgets (0.8.6), stats (4.3.2), stringr (1.5.1), tibble (3.2.1), tidyr (1.3.1), tidyverse (2.0.0), unicol (0.3.0.9003), unikn (1.0.0.9001), utils (4.3.2).

Thanks to all package authors and the R community for making this book possible!