Notes

Our goal is not to teach you R, but to teach you the basic process of doing a Data Science project that many other programming languages like Java and Python can do. We use R in our lessons because:

  • we have to use something for examples;
  • it’s free, well-documented, and runs almost everywhere;
  • it has a large (and growing) user base among scientists; and
  • it has a large library of external packages available for performing diverse tasks.

But the two most important things are to use whatever language your colleagues are using, so you can share your work with them easily, and to use that language well. apparently. R is the most used language in Data Science.