15.1 Fundamental rules!

  • Always save the data in multiple places (not only on your hard drive)
  • Always save the address or source of your data
  • Research must be reproducible, ideally even in 1,000 years!
  • Ideally, both the analysis code and the data are saved in one place (see e.g. the Harvard Dataverse Network)
  • Some journals now require that anyway (good!)
  • Prepare the replication materials immediately upon submission!
  • Lots of different packages for scraping/downloading data
    • Check them before you do it manually!
    • e.g. check out the initiatives rOpenSci and rOpenGov
  • Cache parts of the analysis when using techniques that involve randomness
    • e.g. topic models
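
The caching rule above could look roughly like the following minimal Python sketch. It assumes the slow, randomized step can be wrapped in a function that takes its seed as a parameter. The names `cached` and `fit_toy_model` are hypothetical and only serve the illustration; `fit_toy_model` is a stand-in for a real model fit such as a topic model, not an actual one.

```python
import hashlib
import json
import pickle
import random
import tempfile
from pathlib import Path


def cached(cache_dir, name, params, compute):
    """Return compute(**params), reusing a pickled result from disk if one exists."""
    # Key the cache file on the parameters (seed included),
    # so that changing any of them forces a fresh run.
    encoded = json.dumps(params, sort_keys=True).encode("utf-8")
    key = hashlib.sha256(encoded).hexdigest()[:12]
    path = Path(cache_dir) / f"{name}-{key}.pkl"
    if path.exists():
        with path.open("rb") as fh:
            return pickle.load(fh)
    result = compute(**params)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open("wb") as fh:
        pickle.dump(result, fh)
    return result


def fit_toy_model(n_docs, n_topics, seed):
    """Hypothetical stand-in for a slow, randomized step (e.g. a topic model)."""
    rng = random.Random(seed)  # local generator, so the seed fully determines the draw
    return [rng.randrange(n_topics) for _ in range(n_docs)]


# Assumption for the demo: a temporary directory; a real project would use
# a folder that is archived together with the code and the data.
cache_dir = tempfile.mkdtemp()
params = {"n_docs": 5, "n_topics": 3, "seed": 42}

first = cached(cache_dir, "toy_model", params, fit_toy_model)   # computed and stored
second = cached(cache_dir, "toy_model", params, fit_toy_model)  # loaded from disk
```

Storing the seed with the other parameters means a rerun from scratch should give the same draw, while the cached file protects against changes in library versions that alter the random stream. Pickle files should only be loaded from sources you trust.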