3.15 SQL at scale: Strategy

  • General strategy: store the data in a data warehouse
    • Pass a subset of the data from the warehouse to R
    • Translate R code into queries and pass them to the warehouse
    • This communication is normally handled by dedicated packages (dplyr, DBI, RHadoop, SparkR)
  • Many data warehouse solutions exist
  • We’ll have a look at…
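
The general strategy above can be sketched as follows. This is a minimal illustration, not a definitive setup: it assumes the DBI, RSQLite, dplyr and dbplyr packages are installed, uses an in-memory SQLite database as a stand-in for a real warehouse, and the table `flights` with its columns `carrier` and `delay` is made up for the example.

```r
library(DBI)
library(dplyr)

# Stand-in "warehouse": an in-memory SQLite database (assumption for this sketch)
con <- dbConnect(RSQLite::SQLite(), ":memory:")

# Hypothetical example table; a real warehouse would already hold the data
dbWriteTable(con, "flights", data.frame(
  carrier = c("AA", "AA", "UA"),
  delay   = c(10, 30, 5)
))

# Route 1: pass only a subset of the data from the warehouse to R
subset_df <- dbGetQuery(con, "SELECT carrier, delay FROM flights WHERE delay > 5")

# Route 2: write dplyr code, let dbplyr translate it to SQL,
# and let the warehouse do the computation
flights_tbl <- tbl(con, "flights")
query <- flights_tbl %>%
  group_by(carrier) %>%
  summarise(mean_delay = mean(delay, na.rm = TRUE))

show_query(query)         # prints the SQL generated from the R code
result <- collect(query)  # only the aggregated result is pulled into R

dbDisconnect(con)
```

In both routes R only ever holds a small result, while the full table stays in the database.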