3.15 SQL at scale: Strategy
- General Strategy: Store data in data warehouse
- Pass subset of data from warehouse to R
- Transform R code and pass to warehouse
- Normally you use different packages for that communication (
dplyr
,DBI
,RHadoop
,SparkR
)
- Many many data warehouse solutions
- We’ll have a look at…
- Google BigQuery: https://cloud.google.com/bigquery/
- Google Cloud Storage: https://cloud.google.com/storage/