4.8 Exercise

  1. Go to the USAFacts COVID-19 data page, download the “Cases” and “County Populations” datasets, and import them into R. The download links should be in the “DOWNLOAD DATA” section of the page's right sidebar.

  2. List the names of the columns in each dataset. How are these data structured? Which column names do the two datasets have in common?

  3. Merge “Cases” with “County Populations”. Which variable or variables should serve as the merge key? Are there columns common to both datasets that you want to drop from, or rename in, one dataset before merging to avoid duplication? Are there rows you want to exclude?

  4. For the county you live in, what is the total number of cases to date? What is the total number of cases per 100,000 population?
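
If you are unsure how to start on steps 2 to 4, the sketch below walks through the same workflow on a tiny made-up dataset using base R only. Everything in it is an assumption for illustration: the column names (`fips`, `county`, `state`, `day1`, `day2`, `population`), the values, the placeholder row with `fips` equal to 0, and the idea that cases are stored as cumulative counts with one column per date. Check each of these against the files you actually downloaded before adapting the code.

```r
# Toy stand-ins for the two downloads. All names and values are hypothetical.
cases <- data.frame(
  fips   = c(0, 1001, 1003),
  county = c("Unallocated", "County A", "County B"),
  state  = c("AL", "AL", "AL"),
  day1   = c(0, 10, 20),   # assumed cumulative cases on the first date
  day2   = c(1, 15, 40)    # assumed cumulative cases on the latest date
)
pop <- data.frame(
  fips       = c(0, 1001, 1003),
  county     = c("Unallocated", "County A", "County B"),
  state      = c("AL", "AL", "AL"),
  population = c(0, 50000, 200000)
)

# Step 2: column names the two tables share
shared <- intersect(names(cases), names(pop))

# Step 3: drop rows that are not real counties, and keep only the key and
# the new information from one table so shared columns are not duplicated
cases <- cases[cases$fips != 0, ]
pop   <- pop[pop$fips != 0, c("fips", "population")]
merged <- merge(cases, pop, by = "fips")

# Step 4: with cumulative counts, the latest date column is the total to date
merged$total    <- merged$day2
merged$per_100k <- merged$total / merged$population * 1e5

merged[merged$county == "County A", c("county", "total", "per_100k")]
```

Merging on a numeric county identifier rather than on the county name is one reasonable choice here, because county names can repeat across states. Whether that is the right key for the real data is part of what step 3 asks you to decide.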