Data Analytics Module
Lecturer: Hans van der Zwan
Research wk 03
Topic: statistical modelling Type: individual assignment

Dutch healthcare costs (1) Download the file with Dutch reimbursed healthcare costs per municipality in 2018: vektis2018.xlsx.
Each student is assigned a category of healthcare costs, see below.
Investigate whether the first digits of the costs in this column comply with Benford’s law.
Report a bar graph comparable with Figure 4 and a table with expected absolute frequencies (Fexp) according Benfords law and the observed absolute frequencies (Fobs) in the data set as well as the expected relative frequencies and the observed relative frquencies. Comment on what can be seen in the graph and the table.

Dutch healthcare costs (2) Create an overview with all Dutch Municipalities the total insured years, the total reimbursed healthcare costs and the average reimbursed healthcare costs per insured year.
Sort this table based on the average reimbursed healthcare costs. Create a histogram and a boxplot for the average reimbursed healthcare costs per insured year for the 355 municipalities.
Comment on the results.

Assigning Categories to Groups

Group 1: COSTS_SPECIALISTIC_HEALTHCARE; COSTS_PHARMACIE

Group 2: COSTS_GENPRACT_REGISTRATION_RATE; COSTS_GENPRACT_CONSULT COSTS_GENPRACT_MDZ

Group 3: COSTS_GENPRACT_OTHER; COSTS_TOOLS; COSTS_ORAL_CARE

Group 4: COSTS_PHYSIOTHERAPY; COSTS_PARAMEDICAL_CARE_OTHER; COSTS_HOSPITAL_TRANSPORT_SITTING

Group 5: COSTS_HOSPITAL_TRANSPORT_BED; COSTS_MATERNITY_CARE COSTS_OBSTETRIC_CARE