3 Data collection

  • Collected data should always be organized in matrix form, with observtions presented in rows and variables in columns, and saved in a file format such as .xlsx, .txt or .csv

  • Most common problem’s with data:

  1. Missing values (NA)
  2. Measurement errors (collected data may not always present true values)
  3. Outliers (extreme values above or below the mean)
  • Sample data are usually transformed:
  1. Taking the logs, squares, inverse values, square root, …
  2. Seasonally and/or calendary adjusted
  3. First differences are sometimes required as well as lagged values
  4. Deflating nominal values