3 Data collection
Sample data should always be organized in matrix form, with observations in rows and variables in columns, and saved in a file format such as .xlsx, .txt or .csv.
Most common problems with data:
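As a minimal sketch of this layout, the snippet below reads a small .csv text into a header of variable names and a list of observation rows, using only the Python standard library. The sample values, the column names (`year`, `gdp`, `cpi`) and the helper `read_matrix` are hypothetical and serve only as illustration. It also assumes that missing entries are coded as the string `NA`.

```python
import csv
import io

# Hypothetical sample: each row is one observation (a year),
# each column is one variable.
raw = """year,gdp,cpi
2019,100.0,98.5
2020,NA,100.0
2021,104.2,103.1
"""

def read_matrix(text):
    """Parse CSV text into (variable names, list of observation rows)."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)  # first line holds the variable names
    rows = []
    for record in reader:
        # Assumption: "NA" marks a missing value; store it as None,
        # convert every other cell to float.
        rows.append([None if cell == "NA" else float(cell) for cell in record])
    return header, rows

variables, observations = read_matrix(raw)
n_obs, n_vars = len(observations), len(variables)

# Count missing cells across the whole matrix.
missing = sum(cell is None for row in observations for cell in row)
print(f"{n_obs} observations x {n_vars} variables, {missing} missing value(s)")
```

For a file on disk, the same function could be fed the file's contents, for example via `open(path, newline="").read()`.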
- Missing values (NA)
- Measurement errors (collected data may not always present true values)
- Outliers (extreme values lying far above or below the bulk of the data)
Sample data are usually transformed:
- Taking logs, squares, inverses, square roots, …
- Seasonal and/or calendar adjustment
- Taking first differences and, where required, lagged values
- Deflating nominal values
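Several of the transformations listed above can be sketched in a few lines of standard-library Python. The series `nominal` and `price_index` and all function names are hypothetical, chosen only for illustration. The deflation step assumes a price index with base value 100. Seasonal and calendar adjustment is not shown, since it normally relies on dedicated procedures.

```python
import math

# Hypothetical annual series: nominal values and a price index (base = 100).
nominal = [200.0, 210.0, 231.0, 242.55]
price_index = [100.0, 105.0, 110.0, 115.5]

def log_transform(series):
    """Natural logarithm of each (strictly positive) value."""
    return [math.log(x) for x in series]

def first_difference(series):
    """x_t - x_{t-1}; the first observation has no predecessor, so it is None."""
    return [None] + [series[t] - series[t - 1] for t in range(1, len(series))]

def lag(series, k=1):
    """Shift the series back by k periods (0 <= k <= len(series)), padding with None."""
    return [None] * k + series[: len(series) - k]

def deflate(nominal_values, index, base=100.0):
    """Convert nominal values to real values: nominal / index * base."""
    return [n / p * base for n, p in zip(nominal_values, index)]

real = deflate(nominal, price_index)
diff = first_difference(nominal)
lagged = lag(nominal, 1)
logs = log_transform(nominal)
```

Note that differencing and lagging each cost observations at the start of the sample. They are kept here as `None` so that every transformed series stays aligned with the original rows.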