3 Data collection
Sample data should be always organized in matrix form, with observations presented in rows and variables in columns, and saved in a file format such as .xlsx, .txt or .csv
Most common data issues:
- Missing values (NA)
- Measurement errors (collected data may not always present true values)
- Outliers (extreme values above or below the mean)
- Raw data are usually transformed:
- Taking the logs, squares, inverse values, square roots, …
- Seasonally and/or calendar adjusted
- First differences are sometimes required as well as lagged values
- Deflating nominal values