1 Usage
For both R scripts, open the file in RStudio, change the settings if needed, and run the function it contains.
R Scripts
- gdp_etl_pipeline.R: Contains one function which runs the entire ETL pipeline.
- gdp_etl_plots.R: Contains one function which generates comparison plots for all individual economies and groups of economies. The plots are exported to a pdf file.
Temporal Settings
These settings must be updated both in the R script gdp_etl_pipeline.R and in the documentation file index.Rmd.
- last_year: Last year. Equal to the estimated year if estimation is enabled.
- enable_estimate_last_year: Boolean variable which tells the script if the last year should be estimated using GDP deflators and CPI. Also enables Chapter 8 of this document.
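As a rough sketch, the two settings might look like this in gdp_etl_pipeline.R. The values shown are placeholders chosen for illustration, not the values of any actual release, and the same values would need to be entered in index.Rmd.

```r
# Placeholder values for illustration only; replace with the values of the current update.
last_year <- 2024                  # assumed example year
enable_estimate_last_year <- TRUE  # TRUE: estimate last_year using GDP deflators and CPI
```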
Working Directory
The scripts automatically set the working directory to the location of the active R script file in RStudio.
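One common way to get this behaviour is through the rstudioapi package. The snippet below is a minimal sketch under that assumption; the scripts themselves may do it differently. The variable name script_path is introduced here for illustration.

```r
# Assumption: the rstudioapi package is installed and the script is run from RStudio.
if (requireNamespace("rstudioapi", quietly = TRUE) && rstudioapi::isAvailable()) {
  # Path of the file open in the active RStudio editor tab ("" if it was never saved).
  script_path <- rstudioapi::getActiveDocumentContext()$path
  if (nzchar(script_path)) {
    setwd(dirname(script_path))
  }
}
getwd()  # shows the working directory now in effect
```

Outside RStudio the condition is false and the working directory is left unchanged, so it would have to be set manually with setwd().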
Economy Metadata
Files found in datadir:
- Dim_countries.csv: Contains economy codes and valid year ranges.
- Dim_Countries_Hierarchy_All.csv: Contains hierarchical definitions of groups of economies.
- lab_all.csv: Contains economy codes and labels.
- GDP growth rates.xlsx: Contains growth rates for all economies. (needed only when estimating values for the last year)
- US.GDPTotal_20250718_104458.csv: Last release of GDP dataset at UNCTADstat.
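Before running the pipeline, it can help to confirm that the input files listed above are in place. The following is a sketch only: find_missing_inputs is a hypothetical helper that is not part of the scripts, and the directory name "data" is a placeholder for the actual value of datadir. The timestamped UNCTADstat release file is left out of the check because its name changes with every release.

```r
# Sketch only: find_missing_inputs is a hypothetical helper, not part of the scripts.
find_missing_inputs <- function(datadir, estimate_last_year) {
  required <- c(
    "Dim_countries.csv",
    "Dim_Countries_Hierarchy_All.csv",
    "lab_all.csv"
  )
  # The growth-rate workbook is only needed when the last year is estimated.
  if (estimate_last_year) {
    required <- c(required, "GDP growth rates.xlsx")
  }
  # Return the names of the files that are not present in datadir.
  required[!file.exists(file.path(datadir, required))]
}

# "data" is a placeholder; use the value of datadir from the script.
missing_inputs <- find_missing_inputs("data", estimate_last_year = TRUE)
if (length(missing_inputs) > 0) {
  message("Missing input files: ", paste(missing_inputs, collapse = ", "))
}
```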
Output Files
Files found in outputdir:
- gdp_update.csv: An updated dataset in a generic format.
- gdp_update_usis.csv: An updated dataset in a specific format to be used by USIS for uploading to UNCTADstat.
- gdp_comparison.csv: A dataset containing old and new GDP values.
- GDP_comparison_groups.pdf: Comparison plots of all individual economies and groups of economies.
- estimated_with_cpi.csv: Economies for which the real GDP was estimated using CPI.