3.1 Data structure
Sample data which consists of multiple units (\(i=1,~2,~3,...,n\)) observed at a single point in time or one time interval, represents cross-sectional data or spatial data
Conversely, if the values of the variables for a single unit are observed over time (\(t=1,~2,~3,...,T\)) these are known as time-series or historical data
Data observed for same multiple units across same multiple time periods are known as panel data, which are indexed by two subscripts \(i\) and \(t\), where the total number of observations is \(n\times T\).
|
|
Exercise 10. According to above tables in which year the cross-sectional data are observed? For which country the time-series data are observed?
Solution
BBy finding a matching pair of life expectancy and poverty rate [\(76.7, ~ 20.9\)], we conclude that the cross-sectional data were observed in the year 2021, while the time-series data pertain to Croatia.Country | Year | Life expec. | poverty rate |
---|---|---|---|
Bulgaria | 2013 | 74.9 | NA |
Bulgaria | 2014 | 74.5 | NA |
Bulgaria | 2015 | 74.7 | 43.3 |
Bulgaria | 2016 | 74.9 | 41 |
Bulgaria | 2017 | 74.8 | 38 |
Bulgaria | 2018 | 75 | 33 |
Bulgaria | 2019 | 75.1 | 33.2 |
Bulgaria | 2020 | 73.6 | 33.5 |
Bulgaria | 2021 | 71.4 | 31.7 |
Bulgaria | 2022 | 74.3 | 32.2 |
Czech | 2013 | 78.3 | NA |
Czech | 2014 | 78.9 | NA |
Czech | 2015 | 78.7 | 13 |
Czech | 2016 | 79.1 | 12.4 |
Czech | 2017 | 79.1 | 12.1 |
Czech | 2018 | 79.1 | 11.8 |
Czech | 2019 | 79.3 | 12.1 |
Czech | 2020 | 78.2 | 11.5 |
Czech | 2021 | 77.2 | 10.7 |
Czech | 2022 | 79.1 | 11.8 |
… | … | … | … |
… | … | … | … |
Panel data are repeated measurements of the same cross-sectional units
If there are missing values, they are referred to as unbalanced panel data