Descriptive/causal inference vs. prediction
Learning outcomes/objective:
- Understand difference between descriptive/causal inference and prediction from a data perspective
- Clarification of different terminology: Inference; Prediction; Forecasting; Imputation; etc.
1 Inference (1): Descriptive inference
- Goal of descriptive inference: Estimate a parameter in a population
- e.g., Research question: What is the average of life satisfaction/unemployment among Mannheim University students?
- Q: Easy to find out? What could be the problem?
- e.g., Research question: What is the average of life satisfaction/unemployment among Mannheim University students?
- Table 1 displays our sample
- Assuming it were the population we could add a vector
that indicates whether someone in the population has been sampled (cf. Abadie et al. 2020)
- Assuming it were the population we could add a vector
2 Inference (2): Causal inference
- Goal of causal inference: Identify whether particular cause/treatment
has a causal effect on in a population- e.g., Research question: What is the causal effect of unemployment
on life satisfaction among Mannheim students?
- e.g., Research question: What is the causal effect of unemployment
3 Inference (3): Causal inference
Causal inference: Every-day notion of causality
formalized through potential outcomes framework (Rubin 1974, ~2012) , e.g.,- FPCI (Holland 1986): Either observe
or … missing data problem! - Usual focus on average treatment effect:
(or ATT)
Designs, methods & models (with examples from my own research)
- experiments (Bauer et al. 2019, Bauer & Clemm 2021, Bauer et al. 2021, Bauer & Poama 2020), matching, instrumental variables (Bauer & Fatke 2014), regression discontinuity design, difference-in-differences, fixed-effects model (Bauer 2015, 2019), etc. (e.g., Gangl 2010 for overview)
Potential outcomes & identification revolution (Imai 2011):
- Statistical inference: Models + statistical assumptions
Causal inference: Models + statistical assumptions + identification assumptions
- Statistical inference: Models + statistical assumptions
4 Inference (3): Missing data perspective
- Data perspective: Both causal inference and machine learning are about missing data!
- Causal inference perspective
- Replace (predict) Sofia’s (and others’) missing potential outcome(s) on variable
with other people’s observed outcomes!
- Replace (predict) Sofia’s (and others’) missing potential outcome(s) on variable
- Prediction/ML perspective
- Train model to predict missing observations on variable
(see “?”s)
- Train model to predict missing observations on variable
8 Timeline of statistical learning (James et al. 2013, 6–7)
- Beginning of the 19th century: Legendre and Gauss - method of least squares
- Earliest form linear regression [Astronomy, quantitative output values
- 1936: Fisher - Linear Discriminant Analysis
- 1940s: various authors - Logistic Regression
- 1970: Nelder and Wedderburn - Generalized Linear Models (GLM) of which linear and logistic regression are special cases
- By end of the 1970s: Many more techniques available but almost exclusively linear methods
- Fitting non-linear relationships was computationally infeasible at the time
- By the 1980s: Better computing technology facility non-linear methods
- Mid 1980s: Breiman, Friedman, Olshen and Stone - Classification and Regression Trees
- practical implementation including cross-validation for model selection
- 1986: Hastie/Tibshirani coin term “generalized additive models” for a class of non-linear extensions to generalized linear models (+ practical software implementation)
- Since then statistical learning has emerged as a new subfield!
References
Abadie, Alberto, Susan Athey, Guido W Imbens, and Jeffrey M Wooldridge. 2020. “Sampling-Based Versus Design-Based Uncertainty in Regression Analysis.” Econometrica 88 (1): 265–96.
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2013. An Introduction to Statistical Learning: With Applications in R. Springer Texts in Statistics. Springer.
Salganik, Matthew J, Ian Lundberg, Alexander T Kindel, Caitlin E Ahearn, Khaled Al-Ghoneim, Abdullah Almaatouq, Drew M Altschul, et al. 2020. “Measuring the Predictability of Life Outcomes with a Scientific Mass Collaboration.” Proc. Natl. Acad. Sci. U. S. A. 117 (15): 8398–8403.