Full Reading List
Aldrich, John. 2008. “R. A. Fisher on Bayes and Bayes’
Theorem.” Bayesian Analysis 3 (1). https://doi.org/10.1214/08-BA306.
Breiman, Leo. 2001. “Statistical Modeling: The Two Cultures (with
Comments and a Rejoinder by the Author).” Statistical
Science 16 (3): 199–231. https://doi.org/10.1214/ss/1009213726.
Cassidy, Scott A., Ralitza Dimova, Benjamin Giguère, Jeffrey R. Spence,
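and David J. Stanley. 2019. “Failing Grade: 89% of
Introduction-to-Psychology Textbooks That Define or Explain
Statistical Significance Do So Incorrectly.”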
Advances in Methods and Practices in Psychological Science 2
(3): 233–39. https://doi.org/10.1177/2515245919858072.
Cobb, George W., and David S. Moore. 1997. “Mathematics,
Statistics, and Teaching.” The American Mathematical
Monthly 104 (9): 801–23. https://doi.org/10.1080/00029890.1997.11990723.
Cumming, Geoff. 2014. “The New Statistics: Why and How.”
Psychological Science 25 (1): 7–29. https://doi.org/10.1177/0956797613504966.
Dienes, Zoltan. 2011. “Bayesian Versus Orthodox Statistics: Which
Side Are You On?” Perspectives on Psychological Science
6 (3): 274–90. https://doi.org/10.1177/1745691611406920.
Fisher, R. A. 1922. “On the Mathematical Foundations of
Theoretical Statistics.” Philosophical Transactions of the
Royal Society of London. Series A, Containing Papers of a Mathematical
or Physical Character 222: 309–68. https://www.jstor.org/stable/91208.
Gelman, Andrew, and John Carlin. 2014. “Beyond Power
Calculations.” Perspectives on Psychological Science 9
(November): 641–51. https://doi.org/10.1177/1745691614551642.
Gelman, Andrew, and Christian Hennig. 2017. “Beyond Subjective and
Objective in Statistics.” Journal of the Royal Statistical
Society: Series A (Statistics in Society) 180 (4): 967–1033. https://doi.org/10.1111/rssa.12276.
Gelman, Andrew, and Cosma Rohilla Shalizi. 2013. “Philosophy and
the Practice of Bayesian Statistics.” British Journal
of Mathematical and Statistical Psychology 66 (1): 8–38. https://doi.org/10.1111/j.2044-8317.2011.02037.x.
Greenland, Sander, Stephen J. Senn, Kenneth J. Rothman, John B. Carlin,
Charles Poole, Steven N. Goodman, and Douglas G. Altman. 2016.
“Statistical Tests, p Values, Confidence Intervals, and Power: A
Guide to Misinterpretations.” European Journal of
Epidemiology 31 (April): 337–50. https://doi.org/10.1007/s10654-016-0149-3.
Haller, Heiko, and Stefan Krauss. 2002. “Misinterpretations of
Significance: A Problem Students Share with Their Teachers?”
Methods of Psychological Research Online 7.
Hoekstra, Rink, Richard D. Morey, Jeffrey N. Rouder, and Eric-Jan
Wagenmakers. 2014. “Robust Misinterpretation of Confidence
Intervals.” Psychonomic Bulletin & Review 21
(January): 1157–64. https://doi.org/10.3758/s13423-013-0572-3.
Kass, Robert E. 2011. “Statistical Inference: The Big
Picture.” Statistical Science 26 (1). https://doi.org/10.1214/10-STS337.
Kvarven, Amanda, Eirik Strømland, and Magnus Johannesson. 2020.
“Comparing Meta-Analyses and Preregistered Multiple-Laboratory
Replication Projects.” Nature Human Behaviour 4 (4):
423–34. https://doi.org/10.1038/s41562-019-0787-z.
Lehmann, E. L. 1993. “The Fisher, Neyman-Pearson Theories of
Testing Hypotheses: One Theory or Two?” Journal of the
American Statistical Association, 8.
Lenhard, Johannes. 2006. “Models and Statistical Inference: The
Controversy Between Fisher and Neyman–Pearson.”
The British Journal for the Philosophy of Science 57 (1):
69–91. https://doi.org/10.1093/bjps/axi152.
Lennox, Kristin. 2016. “All about That Bayes: Probability,
Statistics, and the Quest to Quantify Uncertainty.” https://www.youtube.com/watch?v=eDMGDhyDxuY.
Lewis, Molly, Maya B. Mathur, Tyler J. VanderWeele, and Michael C.
Frank. 2022. “The Puzzling Relationship Between Multi-Laboratory
Replications and Meta-Analyses of the Published Literature.”
Royal Society Open Science 9 (2): 211499. https://doi.org/10.1098/rsos.211499.
Lipton, Peter. 2000. “Inference to the Best Explanation.”
A Companion to the Philosophy of Science, 184–93.
Mayo, Deborah G. 2013. “The Error-Statistical Philosophy and the
Practice of Bayesian Statistics: Comments on Gelman and Shalizi:
‘Philosophy and the Practice of Bayesian
Statistics’.” British Journal of Mathematical
and Statistical Psychology 66 (1): 57–64. https://doi.org/10.1111/j.2044-8317.2012.02064.x.
McElreath, Richard. 2017. “Bayesian Statistics Without Frequentist
Language.” https://www.youtube.com/watch?v=yakg94HyWdE.
McShane, Blakeley B., and David Gal. 2016. “Blinding Us to the
Obvious? The Effect of Statistical Training on the Evaluation of
Evidence.” Management Science 62 (June): 1707–18. https://doi.org/10.1287/mnsc.2015.2212.
Rohrer, Julia M. 2018. “Thinking Clearly about Correlations and
Causation: Graphical Causal Models for Observational Data.”
Advances in Methods and Practices in Psychological Science 1
(January): 27–42. https://doi.org/10.1177/2515245917745629.
Scheel, Anne M., Mitchell R. M. J. Schijen, and Daniël Lakens. 2021.
“An Excess of Positive Results: Comparing the Standard Psychology
Literature With Registered Reports.” Advances in Methods and
Practices in Psychological Science 4 (2): 25152459211007467. https://doi.org/10.1177/25152459211007467.
Schield, Milo. 1999. “Simpson’s Paradox and Cornfield’s
Conditions.” ASA Proceedings of the Section on Statistical
Education 1999 (August): 106–11.
Schuirmann, Donald J. 1987. “A Comparison of the Two One-Sided
Tests Procedure and the Power Approach for Assessing the Equivalence of
Average Bioavailability.” Journal of Pharmacokinetics and
Biopharmaceutics 15 (6): 657–80. https://doi.org/10.1007/BF01068419.
“Severe Testing as a Basic Concept in a Neyman–Pearson
Philosophy of Induction.” n.d. https://www.journals.uchicago.edu/doi/epdf/10.1093/bjps/axl003.
Shirani-Mehr, Houshmand, David Rothschild, Sharad Goel, and Andrew
Gelman. 2018. “Disentangling Bias and Variance in Election
Polls.” Journal of the American Statistical Association
113 (March): 607–14. https://doi.org/10.1080/01621459.2018.1448823.
Simpson, Adrian. 2018. “Princesses Are Bigger Than Elephants:
Effect Size as a Category Error in Evidence-Based Education.”
British Educational Research Journal 44 (5): 897–913. https://doi.org/10.1002/berj.3474.
Sullivan, Gail M., and Richard Feinn. 2012. “Using Effect
Size–or Why the p Value Is Not Enough.” Journal
of Graduate Medical Education 4 (3): 279–82. https://doi.org/10.4300/JGME-D-12-00156.1.
Sweeney, Latanya. 2013. “Discrimination in Online Ad
Delivery.” Communications of the ACM 56 (5): 44–54. https://doi.org/10.1145/2447976.2447990.
Sylvetsky, Allison C., Janet Figueroa, Talia Zimmerman, Susan E.
Swithers, and Jean A. Welsh. 2019. “Consumption of Low-Calorie
Sweetened Beverages Is Associated with Higher Total Energy and Sugar
Intake Among Children, NHANES 2011–2016.”
Pediatric Obesity 14 (10): e12535. https://doi.org/10.1111/ijpo.12535.
Tukey, John W. 1991. “The Philosophy of Multiple
Comparisons.” Statistical Science 6 (1). https://doi.org/10.1214/ss/1177011945.