Full Reading List

Aldrich, John. 2008. “R. A. Fisher on Bayes and Bayes’ Theorem.” Bayesian Analysis 3 (1). https://doi.org/10.1214/08-BA306.

Breiman, Leo. 2001. “Statistical Modeling: The Two Cultures (with Comments and a Rejoinder by the Author).” Statistical Science 16 (3): 199–231. https://doi.org/10.1214/ss/1009213726.

Cassidy, Scott A., Ralitza Dimova, Benjamin Giguère, Jeffrey R. Spence, and David J. Stanley. 2019. “Failing Grade: 89% of Introduction-to-Psychology Textbooks That Define or Explain Statistical Significance Do So Incorrectly.” Advances in Methods and Practices in Psychological Science 2 (3): 233–39. https://doi.org/10.1177/2515245919858072.

Cobb, George W., and David S. Moore. 1997. “Mathematics, Statistics, and Teaching.” The American Mathematical Monthly 104 (9): 801–23. https://doi.org/10.1080/00029890.1997.11990723.

Cumming, Geoff. 2014. “The New Statistics: Why and How.” Psychological Science 25 (1): 7–29. https://doi.org/10.1177/0956797613504966.

Dienes, Zoltan. 2011. “Bayesian Versus Orthodox Statistics: Which Side Are You On?” Perspectives on Psychological Science 6 (3): 274–90. https://doi.org/10.1177/1745691611406920.

Fisher, R. A. 1922. “On the Mathematical Foundations of Theoretical Statistics.” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 222: 309–68. https://www.jstor.org/stable/91208.

Gelman, Andrew, and John Carlin. 2014. “Beyond Power Calculations.” Perspectives on Psychological Science 9 (November): 641–51. https://doi.org/10.1177/1745691614551642.

Gelman, Andrew, and Christian Hennig. 2017. “Beyond Subjective and Objective in Statistics.” Journal of the Royal Statistical Society: Series A (Statistics in Society) 180 (4): 967–1033. https://doi.org/10.1111/rssa.12276.

Gelman, Andrew, and Cosma Rohilla Shalizi. 2013. “Philosophy and the Practice of Bayesian Statistics.” British Journal of Mathematical and Statistical Psychology 66 (1): 8–38. https://doi.org/10.1111/j.2044-8317.2011.02037.x.

Greenland, Sander, Stephen J. Senn, Kenneth J. Rothman, John B. Carlin, Charles Poole, Steven N. Goodman, and Douglas G. Altman. 2016. “Statistical Tests, p Values, Confidence Intervals, and Power: A Guide to Misinterpretations.” European Journal of Epidemiology 31 (April): 337–50. https://doi.org/10.1007/s10654-016-0149-3.

Haller, Heiko, and Stefan Krauss. 2002. “Misinterpretations of Significance: A Problem Students Share with Their Teachers?” Methods of Psychological Research Online 7.

Hoekstra, Rink, Richard D. Morey, Jeffrey N. Rouder, and Eric-Jan Wagenmakers. 2014. “Robust Misinterpretation of Confidence Intervals.” Psychonomic Bulletin & Review 21 (January): 1157–64. https://doi.org/10.3758/s13423-013-0572-3.

Kass, Robert E. 2011. “Statistical Inference: The Big Picture.” Statistical Science 26 (1). https://doi.org/10.1214/10-STS337.

Kvarven, Amanda, Eirik Strømland, and Magnus Johannesson. 2020. “Comparing Meta-Analyses and Preregistered Multiple-Laboratory Replication Projects.” Nature Human Behaviour 4 (4): 423–34. https://doi.org/10.1038/s41562-019-0787-z.

Lehmann, E. L. 1993. “The Fisher, Neyman-Pearson Theories of Testing Hypotheses: One Theory or Two?” Journal of the American Statistical Association, 8.

Lenhard, Johannes. 2006. “Models and Statistical Inference: The Controversy Between Fisher and Neyman–Pearson.” The British Journal for the Philosophy of Science 57 (1): 69–91. https://doi.org/10.1093/bjps/axi152.

Lennox, Kristin. 2016. “All about That Bayes: Probability, Statistics, and the Quest to Quantify Uncertainty.” https://www.youtube.com/watch?v=eDMGDhyDxuY.

Lewis, Molly, Maya B. Mathur, Tyler J. VanderWeele, and Michael C. Frank. 2022. “The Puzzling Relationship Between Multi-Laboratory Replications and Meta-Analyses of the Published Literature.” Royal Society Open Science 9 (2): 211499. https://doi.org/10.1098/rsos.211499.

Lipton, Peter. 2000. “Inference to the Best Explanation.” A Companion to the Philosophy of Science, 184–93.

Mayo, Deborah G. 2013. “The Error-Statistical Philosophy and the Practice of Bayesian Statistics: Comments on Gelman and Shalizi: ‘Philosophy and the Practice of Bayesian Statistics’.” British Journal of Mathematical and Statistical Psychology 66 (1): 57–64. https://doi.org/10.1111/j.2044-8317.2012.02064.x.

McElreath, Richard. 2017. “Bayesian Statistics Without Frequentist Language.” https://www.youtube.com/watch?v=yakg94HyWdE.

McShane, Blakeley B., and David Gal. 2016. “Blinding Us to the Obvious? The Effect of Statistical Training on the Evaluation of Evidence.” Management Science 62 (June): 1707–18. https://doi.org/10.1287/mnsc.2015.2212.

Rohrer, Julia M. 2018. “Thinking Clearly about Correlations and Causation: Graphical Causal Models for Observational Data.” Advances in Methods and Practices in Psychological Science 1 (January): 27–42. https://doi.org/10.1177/2515245917745629.

Scheel, Anne M., Mitchell R. M. J. Schijen, and Daniël Lakens. 2021. “An Excess of Positive Results: Comparing the Standard Psychology Literature With Registered Reports.” Advances in Methods and Practices in Psychological Science 4 (2): 25152459211007467. https://doi.org/10.1177/25152459211007467.

Schield, Milo. 1999. “Simpson’s Paradox and Cornfield’s Conditions.” ASA Proceedings of the Section on Statistical Education 1999 (August): 106–11.

Schuirmann, Donald J. 1987. “A Comparison of the Two One-Sided Tests Procedure and the Power Approach for Assessing the Equivalence of Average Bioavailability.” Journal of Pharmacokinetics and Biopharmaceutics 15 (6): 657–80. https://doi.org/10.1007/BF01068419.

“Severe Testing as a Basic Concept in a Neyman–Pearson Philosophy of Induction.” n.d. https://www.journals.uchicago.edu/doi/epdf/10.1093/bjps/axl003.

Shirani-Mehr, Houshmand, David Rothschild, Sharad Goel, and Andrew Gelman. 2018. “Disentangling Bias and Variance in Election Polls.” Journal of the American Statistical Association 113 (March): 607–14. https://doi.org/10.1080/01621459.2018.1448823.

Simpson, Adrian. 2018. “Princesses Are Bigger Than Elephants: Effect Size as a Category Error in Evidence-Based Education.” British Educational Research Journal 44 (5): 897–913. https://doi.org/10.1002/berj.3474.

Sullivan, Gail M., and Richard Feinn. 2012. “Using Effect Size - or Why the p Value Is Not Enough.” Journal of Graduate Medical Education 4 (3): 279–82. https://doi.org/10.4300/JGME-D-12-00156.1.

Sweeney, Latanya. 2013. “Discrimination in Online Ad Delivery.” Communications of the ACM 56 (5): 44–54. https://doi.org/10.1145/2447976.2447990.

Sylvetsky, Allison C., Janet Figueroa, Talia Zimmerman, Susan E. Swithers, and Jean A. Welsh. 2019. “Consumption of Low-Calorie Sweetened Beverages Is Associated with Higher Total Energy and Sugar Intake Among Children, NHANES 2011–2016.” Pediatric Obesity 14 (10): e12535. https://doi.org/10.1111/ijpo.12535.

Tukey, John W. 1991. “The Philosophy of Multiple Comparisons.” Statistical Science 6 (1). https://doi.org/10.1214/ss/1177011945.