Will generative AI replace biostatisticians? Opportunities, challenges, and professional responsibility in the era of large language models.
Generative AI and biostatisticians
DOI:
https://doi.org/10.54103/2282-0930/31293Keywords:
Accountability, Biostatistics, Generative Artificial Intelligence, Natural Language Processing, Reproducibility of ResultsAbstract
The rapid diffusion of large language models (LLMs) is reshaping many aspects of biomedical research, prompting reflection on the evolving role of biostatisticians in a changing landscape. This commentary addresses this transformation from two complementary perspectives. First, we examine how LLMs can assist biostatisticians in professional practice, highlighting their potential to enhance efficiency, support complex analytical reasoning, and facilitate communication and training. Second, we discuss the limitations and risks associated with LLM use, including challenges to reproducibility, susceptibility to bias, data protection and regulatory constraints, and limited accountability. Finally, we outline a vision for the role of scientific societies in actively guiding this transition. By promoting methodological competence, ethical awareness, and professional identity, they can help ensure that generative AI becomes an instrument of responsible innovation rather than a source of methodological bias and epistemic uncertainty.
Downloads
References
European Parliament, Directorate General for Parliamentary Research Services. Artificial intelligence: how does it work, why does it matter, and what we can do about it? LU: Publications Office; 2020. https://data.europa.eu/doi/10.2861/44572.
Topol EJ. As artificial intelligence goes multimodal, medical applications multiply. Science 2023;381:eadk6139. https://doi.org/10.1126/science.adk6139.
Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW. Large language models in medicine. Nat Med 2023;29:1930–40. https://doi.org/10.1038/s41591-023-02448-8.
Grambow SC, Desai M, Weinfurt KP et al. Integrating large language models in biostatistical workflows for clinical and translational research. J Clin Trans Sci 2025;9:e131. https://doi.org/10.1017/cts.2025.10064.
Donoho DL, Kang J, Lin X et al. “Rebuilding” Statistics in the Age of AI: A Town Hall Discussion on Culture, Infrastructure, and Training 2026. https://doi.org/10.48550/arXiv.2601.17510.
Dobler D, Binder H, Boulesteix A et al. ChatGPT as a Tool for Biostatisticians: A Tutorial on Applications, Opportunities, and Limitations. Statistics in Medicine 2025;44:e70263. https://doi.org/10.1002/sim.70263.
Zhu B. Biostatisticians Meet AI : Navigating Shifts While Preserving Principles. Statistics in Medicine 2025;44:e70271. https://doi.org/10.1002/sim.70271.
Gilson A, Safranek CW, Huang T et al. How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment. JMIR Med Educ 2023;9:e45312. https://doi.org/10.2196/45312.
Gould E, Fraser HS, Parker TH et al. Same data, different analysts: variation in effect sizes due to analytical decisions in ecology and evolutionary biology. BMC Biol 2025;23:35. https://doi.org/10.1186/s12915-024-02101-x.
Omar M, Soffer S, Agbareia R et al. Sociodemographic biases in medical decision making by large language models. Nat Med 2025;31:1873–81. https://doi.org/10.1038/s41591-025-03626-6.
Bender EM, Gebru T, McMillan-Major A, Shmitchell S. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, Virtual Event Canada: ACM; 2021, p. 610–23. https://doi.org/10.1145/3442188.3445922.
ChatGPT, il Garante privacy chiude l’istruttoria. OpenAI dovrà realizzare una campagna informativa di sei mesi e pagare una sanzione di 15 milioni di euro (press release) 2024. https://www.garanteprivacy.it:443/home/docweb/-/docweb-display/docweb/10085432 (accessed February 24, 2026).
Ranganathan A, Ye XM. AI Doesn’t Reduce Work—It Intensifies It. Harvard Business Review 2026. https://hbr.org/2026/02/ai-doesnt-reduce-work-it-intensifies-it.
Liu X, Cruz Rivera S, Moher D et al. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension. The Lancet Digital Health 2020;2:e537–48. https://doi.org/10.1016/S2589-7500(20)30218-1.
Cruz Rivera S, Liu X, Chan A-W et al. Guidelines for clinical trial protocols for interventions involving artificial intelligence: the SPIRIT-AI extension. Nat Med 2020;26:1351–63. https://doi.org/10.1038/s41591-020-1037-7.
Hamm B, Marti-Bonmati L, Sardanelli F. ESR Journals editors’ joint statement on Guidelines for the Use of Large Language Models by Authors, Reviewers, and Editors. Eur Radiol Exp 2024;8:7. https://doi.org/10.1186/s41747-023-00420-2.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Alessandro Marcon, Valentina Panetta, Giuseppe Maglietta, Lorenza Scotti, Vittorio Simeon, Giovanni Veronesi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Accepted 2026-05-12
Published 2026-06-15


