New chapter in pediatric medicine: technological evolution, application, and evaluation system of large language models

Vaswani A, Shazeer N, Parmar N, et al (2017) Attention is all you need[J]. Advances in neural information processing systems, p 30

Radford A, Wu J, Child R et al (2019) Language models are unsupervised multitask learners [J]. OpenAI blog 1(8):9

Google Scholar 

Brown T, Mann B, Ryder N et al (2020) Language models are few-shot learners [J]. Adv Neural Inf Process Syst 33:1877–1901

Google Scholar 

Achiam J, Adler S, Agarwal S, et al (2023) Gpt-4 technical report[J]. arXiv preprint arXiv:2303.08774

Ferber D, El Nahhas OSM, WöLFLEIN G et al (2025) Development and validation of an autonomous artificial intelligence agent for clinical decision-making in oncology. Nat Cancer 6(8):1337–1349

Article  CAS  PubMed  PubMed Central  Google Scholar 

Riedemann L, Labonne M, Gilbert S (2024) The path forward for large language models in medicine is open. NPJ Digit Med 7(1):339

Article  PubMed  PubMed Central  Google Scholar 

Singhal K, Tu T, Gottweis J et al (2025) Toward expert-level medical question answering with large language models. Nat Med 31(3):943–950

Article  CAS  PubMed  PubMed Central  Google Scholar 

Chen TJ, Dong B, Dong Y et al (2024) Matching actions to needs: shifting policy responses to the changing health needs of Chinese children and adolescents. Lancet 403(10438):1808–1820

Article  PubMed  Google Scholar 

Black MM, Walker SP, Fernald LCH et al (2017) Early childhood development coming of age: science through the life course. Lancet 389(10064):77–90

Article  PubMed  Google Scholar 

Seniwati T, Wanda D, Nurhaeni N (2023) Effects of patient and family-centered care on quality of care in pediatric patients: a systematic review. Nurse Media J Nurs. https://doi.org/10.14710/nmjn.v13i1.48114

Article  Google Scholar 

Hodgson CR, Mehra R, Franck LS (2024) Child and family outcomes and experiences related to family-centered care interventions for hospitalized pediatric patients: a systematic review. Children 11(8):949

Article  PubMed  PubMed Central  Google Scholar 

Mccarthy E, Guerin S (2022) Family-centred care in early intervention: a systematic review of the processes and outcomes of family-centred care and impacting factors. Child Care Health Dev 48(1):1–32

Article  PubMed  Google Scholar 

Arnold D, Jones BL (2009) Personalized medicine: a pediatric perspective. Curr Allergy Asthma Rep 9(6):426–432

Article  PubMed  Google Scholar 

Kearns GL, Abdel-Rahman SM, Alander SW et al (2003) Developmental pharmacology—drug disposition, action, and therapy in infants and children. N Engl J Med 349(12):1157–1167

Article  CAS  PubMed  Google Scholar 

Stein A, Dalton L, Rapa E et al (2019) Communication with children and adolescents about the diagnosis of their own life-threatening condition. Lancet 393(10176):1150–1163

Article  PubMed  Google Scholar 

Klassen TP, Hartling L, Craig JC et al (2008) Children are not just small adults: the urgent need for high-quality trial evidence in children [J]. PLoS Med 5(8):e172

Article  PubMed  PubMed Central  Google Scholar 

Chng SY, Tern MJW, Lee YS et al (2025) Ethical considerations in AI for child health and recommendations for child-centered medical AI. NPJ Digit Med 8(1):152

Article  PubMed  PubMed Central  Google Scholar 

Douma H, McNamara C, Bakola M et al (2025) Leveraging ChatGPT to strengthen pediatric healthcare systems: a systematic review [J]. Eur J Pediatr 184(8):1–11

Article  Google Scholar 

Kaplan J, Mccandlish S, Henighan T et al (2020) Scaling laws for neural language models. arXiv preprint arXiv:200108361 https://doi.org/10.48550/arXiv.2001.08361

Chowdhery A, Narang S, Devlin J et al (2023) Palm: scaling language modeling with pathways [J]. J Mach Learn Res 24(240):1–113

Google Scholar 

Team G, Anil R, Borgeaud S et al (2023) Gemini: a family of highly capable multimodal models. arXiv preprint arXiv:231211805 https://doi.org/10.48550/arXiv.2312.11805

Touvron H, Lavril T, Izacard G et al (2023) Llama: open and efficient foundation language models. arXiv preprint arXiv:230213971. https://doi.org/10.48550/arXiv.2302.13971

Hoffmann J, Borgeaud S, Mensch A et al (2022) Training compute-optimal large language models. arXiv preprint arXiv:220315556. https://doi.org/10.48550/arXiv.2203.15556

Nori H, King N, Mckinney SM et al (2023) Capabilities of gpt-4 on medical challenge problems. arXiv preprint arXiv:230313375. https://doi.org/10.48550/arXiv.2303.13375

Singhal K, Azizi S, Tu T et al (2023) Large language models encode clinical knowledge. Nature 620(7972):172–180

Article  CAS  PubMed  PubMed Central  Google Scholar 

Jeblick K, Schachtner B, Dexl J et al (2024) ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports [J]. Eur Radiol 34(5):2817–2825

Article  PubMed  Google Scholar 

Kung TH, Cheatham M, Medenilla A et al (2023) Performance of ChatGPT on USMLE: potential for AI-assisted medical education using large language models. PLoS Digit Health 2(2):e0000198

Article  PubMed  PubMed Central  Google Scholar 

Thirunavukarasu AJ, Ting DSJ, Elangovan K et al (2023) Large language models in medicine [J]. Nat Med 29(8):1930–1940

Article  CAS  PubMed  Google Scholar 

Shen Y, Heacock L, Elias J, Radiological Society of North America et al (2023) ChatGPT and other large language models are double-edged swords. Radiology. https://doi.org/10.1148/radiol.230163

Article  PubMed  PubMed Central  Google Scholar 

Liu X, Liu H, Yang G et al (2025) A generalist medical language model for disease diagnosis assistance. Nat Med 31(3):932–942

Article  CAS  PubMed  Google Scholar 

Gu Y, Tinn R, Cheng H et al (2020) Domain-specific language model pretraining for biomedical natural language processing [J]. ACM Transactions on Computing for Healthcare (HEALTH) 3:1–23

Google Scholar 

Jin D, Pan E, Oufattole N et al (2021) What disease does this patient have? A large-scale open domain question answering dataset from medical exams. Appl Sci 11(14):6421

Article  CAS  Google Scholar 

Sellergren A, Kazemzadeh S, Jaroensri T, et al (2025) Medgemma technical report[J]. arXiv preprint arXiv:2507.05201

Dou C, Liu C, Yang F, et al (2025) Baichuan-m2: Scaling medical capability with large verifier system[J]. arXiv preprint arXiv:2509.02208

Arora R K, Wei J, Hicks R S, et al (2025) Healthbench: Evaluating large language models towards improved human health[J]. arXiv preprint arXiv:2505.08775

Agarwal S, Ahmad L, Ai J, et al (2025) gpt-oss-120b & gpt-oss-20b model card[J]. arXiv preprint arXiv:2508.10925

Yang A, Li A, Yang B, et al (2025) Qwen3 technical report[J]. arXiv preprint arXiv:2505.09388

Wu Y, Qian B, Li T et al (2025) An eyecare foundation model for clinical assistance: a randomized controlled trial. Nat Med. https://doi.org/10.1038/s41591-025-03900-7

Article  PubMed  PubMed Central  Google Scholar 

Moor M, Banerjee O, Abad ZSH et al (2023) Foundation models for generalist medical artificial intelligence. Nature 616(7956):259–265

Article  CAS  PubMed  Google Scholar 

Tu T, Azizi S, Driess D et al (2024) Towards generalist biomedical AI. Nejm Ai 1(3):AIoa2300138

Article  Google Scholar 

Guo H, Xing X, Zhou Y et al (2025) A survey of large language model for drug research and development. IEEE Access. https://doi.org/10.1109/ACCESS.2025.3552256

Article  Google Scholar 

Artsi Y, Klang E, Collins JD et al (2025) Large language models in radiology reporting—a systematic review of performance, limitations, and clinical implications. medRxiv: 2025.03.18.25324193. https://doi.org/10.1101/2025.03.18.25324193

Fedus W, Zoph B, Shazeer N (2022) Switch transformers: scaling to trillion parameter models with simple and efficient sparsity. J Mach Learn Res 23(120):1–39

Google Scholar 

Jiang S, Zheng T, Zhang Y, et al (2024) Med-moe: Mixture of domain-specific experts for lightweight medical vision-language models[J]. arXiv preprint arXiv:2404.10237

Xiao H, Zhou F, Liu X et al (2025) A comprehensive survey of large language models and multimodal large language models in medicine. Inf Fusion 117:102888

Article  Google Scholar 

Peng C, Yang X, Chen A et al (2023) A study of generative large language model for medical research and healthcare. NPJ Digit Med 6(1):210

Article  PubMed  PubMed Central  Google Scholar 

Yang X, Chen A, Pournejatian N et al (2022) A large language model for electronic health records. NPJ Digit Med 5(1):194

Article  PubMed  PubMed Central  Google Scholar 

Alsentzer E, Murphy J, Boag W, et al (2019) Publicly available clinical BERT embeddings[C]//. In: Proceedings of the 2nd clinical natural language processing workshop, pp 72–78

Comments (0)

No login
gif