智能系统学报2025,Vol.20Issue(6):1295-1303,9.DOI:10.11992/tis.202410020
医学大语言模型的研发与应用系统综述
Developing and employing large language models in medicine
摘要
Abstract
Since the introduction of ChatGPT(chat generative pre-trained Transformer)in November 2022,studies re-lated to large language models(LLMs)for medical applications are increasing;however,a systematic review of this field is lacking.This review covered studies indexed in PubMed,Google Scholar,arXiv,bioXiv,and medRxiv up until June 31,2024,and identified 129 medical LLMs.LLMs were evaluated in clinical contexts,including their responses to medical queries,performance comparison,and specialist evaluation.The results revealed that general-purpose LLMs,such as ChatGPT and GPT-4,demonstrate better accuracy in generating medical records,whereas disease-specific LLMs excel in niche areas but may lack comprehensiveness.Challenges include variability in responses,readability is-sues,and biases,with few studies on LLM trustworthiness from patient or insurance perspectives.关键词
聊天机器人/人工智能/大语言模型/ChatGPT/医疗保健/临床诊断/医疗咨询/医疗信息学Key words
chatbot/artificial intelligence/large language models/ChatGPT/health care/clinical diagnosis/medical consultation/medical informatics分类
信息技术与安全科学引用本文复制引用
WANG Lu,DING Mufei,ZHOU He,HE Qianqian,SONG Jiangdian..医学大语言模型的研发与应用系统综述[J].智能系统学报,2025,20(6):1295-1303,9.基金项目
国家自然科学基金项目(92259104). (92259104)