军事医学2025,Vol.49Issue(3):207-213,7.DOI:10.7644/j.issn.1674-9960.2025.03.007
基于生成式大语言模型的中医医案信息抽取系统设计与实现
Design and implementation of a TCM record information retrieval system based on generative large language models
王煊泽 1李江域 1郑翔文 1肖宇 1毛华坚 1赵东升1
作者信息
- 1. 军事科学院军事医学研究院,北京 100850
- 折叠
摘要
Abstract
Objective To develop a system for retrieving information from clinical records of Traditional Chinese Medicine(TCM)based on generative large language models(LLMs).Methods Applicational needs of the system were analyzed,and entity types to be retrieved were identified.The functions,workflows,and architecture of the system were designed by combining the automatic retrieval capabilities of LLMs with human-in-the-loop(HITL).The software was developed using such frameworks as vLLM and Node.js.Interaction of multiple commercial/open source LLMs was implemented using OpenAI-compatible interfaces.The quality of information retrieved from LLMs was enhanced by prompt engineering.Results This system supported task allocation,automatic retrieval of structured information,and manual review.To evaluate its performance,the moonshot-v1-8k model was used to retrieve clinical records of TCM before manual edition was performed.Combining large language model pre-annotation with meticulous annotator edits improved accuracy by 26.6%compared to the BERT-BiLSTM-CRF model,and enhanced extraction efficiency by 1.6-fold relative to purely manual methods.Conclusion General generative LLMs can retrieve a wide range of entity information from TCM records with high accuracy and scalability.The design and implementation of this system approach may provide a useful reference for developing other biomedical information retrieval systems.关键词
生成式大语言模型/中医医案/信息抽取/提示词优化Key words
generative large language models/TCM records/information retrieval/prompt optimization分类
计算机与自动化引用本文复制引用
王煊泽,李江域,郑翔文,肖宇,毛华坚,赵东升..基于生成式大语言模型的中医医案信息抽取系统设计与实现[J].军事医学,2025,49(3):207-213,7.