Computer Engineering (计算机工程), 2026, Vol. 52, Issue 5: 360-370, 11. DOI: 10.19678/j.issn.1000-3428.0070408
基于大小模型融合的医疗数据分类方法
Classification Method for Medical Data Based on the Fusion of Large and Small Models
Abstract
Medical data is difficult to protect because it covers a wide range of content, is large in volume, and takes diverse forms. To classify such data effectively and apply privacy protection measures that match the classification results, this article proposes a classification method that fuses large and small models according to the sensitivity level of medical information, with the goal of classification-based encryption of medical data. First, a Large Language Model (LLM) deep neural network, combined with the Medical Data Classification Standards (MDCS), annotates the medical dataset and outputs features. These output features then serve as the input to a small text classification model, whose Long Short-Term Memory (LSTM) network learns feature representations from the text. Finally, the erroneous predictions of the small text classification model are returned to the LLM for reclassification, and the classification results of the large and small models are fused to classify medical data accurately by sensitivity level. Experimental results show that the fusion method achieves better model convergence, classification accuracy, and classification balance than other classification models and standards. This indicates that the iterative mechanism of large and small model fusion is well suited to medical data scenarios and can significantly improve classification accuracy, make classification more efficient, and help ensure the privacy protection of medical data.
Keywords
medical data classification / privacy protection / classification standard / fusion of large and small models / Large Language Model (LLM) / machine learning
Classification
Information Technology and Security Science
Cite this article
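LI Jiangtao (李江涛), MA Li (马礼), LI Yang (李阳). Classification Method for Medical Data Based on the Fusion of Large and Small Models [J]. Computer Engineering (计算机工程), 2026, 52(5): 360-370, 11.
Funding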
Beijing Natural Science Foundation (4234083)
National Key Research and Development Program of China (2024YFE0200500, 2023YFC3107804)
Scientific Research Program of the Beijing Municipal Education Commission (KM202410009003)
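
Illustrative sketch
The abstract describes a final step in which uncertain or erroneous small-model predictions are handed back to the LLM and the two sets of results are fused. The following is only a minimal sketch of that routing idea, not the authors' implementation. It assumes that a low confidence score from the small model stands in for an "erroneous prediction", which the abstract does not state. The names `fuse_classify`, `toy_small_predict`, `toy_llm_reclassify`, the 0.8 threshold, and the "high / medium / low" sensitivity labels are all hypothetical; the two toy functions are keyword stubs standing in for an LSTM classifier and an LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class FusedResult:
    text: str
    label: str   # sensitivity level (hypothetical label set)
    source: str  # "small" if the small model's label was kept, else "llm"


def fuse_classify(
    texts: List[str],
    small_predict: Callable[[str], Tuple[str, float]],
    llm_reclassify: Callable[[str], str],
    threshold: float = 0.8,  # assumed cut-off, not taken from the paper
) -> List[FusedResult]:
    """Keep confident small-model labels; send the rest back to the LLM."""
    results = []
    for text in texts:
        label, confidence = small_predict(text)
        if confidence >= threshold:
            results.append(FusedResult(text, label, "small"))
        else:
            # Assumption: low confidence is used as a proxy for a wrong prediction.
            results.append(FusedResult(text, llm_reclassify(text), "llm"))
    return results


def toy_small_predict(text: str) -> Tuple[str, float]:
    """Stub for the small text classifier: returns (label, confidence)."""
    if "diagnosis" in text.lower():
        return "high", 0.95
    return "low", 0.55


def toy_llm_reclassify(text: str) -> str:
    """Stub for the LLM reclassification call."""
    return "medium" if "appointment" in text.lower() else "low"


records = ["Diagnosis: type 2 diabetes", "Appointment on Tuesday at 9:00"]
fused = fuse_classify(records, toy_small_predict, toy_llm_reclassify)
for item in fused:
    print(item.source, item.label, item.text)
```

Passing the two models in as callables keeps the routing logic independent of any particular LLM API or LSTM framework, so either stub can be swapped for a real model without changing `fuse_classify`.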