中医药信息2024,Vol.41Issue(7):1-6,6.DOI:10.19656/j.cnki.1002-2406.20240701
基于深度学习的《脉经》中医术语命名实体识别研究
Study on Named Entity Recognition of Traditional Chinese Medicine Terms in the Pulse Classic Based on Deep Learning
摘要
Abstract
Objective:Based on deep learning method,this paper studies named entity recognition of terms in an ancient traditional Chinese medicine(TCM)book Pulse Classic.Methods:The book covers a large number of professional terms,the knowledge system is complicated,and the classification of words is difficult.Therefore,we used the combination of transfer learning and BERT to preprocess the Pulse Classic data set,and compared it with Bert-CRF,BiLSTM-CRF and Bert-BilstM-CRF models.Results:The F1 value of named entity recognition of Bert-Bilstm-CRF-radical feature model constructed in this experiment was 84.77%.Compared with BERT-CRF,BiLSTM-CRF and BERT-BiLSTM-CRF models,this model fully considered the professionalism and particularity of the field of Chinese medicine during the construction of word vectors,and learns not only the context,but also the radical features of entity words,with the optimal effect.Conclusion:The Bert-BilstM-CRF-radically feature model can effectively realize the named entity category recognition of terms of TCM ancient books,effectively improve the entity recognition accuracy of Chinese ancient books,lay a technical foundation for the subsequent knowledge map construction,and provide high-quality data support for clinical diagnosis.关键词
深度学习/迁移学习/命名实体识别/中医文本/BERTKey words
Deep learning/Transfer learning/Named entity recognition/TCM Text/BERT引用本文复制引用
宋熹玥,冯鑫雅,胡为,刘伟..基于深度学习的《脉经》中医术语命名实体识别研究[J].中医药信息,2024,41(7):1-6,6.基金项目
湖南省自然科学基金项目(2022JJ30438) (2022JJ30438)
长沙市自然科学基金项目(kq2202260) (kq2202260)
湖南省中医药科研课题项目(B2023039) (B2023039)