内蒙古民族大学学报(自然科学版)2025,Vol.40Issue(5):51-60,10.DOI:10.14045/j.cnki.15-1220.2025.05.008
基于多层次特征融合与序列依赖的蒙医方剂命名实体识别
Named Entity Recognition of Mongolian Medical Prescription Based on Multi-level Feature Fusion and Sequence Dependence
摘要
Abstract
The named entity recognition of Mongolian medicine prescriptions plays an important role in the con-struction of knowledge graph in the field of Mongolian medicine.There are challenges in the named entity recogni-tion in the field of Mongolian medical prescriptions,such as the difficulty of long-term dependence modeling and the inaccurate recognition of entity boundaries.In order to solve the above challenges,an MBTC named entity recog-nition model based on multi-level feature fusion and sequence-dependent modeling was constructed.By capturing the deep semantic of Mongolian medical texts through MacBERT,and a multi-scale convolution-expansion percep-tion feature module is introduced,which incorporates the long-short dependence into the same feature space at the same time to solve the problem of long-distance dependence.BiLSTM-CRF joint decoding is used to jointly strengthen the label dependence and correct the boundary,so as to improve the recognition accuracy and label con-sistency.Based on the Mongolian medicine formula dataset constructed by integrating classic Mongolian medicine books and authoritative website data and after being reviewed by Mongolian medicine experts,a comparative experi-ment was conducted with seven mainstream baseline models.The results showed that MBTC achieved the highest F1 value of 87.7%.At the same time,the generalization ability of this model was verified on the public dataset of People's Daily.关键词
实体识别/蒙医方剂/空洞卷积Key words
entity recognition/Mongolian medicine prescriptions/hollow convolution分类
信息技术与安全科学引用本文复制引用
杨一帆,刘忠博,白青海,张军,刁宇峰,周玉新..基于多层次特征融合与序列依赖的蒙医方剂命名实体识别[J].内蒙古民族大学学报(自然科学版),2025,40(5):51-60,10.基金项目
内蒙古自治区自然科学基金面上项目(2022MS06028) (2022MS06028)
内蒙古自治区研究生科研创新项目(KC2024074S) (KC2024074S)
内蒙古民族大学智慧农牧创新团队项目 ()
内蒙古民族大学博士科研启动基金项目(BS438) (BS438)