| 注册
首页|期刊导航|电子科技大学学报|结合全局信息增强的医学领域命名实体识别研究

结合全局信息增强的医学领域命名实体识别研究

要媛媛 付潇 杨东瑛 王洁宁 郑文

电子科技大学学报2024,Vol.53Issue(3):431-439,9.
电子科技大学学报2024,Vol.53Issue(3):431-439,9.DOI:10.12178/1001-0548.2023064

结合全局信息增强的医学领域命名实体识别研究

Research on Named Entity Recognition in Medical Domain with Global Information Augmentation

要媛媛 1付潇 2杨东瑛 1王洁宁 1郑文3

作者信息

  • 1. 太原理工大学计算机科学与技术学院(大数据学院),晋中 030600
  • 2. 中国船舶集团有限公司综合技术经济研究院,北京 100081
  • 3. 太原理工大学计算机科学与技术学院(大数据学院),晋中 030600||长治医学院山西省智能数据辅助诊疗工程研究中心,长治 046000
  • 折叠

摘要

Abstract

Entities such as drug names are difficult to identify accurately in Chinese medical questioning texts due to the frequent occurrence of colloquial irregular expressions and jargon.To make full use of the important role of inter-word relations in Chinese sentences,a medical named entity recognition model for enhancing global information is proposed.The model enhances the word embedding representation using an attention mechanism and enriches the global information representation of sentences in two ways simultaneously,based on the use of the sequence processing capability of bidirectional long and short-term memory networks to obtain contextual information.Firstly,a graphical convolutional network layer is constructed to enrich inter-word dependencies based on syntactic relationships to obtain additional dependencies between words;secondly,an auxiliary task is constructed to predict the class of syntactic dependencies between words.Experimental results on the Chinese medical consultation dataset show that the model is very competitive,with an F1 value of 94.54%.Significant improvements are achieved in the recognition of entity classes such as drugs and symptoms compared to other models.Experiments on the Weibo public dataset also show that the model has general-domain applications.

关键词

注意力机制/双向长短时记忆网络/图卷积网络/医疗问诊/命名实体识别

Key words

attention mechanism/bidirectional long and short-term memory network/graph convolutional network/medical consultation/named entity recognition

分类

信息技术与安全科学

引用本文复制引用

要媛媛,付潇,杨东瑛,王洁宁,郑文..结合全局信息增强的医学领域命名实体识别研究[J].电子科技大学学报,2024,53(3):431-439,9.

基金项目

国家自然科学基金(11702289) (11702289)

山西省关键核心技术和共性技术研发攻关专项(2020XXX013) (2020XXX013)

电子科技大学学报

OA北大核心CSTPCD

1001-0548

访问量0
|
下载量0
段落导航相关论文