| 注册
首页|期刊导航|现代信息科技|基于实体知识的石油炼化领域命名实体识别

基于实体知识的石油炼化领域命名实体识别

丁建新 王晓伟 温欣 屈克将 王建华 赵艳红 胡思颍

现代信息科技2024,Vol.8Issue(12):40-46,7.
现代信息科技2024,Vol.8Issue(12):40-46,7.DOI:10.19850/j.cnki.2096-4706.2024.12.010

基于实体知识的石油炼化领域命名实体识别

Named Entity Recognition in Petroleum Refining Domain Based on Entity Knowledge

丁建新 1王晓伟 2温欣 1屈克将 1王建华 1赵艳红 1胡思颍2

作者信息

  • 1. 昆仑数智科技有限责任公司 智慧油服事业部,北京 100071
  • 2. 中国石油大学(北京) 信息科学与工程学院/人工智能学院,北京 102249
  • 折叠

摘要

Abstract

Named entity recognition task in the petroleum refining domain suffers from the problems of scarcity of labeled data as well as the existing pre-trained language models cannot recognize domain combination and nested entities well.Based on this,a data augmentation method EEKR(External Entity Knowledge Replacement,EEKR)based on external entity knowledge is firstly proposed,which effectively solves the problem of scarcity of labeled data by introducing an external entity knowledge base and completing data augmentation by replacing it with entities in the labeled data at the entity level.After that,a named entity recognition model IIEKNER(Namd Entity Recognition Incorporating Internal Entity Knowledge,IIEKNER)is proposed,which incorporates internal entity knowledge into the pre-training model by obtaining internal entity embeddings in the labeled samples.Thus,nested and combined entities in the petroleum refining domain can be recognized more accurately.The experimental results show that compared to other models,the IIEKNER model based on EEKR data augmentation method has better recognition performance.

关键词

命名实体识别/石油炼化领域/数据增强/BERT

Key words

named entity recognition/petroleum refining domain/data augmentation/BERT

分类

信息技术与安全科学

引用本文复制引用

丁建新,王晓伟,温欣,屈克将,王建华,赵艳红,胡思颍..基于实体知识的石油炼化领域命名实体识别[J].现代信息科技,2024,8(12):40-46,7.

基金项目

国家重点研发计划(2019YFC0312003) (2019YFC0312003)

现代信息科技

2096-4706

访问量6
|
下载量0
段落导航相关论文