| 注册
首页|期刊导航|铁道科学与工程学报|面向高速铁路道岔运维文本的知识抽取方法

面向高速铁路道岔运维文本的知识抽取方法

林海香 白万胜 赵正祥 胡娜娜 李冬 陆人杰

铁道科学与工程学报2024,Vol.21Issue(7):2569-2580,12.
铁道科学与工程学报2024,Vol.21Issue(7):2569-2580,12.DOI:10.19713/j.cnki.43-1423/u.T20231577

面向高速铁路道岔运维文本的知识抽取方法

Knowledge extraction method for operation and maintenance texts of high-speed railway turnout

林海香 1白万胜 1赵正祥 1胡娜娜 1李冬 1陆人杰2

作者信息

  • 1. 兰州交通大学 自动化与电气工程学院,甘肃 兰州 730070
  • 2. 卡斯柯信号有限公司,上海 200071
  • 折叠

摘要

Abstract

To achieve the automatic construction of the knowledge graph and provide decision-making support for intelligent operation and maintenance of high-speed railway turnout,it is necessary to use knowledge extraction technology to extract key knowledge from high-speed railway turnout maintenance texts.At the same time,to further solve the problem of entity nesting and overlapping knowledge triplets in these texts,this article proposed a knowledge extraction model RTOM-KE for high-speed railway turnout operation and maintenance based on multi-module joint learning.Firstly,based on the defined entity and relation types,a two-stage knowledge labeling strategy based on BIOES was proposed to label the head entity and corresponding tail entity under the relation.Secondly,the encoding module composed of the lightweight pre-training BERT-base model and BiLSTM neural network was used to obtain the multi-dimensional shared encoding representation of the text.The hidden state of the encoding module and the global contextual features were combined as the input of the head entity extraction module.Finally,the head entity extraction module was used to extract all candidate head entities in the text.The candidate head entity labels and the multi-dimensional shared word representation from the encoding module were used as the joint input of the tail entity extraction module.The specific relation gate mechanism was used to filter the tail entities associated with the head entity to obtain the knowledge triplet of the high-speed railway turnout maintenance.Through sufficient comparative experiments and ablation experiments,the results are drawn as follows.The RTOM-KE model can accurately and comprehensively extract triplets of different complexities and effectively solve the problems of entity nesting and triplet overlapping.The Precision,Recall,and F1 values of RTOM-KE model based on the turnout operation and maintenance dataset can reach 88.3%,86.9%,and 87.6%,respectively.The research results can provide reference for further improving the knowledge extraction efficiency of more complex high-speed railway turnout maintenance texts and information extraction in other professional fields.

关键词

高速铁路/道岔/运维/知识抽取/BERT模型

Key words

high-speed railway/turnout/operation and maintenance/knowledge extraction/BERT model

分类

交通工程

引用本文复制引用

林海香,白万胜,赵正祥,胡娜娜,李冬,陆人杰..面向高速铁路道岔运维文本的知识抽取方法[J].铁道科学与工程学报,2024,21(7):2569-2580,12.

基金项目

甘肃省重点研发计划-工业类(23YFGA0046) (23YFGA0046)

四电BIM工程与智能应用铁路行业重点实验室2022年度开放课题(BIMKF-2022-02) (BIMKF-2022-02)

铁道科学与工程学报

OA北大核心CSTPCDEI

1672-7029

访问量0
|
下载量0
段落导航相关论文