| 注册
首页|期刊导航|山东电力技术|电网技改与检修工程造价文本的信息抽取与知识图谱构建

电网技改与检修工程造价文本的信息抽取与知识图谱构建

张丽萍 吴莉琳 李志翔 庞圣养 舒胜文

山东电力技术2026,Vol.53Issue(4):76-86,11.
山东电力技术2026,Vol.53Issue(4):76-86,11.DOI:10.20097/j.cnki.issn1007-9904.250120

电网技改与检修工程造价文本的信息抽取与知识图谱构建

Information Extraction and Knowledge Graph Construction for Cost Documents of Technical Reform and Maintenance Projects in Power Grid

张丽萍 1吴莉琳 2李志翔 2庞圣养 2舒胜文3

作者信息

  • 1. 广东电网有限责任公司,广东 广州 510000
  • 2. 广东电网有限责任公司湛江供电局,广东 湛江 524000
  • 3. 福州大学 电气工程与自动化学院,福建 福州 350000
  • 折叠

摘要

Abstract

To address the challenges of massive,various data formats,prominent unstructured characteristics,and difficulties in effective mining and utilization of power grid engineering cost documents,this paper designs a format conversion method to preprocess semi-structured tables and extract the cost information by manually defined rules.A BERT-Bi-LSTM-CRF model integrates bidirectional long short-term memory(Bi-LSTM),bidirectional encoder representations from transformers(BERT),and conditional random field(CRF).It is developed to perform named entity recognition(NER)on unstructured textual paragraphs and generate structured cost knowledge triplets.Then,these triplets are imported into the Neo4j graph database to construct a knowledge graph for power grid technical reform and maintenance engineering costs.Experimental results demonstrate that the proposed method achieves a precision rate of over 89.52%in recognizing diverse named entities from engineering cost paragraphs.The constructed knowledge graph effectively transforms tabular and textual data from various cost documents into nodes,entities,and attribute relationships,enabling intelligent retrieval of cost information.This study provides structured data support for engineering cost estimation,analysis,and decision-making in power grid.

关键词

工程造价/信息抽取/三元组/命名实体识别/知识图谱

Key words

engineering cost/information extraction/triplet/named entity recognition/knowledge graph

分类

信息技术与安全科学

引用本文复制引用

张丽萍,吴莉琳,李志翔,庞圣养,舒胜文..电网技改与检修工程造价文本的信息抽取与知识图谱构建[J].山东电力技术,2026,53(4):76-86,11.

基金项目

广东电网有限责任公司科技项目"生产项目造价智能分析技术研究及辅助造价系统开发"(030800KC23040012). Science and Technology Foundation of Guangdong Power Grid Co.,Ltd."Research on Intelligent Analysis Technology of Production Project Cost and Development of Auxiliary Cost System"(030800KC23040012). (030800KC23040012)

山东电力技术

1007-9904

访问量0
|
下载量0
段落导航相关论文