Specialized Large Language Model for Standardization of Locomotive Maintenance Data

陈傲 李晨 颜家云 彭联贴 田野 刘雷新元

Control and Information Technology, Issue (3): 72-79, 8. DOI: 10.13889/j.issn.2096-5427.2024.03.200

Author information

  • 1. Zhuzhou CRRC Times Electric Co., Ltd., Zhuzhou, Hunan 412001, China

Abstract

Standardization is a key step in analyzing locomotive overhaul data for reliability-centered maintenance (RCM). However, traditional manual methods face small sample sizes, non-standardized data formats, analytical complexity, and high labour costs, all of which hinder data standardization. Large language models (LLMs), which perform strongly in natural language understanding and complex task handling, have made great academic and industrial progress in recent years. This study first investigated how well LLMs extract information from locomotive overhaul data and reached three findings: the universal information extraction (UIE) LLM is suitable for information extraction in the locomotive overhaul field; enlarging the locomotive dataset improves UIE performance on this task; and balancing the types of fault labels does not notably improve it. Subsequent work addressed the difficulty of data annotation. Scripts were written to annotate the data automatically, and ChatGLM was used to standardize locomotive overhaul data, yielding BLEU-4, ROUGE-1, ROUGE-2, and ROUGE-L scores of 86.87%, 89.60%, 87.54%, and 94.26%, respectively, which meets the requirements of engineering applications. Finally, an auxiliary pre-processing tool for data standardization was developed, encapsulating the LLM to streamline the standardization process.
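
For readers unfamiliar with the reported metrics, the following is a minimal sketch of how a ROUGE-L F1 score can be computed from the longest common subsequence (LCS) of a reference and a model output. The abstract does not state the tokenization or the exact ROUGE implementation the authors used, so the function names, the whitespace tokenization, and the example record below are all hypothetical and serve only as an illustration.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(reference, candidate):
    """ROUGE-L F1 between two token lists (illustrative, unweighted F1)."""
    if not reference or not candidate:
        return 0.0
    lcs = lcs_length(reference, candidate)
    if lcs == 0:
        return 0.0
    precision = lcs / len(candidate)  # share of the output that matches
    recall = lcs / len(reference)     # share of the reference recovered
    return 2 * precision * recall / (precision + recall)


# Hypothetical standardized record and model output, not taken from the paper.
reference = "traction motor bearing overheated".split()
candidate = "traction motor overheated".split()
print(round(rouge_l_f1(reference, candidate), 4))  # 0.8571
```

For Chinese text, character-level tokens are a common choice in place of whitespace splitting; that choice affects the score and is an assumption here as well.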

Key words

locomotive overhaul data / reliability-centered maintenance (RCM) / large language model / data standardization / data preprocessing / information extraction

Classification

Transportation engineering

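Cite this article

陈傲, 李晨, 颜家云, 彭联贴, 田野, 刘雷新元. Specialized large language model for standardization of locomotive maintenance data [J]. Control and Information Technology, 2024, (3): 72-79, 8.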

Funding

Key Research and Development Program for Science and Technology Innovation of Hunan Province (2023GK2095)

Control and Information Technology, ISSN 2096-5427
