Zhejiang Electric Power, 2024, Vol. 43, Issue 1: 20-27, 8. DOI: 10.19585/j.zjdl.202401003
Homologous matching of recording channels in intelligent substations based on regular expression and Jaccard similarity coefficient
Abstract
To address the challenge of homologous matching between the dual sets of recording channels in intelligent substations of 220 kV and above, this paper presents a method based on regular expressions and the Jaccard similarity coefficient. To overcome the irregular naming of recording channels, regular expressions are used to preprocess the channel name texts into a standardized format. The Jieba word segmentation algorithm and stopword removal are then applied to eliminate potentially redundant information in the channel name texts. Subsequently, the Jaccard similarity coefficient is calculated between recording channel names, and homologous channels are screened out according to their similarity. To validate the proposed method, simulations are conducted using actual recording file data from the power grid. The results confirm the effectiveness of the proposed method in achieving homologous matching of recording channels in intelligent substations.
Keywords
homologous matching of recording channel / text matching / regular expression / Jaccard similarity coefficient
Cite this article
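Wang Guannan, Guo Lijuan, Peng Shurong, Chen Huixia, Huang Haoyu. Homologous matching of recording channels in intelligent substations based on regular expression and Jaccard similarity coefficient[J]. Zhejiang Electric Power, 2024, 43(1): 20-27, 8.
Funding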
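Science and Technology Project of State Grid Jiangxi Electric Power Co., Ltd. (52182022000A)
Key Project of the Hunan Provincial Department of Education (20A021)
General Program of the National Natural Science Foundation of China (52177069)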
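
The pipeline outlined in the abstract (regex normalization, word segmentation, stopword removal, Jaccard similarity, threshold screening) might be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the channel names, the stopword list, the regex rules, and the 0.6 threshold are all hypothetical, and a simple whitespace tokenizer stands in for Jieba so that the sketch runs with the standard library alone.

```python
import re
from typing import Dict, Iterable, Set, Tuple

# Hypothetical stopword list; the paper's actual list is not given here.
STOPWORDS = {"the", "of", "channel"}


def normalize(name: str) -> str:
    """Regex preprocessing: lowercase, map separators to spaces, collapse spaces."""
    name = name.lower()
    name = re.sub(r"[_\-/()\[\]]+", " ", name)  # runs of separators -> one space
    return re.sub(r"\s+", " ", name).strip()


def tokenize(name: str) -> Set[str]:
    """Stand-in for Jieba segmentation: split on spaces, then drop stopwords."""
    return {tok for tok in normalize(name).split(" ") if tok and tok not in STOPWORDS}


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity coefficient |A ∩ B| / |A ∪ B| (0.0 if both sets are empty)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def match_channels(
    set_a: Iterable[str], set_b: Iterable[str], threshold: float = 0.6
) -> Dict[str, Tuple[str, float]]:
    """For each name in set_a, keep the best-scoring name in set_b if it meets the threshold."""
    candidates = [(name_b, tokenize(name_b)) for name_b in set_b]
    matches: Dict[str, Tuple[str, float]] = {}
    for name_a in set_a:
        tokens_a = tokenize(name_a)
        best_name, best_score = None, 0.0
        for name_b, tokens_b in candidates:
            score = jaccard(tokens_a, tokens_b)
            if score > best_score:
                best_name, best_score = name_b, score
        if best_name is not None and best_score >= threshold:
            matches[name_a] = (best_name, best_score)
    return matches


# Hypothetical channel names from two recording devices.
recorder_a = ["220kV_Line1 Ia", "220kV_Line1 Ua"]
recorder_b = ["220kV-line1-(Ua) channel", "220kV-line1-(Ia) channel"]
matches = match_channels(recorder_a, recorder_b)
```

In this toy data the two differently formatted names for the same quantity reduce to identical token sets, so they pair with a similarity of 1.0, while the mismatched pair scores 0.5 and falls below the assumed threshold. For real Chinese channel names, the tokenizer would need an actual segmenter such as Jieba.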