首页|期刊导航|山西大学学报（自然科学版）|SSHGCN:基于音形异构图卷积的中文纠错方法

SSHGCN:基于音形异构图卷积的中文纠错方法

任俊黄瑞章

山西大学学报（自然科学版）2024，Vol.47Issue(3)：518-527,10.

山西大学学报（自然科学版）2024，Vol.47Issue(3)：518-527,10.DOI:10.13451/j.sxu.ns.2024003

SSHGCN:基于音形异构图卷积的中文纠错方法

SSHGCN:A Chinese Error Correction Method Based on Heterogeneous Graph Convolution with Phonological and Visual Features

任俊 ¹黄瑞章¹

作者信息

1. 贵州大学文本计算与认知智能教育部工程研究中心,贵州贵阳 550025||贵州大学公共大数据国家重点实验室,贵州贵阳 550025||贵州大学计算机科学与技术学院,贵州贵阳 550025
折叠

摘要

Abstract

Chinese spelling correction aims to detect and correct spelling errors in Chinese text.Existing methods have attempted to model character similarity as graph structure information.However,the graph structure of current methods ignores the deep phonetic proximity among Chinese characters and lacks a multimodal information fusion method that fully exploits the role of character sound and shape.Therefore,this paper obtains the phonetic similarity relationship based on the initial and final information of Chi-nese characters and the importance of pinyin,and combines the shape proximity relationship of Chinese characters to construct a Chinese character similar pinyin-shape proximity heterogeneous graph.The heterogeneous graph convolution is used on this graph to complement the use of the sound and shape information of Chinese characters,and fully integrate the tone and shape information of Chinese characters.This method surpasses all comparison methods in terms of sentence-level F1 score on the SIGHAN15 bench-mark,and is comparable to the best comparison method on the SIGHAN13 benchmark,verifying the effectiveness of this method.

关键词

中文拼写纠错/多模态信息融合方法/字符相似性/拼音相似关系

Key words

Chinese spelling correction/multimodal information fusion method/character similarity/pinyin similarity

分类

信息技术与安全科学

引用本文复制引用

任俊,黄瑞章..SSHGCN:基于音形异构图卷积的中文纠错方法[J].山西大学学报（自然科学版）,2024,47(3):518-527,10.

基金项目

国家自然科学基金(62066007) （62066007）

贵州省科技支撑计划项目(2022277) （2022277）

山西大学学报（自然科学版）

OA北大核心CSTPCD

ISSN：0253-2395

访问量5

下载量0

段落导航