计算机工程与科学2024,Vol.46Issue(6):1121-1127,7.DOI:10.3969/j.issn.1007-130X.2024.06.019
融合音素的缅甸语语音识别文本纠错
Text error correction of Burmese speech recognition based on phoneme fusion
摘要
Abstract
The Burmese language speech recognition text contains a large number of homophones and space errors.General methods use text semantic information to correct erroneous characters,but they are not accurate in locating and correcting Burmese space and homophone errors.Considering that Bur-mese is a tonal language with tone information embedded within its phonemes,this paper proposes a method for correcting errors in Burmese language speech recognition text that incorporates phonemes.Parameter sharing strategy is used to jointly model the transcribed texts and theirs phonemes,phoneme information is used to assist in detecting and correcting Burmese homophones and space errors.Experi-mental results show that compared with ConvSeq2Seq method,the F1 value of the proposed method in the Burmese speech recognition correction task has increased by 85.97%,reaching 79.15%.关键词
缅甸语/语音识别文本纠错/音素/共享参数/BERTKey words
Burmese language/speech recognition text correction/phoneme/shared parameter/bidi-rectional encoder representations from transformers(BERT)分类
信息技术与安全科学引用本文复制引用
陈璐,董凌,王文君,王剑,余正涛,高盛祥..融合音素的缅甸语语音识别文本纠错[J].计算机工程与科学,2024,46(6):1121-1127,7.基金项目
国家自然科学基金(U21B2027,61972186) (U21B2027,61972186)
云南高新技术产业发展项目(201606) (201606)
云南省重大科技专项计划(202103AA080015,202302AD080003) (202103AA080015,202302AD080003)
云南省基础研究计划(202001AS070014) (202001AS070014)
云南省学术和技术带头人后备人才(202105AC160018) (202105AC160018)