计算机工程与科学, 2025, Vol. 47, Issue 3: 524-533, 10. DOI: 10.3969/j.issn.1007-130X.2025.03.014
基于失匹配负波潜伏期优化的语音增强研究
Optimization of speech enhancement based on mismatched negative latency
Abstract
To address the mismatch between the loss functions used in existing speech enhancement and the metrics used to evaluate it, this work improves the performance of the speech enhancement algorithm by combining a speech evaluation index derived from an EEG component with the loss function. First, the latency of the mismatch negativity (MMN), an EEG component, is verified as a usable objective index for evaluating speech. An MMN latency function is proposed and linked to the signal-to-noise ratio, which addresses the problem that commonly used evaluation metrics cannot serve directly as loss functions for optimizing a speech enhancement algorithm. Second, the latency function is trained jointly with the learning targets of a conventional neural network and is optimized continually during training. Finally, the latency function is applied to the loss function of the discriminator in a generative adversarial network. Combined with Conformer, the model can effectively capture long-term dependencies and extract local features along both the time and frequency dimensions. Experimental results show that using the objective measure based on EEG component evaluation effectively improves the characteristics of the enhanced speech. The effectiveness of the proposed algorithm is verified in terms of enhanced speech quality, intelligibility, and distortion.
Keywords
speech enhancement / mismatch negativity / speech quality assessment / generative adversarial network
Classification
Information Technology and Security Science
Cite this article
吉陈果, 贾海蓉, 裴意静, 段淑斐. 基于失匹配负波潜伏期优化的语音增强研究[J]. 计算机工程与科学, 2025, 47(3): 524-533, 10.
Funding
National Natural Science Foundation of China (12004275)
Natural Science Foundation of Shanxi Province (20210302123186, 202403021211098)