| 注册

基于WGAN的语音增强算法研究

王怡斐 韩俊刚 樊良辉

重庆邮电大学学报(自然科学版)2019,Vol.31Issue(1):136-142,7.
重庆邮电大学学报(自然科学版)2019,Vol.31Issue(1):136-142,7.DOI:10.3979/j.issn.1673-825X.2019.01.018

基于WGAN的语音增强算法研究

Algorithm research of speech enhancement based on WGAN

王怡斐 1韩俊刚 1樊良辉1

作者信息

  • 1. 西安邮电大学,西安 710121
  • 折叠

摘要

Abstract

Noisy speech can be seen as a combination of an independent noise signal and a speech signal in some way. Traditional speech enhancement techniques need to make assumptions of the independence and feature distribution of noisy and clean speech signals. Unreasonable assumptions may cause problems such as residue noise and speech distortion, resulting in poor speech enhancement. In addition, the randomness and mutability of noise itself also affect the robustness of traditional speech enhancement methods. To solve these problems, this paper uses the generative adversarial network to enhance the speech, and gives a speech enhancement method based on the WGAN to accelerate the training speed and stabilize the training process. The method does not need to manually extract acoustic features, and it improves generalization capability of the speech enhancement system. There is a good enhancement effect in both the matched noise set and the unmatched noise set. The experimental results show that the PESQ is increased by an average of 23. 97% based on this end to end speech enhancement training model.

关键词

语音增强/生成对抗网络/卷积神经网络/深度学习

Key words

speech enhancement/generative adversarial nets/convolution neural network/deep learning

分类

信息技术与安全科学

引用本文复制引用

王怡斐,韩俊刚,樊良辉..基于WGAN的语音增强算法研究[J].重庆邮电大学学报(自然科学版),2019,31(1):136-142,7.

基金项目

国家自然科学基金重点资助项目 (61136002) (61136002)

重庆邮电大学学报(自然科学版)

OA北大核心CSCDCSTPCD

1673-825X

访问量0
|
下载量0
段落导航相关论文