首页|期刊导航|计算机工程与应用|基于生成对抗网络的音频音质提升方法

基于生成对抗网络的音频音质提升方法

张逸谷毅韩芳王直杰

计算机工程与应用2019，Vol.55Issue(20)：240-244,270,6.

计算机工程与应用2019，Vol.55Issue(20)：240-244,270,6.DOI:10.3778/j.issn.1002-8331.1806-0468

基于生成对抗网络的音频音质提升方法

Audio Enhancement Based on Generative Adversarial Nets

张逸 ¹谷毅 ²韩芳 ¹王直杰³

作者信息

1. 东华大学信息科学与技术学院,上海 201620
2. 上海航天电子技术研究所,上海 201109
3. 东华大学信息科学与技术学院,上海 201620
折叠

摘要

Abstract

For the convenience of network transmission as well as the ease of server harddisks’burden, large amount of audio files are compressed, while the audio quality is decreased. This paper proposes an audio quality enhancement algorithm-ASRGAN(Audio Super Resolution Generative Adversarial Networks)-for MPEG-1 Layer 3 files. In this algorithm, the generative network and discrimitive network form a competitive learning, and alternative weighting training is used. Com-bined with dilation convolution and bidirectional recurrent neural network which evidently enhances the network’s ability in disposing of overlong sequence, the optimal audio quality restoration network is finally set up. This algorithm can reduce the network bandwidth and storage space used by audio files and meanwhile maintain a decent audio quality.

关键词

生成对抗网络(GAN)/音质提升/模型压缩

Key words

Generative Adversarial Nets(GAN)/audio enhancement/model compression

分类

信息技术与安全科学

引用本文复制引用

张逸,谷毅,韩芳,王直杰..基于生成对抗网络的音频音质提升方法[J].计算机工程与应用,2019,55(20):240-244,270,6.

基金项目

国家自然科学基金(No.11572084,No.11472061) （No.11572084,No.11472061）

中央高校基本科研业务费专项资金、东华大学"励志计划"(No.18D210402). （No.18D210402）

计算机工程与应用

OA北大核心CSCDCSTPCD

ISSN：1002-8331

访问量0

下载量0

段落导航