Computer Engineering, 2017, Vol. 43, Issue 12: 278-282, 291, 6. DOI: 10.3969/j.issn.1000-3428.2017.12.050
Emotional Speech Synthesis Method Based on PSOLA and DCT
Li Yong (李勇) 1, Wei Dang (魏珰) 1, Wang Liuyu (王柳渝) 1
Author information
- 1. College of Automation, Chongqing University of Posts and Telecommunications, Chongqing 400065
Abstract
Emotional speech synthesis aims to make synthesized speech more expressive. To synthesize more natural emotional speech, this paper proposes an emotional speech synthesis method that combines Pitch Synchronous Overlap Add (PSOLA) with the Discrete Cosine Transform (DCT). Emotional rules are established for happy, sad, and neutral speech by analyzing prosodic parameters, and the fundamental frequency, energy, and duration of each syllable of the neutral speech are modified according to these rules. The combined method first adjusts the pitch frequency of the pitch-marked speech using the DCT, and then uses the PSOLA algorithm to bring the pitch frequency closer to the target emotional fundamental frequency. Experimental results show that the proposed method outperforms the PSOLA algorithm: the subjective emotion recognition rate is higher and the synthesized emotional speech is of better quality.
Keywords
emotional speech synthesis/Discrete Cosine Transform (DCT)/Pitch Synchronous Overlap Add (PSOLA)/fundamental frequency/duration/energy
Classification
Information Technology and Security Science
Cite this article
Li Yong, Wei Dang, Wang Liuyu. Emotional Speech Synthesis Method Based on PSOLA and DCT[J]. Computer Engineering, 2017, 43(12): 278-282, 291, 6.
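The abstract does not say how the DCT is applied to the pitch contour, so the following is only an illustrative sketch of one plausible reading, not the authors' implementation. It assumes that the fundamental-frequency (F0) contour of a single syllable is transformed with an orthonormal DCT-II, that the first coefficient controls the mean pitch, and that the remaining coefficients control the contour shape. The function names and the scale factors `mean_scale` and `range_scale` are hypothetical and are not taken from the paper's emotional rules. The PSOLA resynthesis step that would follow is not shown.

```python
import math


def dct2(x):
    """Orthonormal DCT-II of a sequence (pure Python, O(N^2))."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def idct2(c):
    """Inverse of dct2 (orthonormal DCT-III)."""
    n = len(c)
    out = []
    for i in range(n):
        s = c[0] * math.sqrt(1.0 / n)
        s += sum(
            math.sqrt(2.0 / n) * c[k] * math.cos(math.pi * (i + 0.5) * k / n)
            for k in range(1, n)
        )
        out.append(s)
    return out


def modify_f0_contour(f0_hz, mean_scale, range_scale):
    """Scale the mean and the shape of one syllable's F0 contour in the DCT domain.

    Assumption: coefficient 0 carries the mean pitch, the other coefficients
    carry the deviation from the mean (rise, fall, curvature).
    """
    coeffs = dct2(f0_hz)
    coeffs[0] *= mean_scale
    for k in range(1, len(coeffs)):
        coeffs[k] *= range_scale
    return idct2(coeffs)


# Hypothetical neutral contour (Hz) and hypothetical "happy" scale factors.
neutral_f0 = [200.0, 210.0, 220.0, 215.0, 205.0]
happy_f0 = modify_f0_contour(neutral_f0, mean_scale=1.2, range_scale=1.5)
print([round(v, 1) for v in happy_f0])  # approximately [237.0, 252.0, 267.0, 259.5, 244.5]
```

In this sketch the mean pitch rises from 210 Hz to 252 Hz and each deviation from the mean is enlarged by a factor of 1.5. In a complete system, the modified contour would then serve as the target pitch for PSOLA-based resynthesis.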