计算机应用研究2012,Vol.29Issue(3):1002-1004,3.DOI:10.3969/j.issn.1001-3695.2012.03.055
人脸语音动画中基于PSOLA的情感语音合成系统
Emotional speech synthesis system based on PSOLA in facial speech animation
王华 1樊养余1
作者信息
- 1. 西北工业大学电子信息学院,西安710072
- 折叠
摘要
Abstract
This paper proposed a emotional speech synthesis system based on pitch synchronous overlap-add ( PSOLA) . Pro-sodic parameters could be changed in this system freely. First, analyzing pre-recorded emotional speech samples' it concluded some acoustic features associated closely with happiness, angry, surprise and sadness. Then it used TD(time domain)-PSOLA algorithm to change the speech prosodic parameters of neutral speeches. Especially, it proposed a approach to change the FO contour. Experiments demonstrates that the system is effective, which helps to express the facial speech animation more vividly.关键词
人脸语音动画/时域基音同步叠加/韵律参数/基频曲线/情感语音合成Key words
facial speech animation/ TD-PSOLA/ prosodic parameters/ FO contour/ emotional speech synthesis分类
信息技术与安全科学引用本文复制引用
王华,樊养余..人脸语音动画中基于PSOLA的情感语音合成系统[J].计算机应用研究,2012,29(3):1002-1004,3.