计算机工程与应用2017,Vol.53Issue(20):128-133,6.DOI:10.3778/j.issn.1002-8331.1705-0377
面向情感语音识别的非线性几何特征提取算法
Nonlinear geometric feature extraction algorithm for emotional speech recognition
摘要
Abstract
Aiming at addressing the limitations of the existing time domain and frequency domain attribute characteristics in distinguishing the emotional state,a nonlinear geometric feature extraction method based on phase space reconstruction theory is proposed.Firstly,the phase space is reconstructed by analyzing the minimum delay time and the embedded dimen-sion of the emotion speech signal.Secondly,five kinds of nonlinear geometric features based on trajectory contour are analyzed and extracted under reconstructed phase space. Finally, combining with the prosody features, MFCC features and chaotic characteristics, the experiments are designed to verify the ability of the proposed feature to distinguish the emotional state and to obtain the complete set of optimal features of the emotional information through the feature selec-tion.Five kinds of emotions(happy,sad,neutral,angry and fear)in the German speech library are selected as the experi-mental data source,and the support vector machine is used as the identification network.The experimental results show that the proposed feature can not only characterize the emotion difference in the speech signal,but also make up the defi-ciency of the existing feature in characterizing the emotional state compared with the prosody feature,the MFCC feature and the chaotic feature.关键词
相空间重构/情感语音识别/非线性几何特征/特征选择/最优特征集合Key words
phase space reconstruction/emotional speech recognition/nonlinear geometric features/feature selection/optimal feature set分类
信息技术与安全科学引用本文复制引用
宋春晓,孙颖..面向情感语音识别的非线性几何特征提取算法[J].计算机工程与应用,2017,53(20):128-133,6.基金项目
国家自然科学基金(No.61371193). (No.61371193)