太原理工大学学报2017,Vol.48Issue(3):469-474,6.DOI:10.16355/j.cnki.issn1007-9432tyut.2017.03.026
情感语音数据库优化及PAD情感模型量化标注
Emotional Speech Database Optimization and Quantitative Annotation Based on PAD Emotion Model
摘要
Abstract
Emotional speech database is the foundation of emotional speech recognition research,it has great significance to establish a continuous dimension emotional speech database including cognitive psychological factors for improving the performance of the speech emotion recognition and human-computer interaction.In this paper,first,hearing screening was conducted on previously established TYUT2.0 database,then the database was optimized according to recognition rate threshold.The resultant emotional speech database with 237 speeches has four types of emotion including 62,58,57,and 60 speeches representing respectively sadness,anger,happiness and surprise.The speech of this database was marked by using PAD emotion model,giving a dimensional emotion database.Each speech has its identification rate and PAD value.Statistical results of PAD value prove the validity of this dimensional emotional speech database,which lays the foundation for studying emotional speech recognition in continuous dimension in the future.关键词
情感语音数据库/维度情感描述/PAD情感模型Key words
emotional speech database/dimensional emotion description/PAD emotion model分类
信息技术与安全科学引用本文复制引用
张雪英,张婷,孙颖,张卫,畅江..情感语音数据库优化及PAD情感模型量化标注[J].太原理工大学学报,2017,48(3):469-474,6.基金项目
国家自然科学基金资助项目(61376693) (61376693)