西安电子科技大学学报(自然科学版)2018,Vol.45Issue(3):149-155,7.DOI:10.3969/j.issn.1001-2400.2018.03.026
结合语音融合特征和随机森林的构音障碍识别
Dysarthria recognition combining speech fusion feature and random forest
摘要
Abstract
This paper proposes a method for speech recognition combining the speech fusion feature and random forest to classify normal voices and voices with dysarthria.This work aimes at analyzing the differences about pronunciation between pathological people and normal people,and providing doctors with scientific and objective evidence for diagnosis and treatment.First,the proposed method uses pathological voice database developed by Toronto University as the corpus,then extracts five types of prosodic features and Mel Frequency Cepstrum Coefficient(MFCC),and calculats their statistical features,which composes the fusion feature.Finally,the random forest is used as the classifier.The results show that,compared with the single type of feature, the proposed fusion feature significantly optimizes the recognition performance,and after combining with the random forest,the classification accuracy for male reaches 99.21%,the classification accuracy for female reaches 98.97%,and comprehensive classification accuracy reaches 98.00%.Meanwhile,the research finds that the pronunciation of a patient when he/she speak short words is more accurate than when he/she speaks sentences.关键词
韵律特征/梅尔频率倒谱系数/融合特征/随机森林/构音障碍识别Key words
prosodic feature/Mel frequency cepstrum coefficient/fusion feature/random forest/dysarthria recognition分类
信息技术与安全科学引用本文复制引用
李东,张雪英,段淑斐,闫密密..结合语音融合特征和随机森林的构音障碍识别[J].西安电子科技大学学报(自然科学版),2018,45(3):149-155,7.基金项目
国家自然科学基金资助项目(61371193) (61371193)
山西省应用基础研究青年基金资助项目(201601D202045) (201601D202045)