首页|期刊导航|西安电子科技大学学报（自然科学版）|结合语音融合特征和随机森林的构音障碍识别

结合语音融合特征和随机森林的构音障碍识别

李东张雪英段淑斐闫密密

西安电子科技大学学报（自然科学版）2018，Vol.45Issue(3)：149-155,7.

西安电子科技大学学报（自然科学版）2018，Vol.45Issue(3)：149-155,7.DOI:10.3969/j.issn.1001-2400.2018.03.026

结合语音融合特征和随机森林的构音障碍识别

Dysarthria recognition combining speech fusion feature and random forest

李东 ¹张雪英 ¹段淑斐 ¹闫密密¹

作者信息

1. 太原理工大学信息工程学院,山西太原 030024
折叠

摘要

Abstract

This paper proposes a method for speech recognition combining the speech fusion feature and random forest to classify normal voices and voices with dysarthria.This work aimes at analyzing the differences about pronunciation between pathological people and normal people,and providing doctors with scientific and objective evidence for diagnosis and treatment.First,the proposed method uses pathological voice database developed by Toronto University as the corpus,then extracts five types of prosodic features and Mel Frequency Cepstrum Coefficient(MFCC),and calculats their statistical features,which composes the fusion feature.Finally,the random forest is used as the classifier.The results show that,compared with the single type of feature, the proposed fusion feature significantly optimizes the recognition performance,and after combining with the random forest,the classification accuracy for male reaches 99.21％,the classification accuracy for female reaches 98.97％,and comprehensive classification accuracy reaches 98.00％.Meanwhile,the research finds that the pronunciation of a patient when he/she speak short words is more accurate than when he/she speaks sentences.

关键词

韵律特征/梅尔频率倒谱系数/融合特征/随机森林/构音障碍识别

Key words

prosodic feature/Mel frequency cepstrum coefficient/fusion feature/random forest/dysarthria recognition

分类

信息技术与安全科学

引用本文复制引用

李东,张雪英,段淑斐,闫密密..结合语音融合特征和随机森林的构音障碍识别[J].西安电子科技大学学报（自然科学版）,2018,45(3):149-155,7.

基金项目

国家自然科学基金资助项目(61371193) （61371193）

山西省应用基础研究青年基金资助项目(201601D202045) （201601D202045）

西安电子科技大学学报（自然科学版）

OA北大核心CSCDCSTPCD

ISSN：1001-2400

访问量0

下载量0

段落导航