Improved MFCC Features and MLA Model for Speech Emotion Recognition

Fujian Computer (福建电脑), 2024, Vol. 40, Issue 1: 52-56, 5. DOI: 10.16707/j.cnki.fjpc.2024.01.010

Zhang Xiaoli (张晓莉)¹

Author information

1. Department of Educational Technology, Fujian Normal University, Fuzhou 350007

Abstract

Mel-frequency cepstral coefficients (MFCC) and their first-order differential features represent the static and dynamic information of speech, respectively, and are often used as emotional features in speech emotion recognition (SER). In the traditional MFCC feature extraction process, balancing the speech signal-to-noise ratio through manual parameter tuning can easily lead to overcompensation. This paper proposes two improved methods that yield EMFCC and AMFCC features, respectively. To achieve the best classification accuracy, an MLA model was constructed from a pooling layer, LSTM, and an attention mechanism; it can effectively capture the emotional information in the features. A mixed feature consisting of MFCC, its first-order differential features, and the two improved MFCC features achieved an unweighted accuracy of 81.79% on the CASIA corpus. The results of the ablation experiments indicate that, compared with other advanced recognition methods in the SER field, the improved MFCC features perform better.
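
As a rough illustration of the static-plus-dynamic feature idea mentioned in the abstract, the sketch below computes first-order differential (delta) features from an MFCC matrix and concatenates them with the static coefficients. It assumes the widely used regression formula for deltas and a window half-width of 2; the abstract does not state which formula or window the paper uses, and the EMFCC and AMFCC improvements are not reproduced here. The function name `delta` and the random input matrix are hypothetical stand-ins.

```python
import numpy as np

def delta(features: np.ndarray, width: int = 2) -> np.ndarray:
    """First-order differential of a (frames, coeffs) feature matrix.

    Assumed formula (common regression form, not taken from the paper):
        d[t] = sum_{n=1..width} n * (c[t+n] - c[t-n]) / (2 * sum_{n=1..width} n^2)
    Edge frames are repeated as padding.
    """
    num_frames = features.shape[0]
    padded = np.pad(features, ((width, width), (0, 0)), mode="edge")
    denom = 2.0 * sum(n * n for n in range(1, width + 1))
    out = np.zeros_like(features, dtype=float)
    for n in range(1, width + 1):
        ahead = padded[width + n : width + n + num_frames]   # c[t+n]
        behind = padded[width - n : width - n + num_frames]  # c[t-n]
        out += n * (ahead - behind)
    return out / denom

# Hypothetical stand-in for a real MFCC matrix: 100 frames x 13 coefficients.
rng = np.random.default_rng(0)
mfcc = rng.standard_normal((100, 13))

# Mixed feature: static MFCC stacked with its first-order differential.
mixed = np.concatenate([mfcc, delta(mfcc)], axis=1)
```

In practice the `mfcc` matrix would come from a real front end applied to the speech signal; only the delta computation and the concatenation step are shown.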

Keywords

Speech Emotion Recognition / MFCC / Long Short-Term Memory / Attention Mechanism

Classification

Information Technology and Security Science

Cite this article

Zhang Xiaoli. Improved MFCC Features and MLA Model for Speech Emotion Recognition[J]. Fujian Computer, 2024, 40(1): 52-56, 5.

Fujian Computer (福建电脑)

ISSN: 1673-2782
