首页|期刊导航|四川大学学报（自然科学版）|面向语种识别的声学特征提取改进研究

面向语种识别的声学特征提取改进研究

周大春邵玉斌张昊阁杜庆治

四川大学学报（自然科学版）2024，Vol.61Issue(3)：195-205,11.

四川大学学报（自然科学版）2024，Vol.61Issue(3)：195-205,11.DOI:10.19907/j.0490-6756.2024.033004

面向语种识别的声学特征提取改进研究

Optimization of acoustic feature extraction for language identification

周大春 ¹邵玉斌 ¹张昊阁 ¹杜庆治¹

作者信息

1. 昆明理工大学信息工程与自动化学院,昆明 650504
折叠

摘要

Abstract

The dimensionalities of the acoustic feature matrix used in language identification studies are often very high.To address the issue of excessive dimensions in acoustic features for language identification,an im-proved method for acoustic feature extraction is proposed.By analyzing the statistical characteristics of some commonly used acoustic features and then combining with their extraction process as well as partial literature arguments,the improved features are obtained by calculating the mean value of each dimension of the features on the frame and then normalizing the vectors to eliminate the influence of the dimensions.This results in the optimization of the traditional feature matrix into a one-dimensional feature vector.Finally,based on the char-acteristics of the improved features,experiments for language identification are conducted using BP neural net-work and Support Vector Machine as the baseline systems on two distinct datasets.The experimental results show that,for the five commonly used acoustic features,the proposed improved method consistently achieves an average identification rate of 95.6%for Dataset1 and 90.2%for Dataset2 under the two models,even with a reduction of 99.8%in data volume,compared to the traditional approaches.In addition,the significant reduction in computational workload achieved by proposed method enhances the adaptability of the algorithm to embedded environments with relatively weak hardware facilities,thereby expanding for the applicability of the algorithm.

关键词

语种识别/声学特征/统计特性/特征提取

Key words

Language identification/Acoustic features/Statistical features/Feature extraction

分类

信息技术与安全科学

引用本文复制引用

周大春,邵玉斌,张昊阁,杜庆治..面向语种识别的声学特征提取改进研究[J].四川大学学报（自然科学版）,2024,61(3):195-205,11.

基金项目

云南省媒体融合重点实验室项目(320225403) （320225403）

四川大学学报（自然科学版）

OA北大核心CSTPCD

ISSN：0490-6756

访问量4

下载量0

段落导航