四川大学学报(自然科学版)2024,Vol.61Issue(3):195-205,11.DOI:10.19907/j.0490-6756.2024.033004
面向语种识别的声学特征提取改进研究
Optimization of acoustic feature extraction for language identification
摘要
Abstract
The dimensionalities of the acoustic feature matrix used in language identification studies are often very high.To address the issue of excessive dimensions in acoustic features for language identification,an im-proved method for acoustic feature extraction is proposed.By analyzing the statistical characteristics of some commonly used acoustic features and then combining with their extraction process as well as partial literature arguments,the improved features are obtained by calculating the mean value of each dimension of the features on the frame and then normalizing the vectors to eliminate the influence of the dimensions.This results in the optimization of the traditional feature matrix into a one-dimensional feature vector.Finally,based on the char-acteristics of the improved features,experiments for language identification are conducted using BP neural net-work and Support Vector Machine as the baseline systems on two distinct datasets.The experimental results show that,for the five commonly used acoustic features,the proposed improved method consistently achieves an average identification rate of 95.6%for Dataset1 and 90.2%for Dataset2 under the two models,even with a reduction of 99.8%in data volume,compared to the traditional approaches.In addition,the significant reduction in computational workload achieved by proposed method enhances the adaptability of the algorithm to embedded environments with relatively weak hardware facilities,thereby expanding for the applicability of the algorithm.关键词
语种识别/声学特征/统计特性/特征提取Key words
Language identification/Acoustic features/Statistical features/Feature extraction分类
信息技术与安全科学引用本文复制引用
周大春,邵玉斌,张昊阁,杜庆治..面向语种识别的声学特征提取改进研究[J].四川大学学报(自然科学版),2024,61(3):195-205,11.基金项目
云南省媒体融合重点实验室项目(320225403) (320225403)