首页|期刊导航|计算机应用与软件|基于QRNN-CTC的中文语音识别声学模型

基于QRNN-CTC的中文语音识别声学模型

王先欢孙自强

计算机应用与软件2023，Vol.40Issue(12)：184-188,262,6.

计算机应用与软件2023，Vol.40Issue(12)：184-188,262,6.DOI:10.3969/j.issn.1000-386x.2023.12.027

基于QRNN-CTC的中文语音识别声学模型

CHINESE SPEECH RECOGNITION ACOUSTIC MODEL BASED ON QRNN-CTC

王先欢 ¹孙自强¹

作者信息

1. 华东理工大学化工过程先进控制和优化技术教育部重点实验室上海 200237
折叠

摘要

Abstract

Aimed at the problem of insufficient processing time sequence ability of convolutional neural network(CNN)in speech recognition and high model complexity and difficulty of training in recurrent neural network(RNN)in speech recognition,a new kind of quasi-recurrent neural network and connectionist temporal classification(QRNN-CTC)acoustic model is proposed.It not only reduced the numbers of parameters but also ensured a certain cycle capability between time series.CTC was used to realize automatic alignment of input sequence and label,and dropout was introduced to prevent overfitting during training.The experimental results on the Thchs-30 dataset show that QRNN-CTC has a relative error rate of 9.8%lower than that of CNN-CTC,and the final word error rate is23.8%,and the training time is half of LSTM-CTC.

关键词

深度学习/语音识别/声学模型/准循环神经网络/连接时序分类

Key words

Deep learning/Speech recognition/Acoustic model/QRNN/CTC

分类

信息技术与安全科学

引用本文复制引用

王先欢,孙自强..基于QRNN-CTC的中文语音识别声学模型[J].计算机应用与软件,2023,40(12):184-188,262,6.

基金项目

中央高校基本科研业务费专项资金资助项目(222201917006). （222201917006）

计算机应用与软件

OA北大核心CSTPCD

ISSN：1000-386X

访问量5

下载量0

段落导航