首页|期刊导航|计算机与数字工程|基于递归神经网络的端到端语音识别

基于递归神经网络的端到端语音识别

王子龙李俊峰张劭韡王宏岩王思杰

计算机与数字工程2019，Vol.47Issue(12)：3099-3106,8.

计算机与数字工程2019，Vol.47Issue(12)：3099-3106,8.DOI:10. 3969/j. issn. 1672-9722. 2019. 12. 031

基于递归神经网络的端到端语音识别

End-to-End Speech Recognition Based on Recurrent Neural Network

王子龙 ¹李俊峰 ¹张劭韡 ²王宏岩 ³王思杰²

作者信息

1. 国家电网有限公司营销部北京 100031
2. 国家电网有限公司客户服务中心天津 300306
3. 北京中电普华信息技术有限公司北京 100031
折叠

摘要

Abstract

This paper presents a speech recognition system that transcribes audio data directly from text. A recursive neural network(RNN)structure based on deep bidirectional long-term and short-term memory(LSTM)is combined with the objective function of connection time classification(CTC). The objective function is modified to minimize the expectation of the training net?work for any transcription loss function. Even in the absence of dictionaries or language models,word error rates can be directly opti?mized. In the absence of language information,the system achieves 27.3% word error rate(WER)for the wall street journal corpus, 21.9% under the condition of only allowing word dictionaries,and 8.2% under the ternary language model. By combining the pro?posed method with the benchmark system,the error rate is further reduced to 6.7%.

关键词

递归神经网络/语音识别/长短期记忆/连接时间分类/单词错误率

Key words

RNN/speech recognition/LSTM/CTC/WER

分类

信息技术与安全科学

引用本文复制引用

王子龙,李俊峰,张劭韡,王宏岩,王思杰..基于递归神经网络的端到端语音识别[J].计算机与数字工程,2019,47(12):3099-3106,8.

基金项目

国家自然科学基金项目(编号:51776082)资助. （编号:51776082）

计算机与数字工程

OACSTPCD

ISSN：1672-9722

访问量0

下载量0

段落导航