语音识别与大语言模型融合技术研究综述OA北大核心

Review of Research on Fusion Technology of Speech Recognition and Large Language Models

中文摘要

英文摘要

在当今时代背景下,多种大语言模型层出不穷,推动了人工智能众多领域的发展和创新.归纳大语言模型在语音识别技术中的积极作用,并探讨其发展前景,可以为语音识别技术的发展提供创新思路.在目前主流的端到端语音识别模型中,常使用额外的语言模型对语音识别结果重打分或结合WFST算法辅助解码来提升语音识别结果的准确率.最新研究发现,将大型语言模型融入语音识别模型的端到端训练中,能够更好地提升语音识别结果的准确率.以浅融合、深度融合、冷融合三类语音识别与语言模型的…查看全部>>

In the current era,various large language models(LLMs)have emerged,driving the development and innova-tion in many fields of artificial intelligence.Summarizing the positive effects of LLMs in speech recognition technology and exploring its development prospects can provide innovative ideas for the advancement of speech recognition technology.In current mainstream end-to-end speech recognition models,additional language models are often used to rescore the s…查看全部>>

作者：王敬凯;秦董洪;白凤波;李路路;孔令儒;徐晨

作者单位：广西民族大学人工智能学院,南宁 530000广西民族大学人工智能学院,南宁 530000广西民族大学人工智能学院,南宁 530000广西民族大学人工智能学院,南宁 530000广西民族大学人工智能学院,南宁 530000广西民族大学人工智能学院,南宁 530000

分类：电子信息工程

中文关键词：语音识别大语言模型深度学习

英文关键词：speech recognitionlarge language modeldeep learning

刊名：《计算机工程与应用》 2025 (6)

页码/页数：53-63,11

基金：广西壮族自治区中央引导地方科技发展资金项目(桂科ZY24212045)广西科技基地和人才专项(桂科AD23026054).

DOI：10.3778/j.issn.1002-8331.2405-0145

您当前未登录！

去登录

点击加载更多...

语音识别与大语言模型融合技术研究综述OA北大核心

Review of Research on Fusion Technology of Speech Recognition and Large Language Models

评论