基于卷积神经网络的嵌入式视觉感知交互系统设计与实现OA
针对语音智能助理无法提供周围环境的视觉感知问题,该文设计并实现一个视觉感知交互系统.该系统的基本结构由语音识别、语音播放、图像采集、中央处理控制等模块组成,具备语音交互、目标检测等功能.该系统设计选择语音识别专用芯片、利用卷积神经网络技术完成识别,采用基于图分割截块的算法进行目标分割.实验结果表明,系统性能良好,能够实现对周围环境的视觉感知并进行语音交互.
In order to solve the problem that the speech intelligent assistant is unable to provide visual perception of the surrounding environment,a visual perception interaction system is designed and implemented in this paper.The basic structure of the system consists of speech recognition,voice playback,image acquisition,central processing control and other modules,with voice interaction,target detection and other functions.In this system,the special chip for speech recognition is selected,the convolutional neural network technology is used to complete the recognition,and the algorithm based on graph segmentation is used to segment the target.The experimental results show that the system has good performance and can realize visual perception of the surrounding environment and voice interaction.
陶金;王智勇;林鸿生;周怡伶
海军士官学校,安徽 蚌埠 23301292682 部队,广东 湛江 524000
计算机与自动化
卷积神经网络视觉感知嵌入式语音识别图分割截块
convolutional neural networkvisual perceptionembeddedspeech recognitionimage segmentation block
《科技创新与应用》 2024 (003)
35-39 / 5
评论