首页|期刊导航|郑州大学学报（工学版）|多模态数据融合的加工作业动态手势识别方法

多模态数据融合的加工作业动态手势识别方法OA北大核心CSTPCD

Dynamic Gesture Recognition Method for Machining Operations Based on Multi-modal Data Fusion

中文摘要

英文摘要

为了解决单模态数据所提供的特征信息缺乏而导致的识别准确率难以提高、模型鲁棒性较低等问题,提出了面向人机交互的加工作业多模态数据融合动态手势识别策略.首先,采用 C3D网络模型并在视频的空间维度和时间维度对深度图像和彩色图像两种模态数据进行特征提取;其次,将两种模态数据识别结果在决策层按最大值规则进行融合,同时,将原模型使用的 Relu激活函数替换为 Mish激活函数优化梯度特性;最后,通过 3 组对比实验得到 6 种动态手势的平均识别准确率为 96.8%.结果表明:所提方法实现了加工作业中动态手势识别的高准确率和高鲁棒性的目标,对人机交互技术在实际生产场景中的应用起到推动作用.

In order to solve the problem of difficulty in improving the recognition accuray and the low robustness of the model caused by the lack of feature information provided by single mode data,a dynamic gesture recognition strategy based on multi-modal data fusion of machining operations for human-computer interaction was proposed.Firstly,the C3D network model was used to extract features from the depth image and color image modal data based on the spatial and temporal dimensions of videos.Secondly,the recognition results of the two modal data were fused according to the maximum principle at the decision-making level.Meanwhile,the Relu activation function used in the original model was replaced by Mish activation function to optimize the gradient update effect.Finally,through three sets of comparative experiments,it was found that the average recognition accuracy of six dynamic gestures reached 96.8%.The results showed that the proposed method achieved the goal of high accuracy and high robust-ness of dynamic gesture recognition in machining operation,which would play a role in promoting the application of human-computer interaction technology in actual production scenes.

作者：张富强;曾夏;白筠妍;丁凯

作者单位：长安大学道路施工技术与装备教育部重点实验室,陕西西安 710064||长安大学智能制造系统研究所,陕西西安 710064

分类：机械工程

中文关键词：多模态数据融合加工作业动态手势识别C3DMish激活函数人机交互

英文关键词：multi-modal data fusionmachining operationdynamic gesture recognitionC3DMish activation functionhuman-computer interaction

刊名：《郑州大学学报（工学版）》 2024 (005)

页码/页数：30-36 / 7

基金： 国家重点研发计划项目(2021YFB3301702);陕西省科技重大专项(2018zdzx01-01-01)

DOI：10.13705/j.issn.1671-6833.2024.02.007

多模态数据融合的加工作业动态手势识别方法OA北大核心CSTPCD

Dynamic Gesture Recognition Method for Machining Operations Based on Multi-modal Data Fusion

评论