东华大学学报(英文版)2025,Vol.42Issue(6):673-682,10.DOI:10.19884/j.1672-5220.202411014
多模态多视角3D手部姿态估计
Multi-Modal Multi-View 3D Hand Pose Estimation
王浩 1王萍 1于昊冉 1丁东 1向未名1
作者信息
- 1. 东华大学 信息科学与技术学院,上海 201620
- 折叠
摘要
Abstract
With the rapid progress of the artificial intelligence(AI)technology and mobile internet,3D hand pose estimation has become critical to various intelligent application areas,e.g.,human-computer interaction.To avoid the low accuracy of single-modal estimation and the high complexity of traditional multi-modal 3D estimation,this paper proposes a novel multi-modal multi-view(MMV)3D hand pose estimation system,which introduces a registration before translation(RT)-translation before registration(TR)jointed conditional generative adversarial network(cGAN)to train a multi-modal registration network,and then employs the multi-modal feature fusion to achieve high-quality estimation,with low hardware and software costs both in data acquisition and processing.Experimental results demonstrate that the MMV system is effective and feasible in various scenarios.It is promising for the MMV system to be used in broad intelligent application areas.关键词
3D手部姿态估计/配准网络/多模态/多视角/条件式生成对抗网络(cGAN)Key words
3D hand pose estimation/registration network/multi-modal/multi-view/conditional generative adversarial network(cGAN)分类
信息技术与安全科学引用本文复制引用
王浩,王萍,于昊冉,丁东,向未名..多模态多视角3D手部姿态估计[J].东华大学学报(英文版),2025,42(6):673-682,10.