液晶与显示2025,Vol.40Issue(3):457-471,15.DOI:10.37188/CJLCD.2024-0182
类别级多目标刚体6D位姿估计方法
Estimation method of category-level multi-object rigid body 6D pose
摘要
Abstract
In order to solve the problems of poor scalability,low generality and high computational cost of the traditional method using single object CNN model,and optimize the performance of multi-objective method.In this paper,a single-stage network architecture for multi-objective 6D attitude estimation is proposed,and a multi-branch feature extraction decoder is designed to capture and aggregate detailed features effectively.This paper proposes a feature optimization and screening module,which filters input features to extract multi-scale features.Combining the above two,a new feature pyramid structure is designed to improve the overall performance of the network and improve the pose estimation effect of occlusion.The experiments are carried out on synthetic data set LINEMOD and Occluded LINEMOD.The results show that the proposed method has achieved significant improvement in the processing of blocked object scenes.Compared with the most advanced methods such as PyraPose,SD-Pose and CASAPose,the proposed method has increased the ADD/S-Recall index by 43.1%,16.1%and 12%,respectively.It performed better when the number of targets is small,increasing performance by 17%when the number of targets is 4.The ablation experiment further verifies the effectiveness of each module.By introducing multi-branch feature extraction decoder,feature optimization and screening module,and feature pyramid structure,the proposed single-stage multi-objective network architecture can process any number of targets by training only one network,and can perform 6D pose estimation better under the condition of synthetic data.Experimental results verify the effectiveness of the proposed method.关键词
6D位姿估计/多目标单阶段网络/多分支特征提取解码器/特征选择/合成数据Key words
6D pose estimation/multi-objective single-stage network/multi-drop feature extraction layer/feature selection/composite data分类
计算机与自动化引用本文复制引用
程硕,贾迪,杨柳,何德堃..类别级多目标刚体6D位姿估计方法[J].液晶与显示,2025,40(3):457-471,15.基金项目
国家自然科学基金(No.61601213) (No.61601213)
辽宁工程技术大学鄂尔多斯研究院校地科技合作培育项目(No.YJY-XD-2023-003) Supported by National Natural Science Foundation of China(No.61601213) (No.YJY-XD-2023-003)
University-local Government Scientific and Technical Cooperation Cultivation Project of Ordos Institute-LNTU(No.YJY-XD-2023-03) (No.YJY-XD-2023-03)