Automatic data collection and annotation system for a pose estimation dataset designed for grasping detection

CHEN Peng¹, BAI Yong¹, SUN Hanxiang¹

Chinese Journal of Engineering (工程科学学报), 2024, Vol. 46, Issue 8: 1458-1468 (11 pages). DOI: 10.13374/j.issn2095-9389.2023.09.28.001

Author information

1. School of Artificial Intelligence and Data Science, Hebei University of Technology, Tianjin 300401, China

Abstract

Robotic grasping is widely used in fields such as logistics sorting, automated assembly, and medical surgery, and grasping detection is an important step in it. As depth cameras have become cheaper, they have increasingly been used for grasping detection, which has promoted pose estimation-based methods for robotic grasping. However, most publicly available RGB-D pose estimation datasets rely on expensive equipment such as 3D laser scanners to obtain 3D models of the target objects. Their annotation also relies heavily on manual work, which is time-consuming and labor-intensive and hinders the creation of large-scale datasets. To address these issues, this study implements an automatic dataset acquisition and annotation system for developing RGB-D image-based pose estimation methods for robotic grasping. The system is easy to deploy and does not require an expensive 3D laser scanner. RGB-D image sequences are captured with only an off-the-shelf depth camera, and the system automatically produces the reconstructed 3D model of the target object, the annotated pose information, and 2D image segmentation masks. As part of the automatic annotation algorithm, a novel minimum spanning tree-based normal propagation method is proposed to ensure consistent normal directions, which avoids the deformation or tearing of the reconstructed 3D surface that inconsistent normals cause. In the experiments, the system created a pose estimation dataset containing 84 objects and 8400 RGB-D images. For each object, the system annotated the 3D model, image segmentation masks, and 6D poses in every RGB-D image. The accuracy of the annotated segmentation masks was evaluated in two ways: by comparing them with the corresponding manually labeled masks, and by measuring the performance of an instance segmentation network trained on the annotated masks. The accuracy of the annotated poses was likewise evaluated in two ways: a point cloud registration task was performed to align the model point cloud with the scene point cloud using the annotated pose parameters, and a category-level pose estimation network was trained on the annotated pose parameters, so that its performance directly reflects the accuracy of the annotations. The experimental results show that the overlap between the annotated masks and the manually labeled masks exceeds 98%. In addition, a 100% alignment rate is achieved, meaning that the model point cloud can be aligned to any scene point cloud through the corresponding annotated pose parameters. These results demonstrate that the system designed and implemented in this paper can create a high-quality dataset for developing practical pose estimation solutions, and that it provides a solid data foundation for future research on and application of deep learning models for robotic grasping detection.

Key words

grasp detection/automatic labeling/3D reconstruction/pose estimation/segmentation mask

Classification

Information technology and security science

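Cite this article

Chen P, Bai Y, Sun H X. Automatic data collection and annotation system for a pose estimation dataset designed for grasping detection [J]. Chinese Journal of Engineering, 2024, 46(8): 1458-1468.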

Funding

National Natural Science Foundation of China (U20A20201)

Science and Technology Research Project of Higher Education Institutions of Hebei Province (QN2022048)

Chinese Journal of Engineering (工程科学学报)

Open access; indexed in the Peking University Core Journals list and CSTPCD

ISSN: 2095-9389