| 注册
首页|期刊导航|华南农业大学学报|基于单目视觉和改进YOLOv8-pose模型的篮筐位姿估计方法

基于单目视觉和改进YOLOv8-pose模型的篮筐位姿估计方法

陈琳 徐震 张春燕 吉晓升 成松松 黄嘉俊

华南农业大学学报2026,Vol.47Issue(1):106-117,12.
华南农业大学学报2026,Vol.47Issue(1):106-117,12.DOI:10.7671/j.issn.1001-411X.202504027

基于单目视觉和改进YOLOv8-pose模型的篮筐位姿估计方法

Basket pose estimation method based on monocular vision and improved YOLOv8-pose model

陈琳 1徐震 1张春燕 1吉晓升 1成松松 2黄嘉俊2

作者信息

  • 1. 上海工程技术大学机械与汽车工程学院,上海 201620
  • 2. 上海市农业机械研究所,上海 201106
  • 折叠

摘要

Abstract

[Objective]Currently,the handling operation of vegetable baskets after harvesting in protected greenhouses is still dominated by manual labor,which has problems such as low efficiency and high labor intensity,seriously restricting the large-scale and intelligent development of agricultural production.Developing a new type of agricultural robot with autonomous basket-grabbing functionality is a key technical path to break this bottleneck and improve agricultural production efficiency.Among them,achieving accurate pose estimation of baskets based on computer vision technology is the core premise and technical foundation for ensuring the stability and reliability of the robot's grabbing actions.However,the accuracy and real-time performance of existing pose estimation methods are difficult to meet the actual operational requirements in complex greenhouse environments,and further in-depth research and optimization are urgently needed.[Method]Based on the YOLOv8-pose baseline model,this approach estimated the basket's pose by detecting its feature points and integrating the PnP algorithm.Firstly,RGB images of baskets under diverse complex backgrounds were captured using a monocular camera to construct a dedicated dataset.Secondly,the Biformer module,GAM attention mechanism and Focaler_GIoU loss function were incorporated into the YOLOv8-pose framework to enhance keypoint detection robustness in challenging scenarios involving cluttered backgrounds and occlusions.Finally,leveraging the basket's predefined dimensional parameters and the detected 2D keypoint coordinates,the PnP algorithm was employed to solve for the 3D pose parameters of the basket in physical space.[Result]The mean average precision(mAP)and precision of keypoints were increased by 3.73 and 4.31 percentage points,respectively.The average positioning precision was increased by 5.20 pixels,and the root mean square error(RMSE)between these keypoints and manually identified keypoints was increased by 4.45 pixels on average.The pose estimation algorithm achieved higher accuracy when the camera was 1.7 to 1.9 m from the basket,highlighting the critical influence of relative distance between the camera and the basket on localization estimation precision.[Conclusion]The method proposed in this study can provide a low-cost and high-precision solution for basket pose estimation in a facility greenhouse scenario,and provide a technical support for agricultural robots to grasp the basket.

关键词

视觉识别/关键点检测/位姿估计/YOLOv8/农业机器人

Key words

Visual recognition/Keypoint detection/Pose estimation/YOLOv8/Agricultural robot

分类

信息技术与安全科学

引用本文复制引用

陈琳,徐震,张春燕,吉晓升,成松松,黄嘉俊..基于单目视觉和改进YOLOv8-pose模型的篮筐位姿估计方法[J].华南农业大学学报,2026,47(1):106-117,12.

基金项目

上海市农业科技创新项目(T2023215) (T2023215)

华南农业大学学报

1001-411X

访问量0
|
下载量0
段落导航相关论文