Transactions of the Chinese Society for Agricultural Machinery (农业机械学报), 2024, Vol. 55, Issue 1: 47-54, 8. DOI: 10.6041/j.issn.1000-1298.2024.01.004
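Vision Detection Method for Picking Robots Based on Improved Faster R-CNN
(Chinese title: 基于改进Faster R-CNN的苹果采摘视觉定位与检测方法, "Visual Localization and Detection Method for Apple Picking Based on Improved Faster R-CNN")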
Abstract
To address the poor detection and positioning performance of fruit-picking robots in scenes where targets are densely distributed and fruits occlude one another, an improved Faster R-CNN method for fruit detection and positioning was proposed that introduces an efficient channel attention (ECA) mechanism and a multiscale feature fusion pyramid (FPN). Firstly, the commonly used VGG16 network was replaced with a ResNet50 residual network, which has strong representational capability and mitigates the network degradation problem, so that more abstract and richer semantic information could be extracted to enhance the model's detection ability for multiscale and small targets. Secondly, the ECA module was introduced to enable the feature extraction network to focus on effective local information in the feature map, reduce the interference of irrelevant targets, and improve the model's detection accuracy. Finally, a branch and leaf grafting data augmentation method was used to enlarge the apple dataset and address the shortage of image data. Based on the constructed dataset, a genetic algorithm was used to optimize K-means++ clustering and generate adaptive anchor boxes. Experimental results showed that the improved model achieved an average precision of 96.16% for graspable apples and 86.95% for non-graspable apples, and a mean average precision of 92.79%, which was 15.68 percentage points higher than that of the traditional Faster R-CNN. The positioning accuracy for graspable and non-directly graspable apples was 97.14% and 88.93%, respectively, which was 12.53 and 40.49 percentage points higher than that of the traditional Faster R-CNN. The model weight was reduced by 38.20% and the computation time by 40.7%. The improved model was therefore more suitable for use in the visual systems of fruit-picking robots.
Keywords
apple picking robot / target localization and detection / Faster R-CNN / attention mechanism / feature pyramid
Classification
Information Technology and Security Science
Cite this article
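LI Cuiming, YANG Ke, SHEN Tao, SHANG Zhengyu. Vision Detection Method for Picking Robots Based on Improved Faster R-CNN [J]. Transactions of the Chinese Society for Agricultural Machinery, 2024, 55(1): 47-54, 8.
Funding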
National Natural Science Foundation of China (52265065, 51765031)
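
Illustrative sketch
The abstract mentions K-means++ clustering for generating adaptive anchor boxes but gives no implementation details. The following is a minimal sketch of that idea only, under several assumptions that are not confirmed by the source: each labeled box is reduced to a (width, height) pair, the distance between a box and a centroid is taken as 1 - IoU (a common choice in anchor clustering), and the genetic-algorithm optimization step is omitted entirely. All function names and the sample box sizes are hypothetical.

```python
import random


def iou_wh(box, centroid):
    """IoU of two (width, height) boxes aligned at the same corner."""
    inter = min(box[0], centroid[0]) * min(box[1], centroid[1])
    union = box[0] * box[1] + centroid[0] * centroid[1] - inter
    return inter / union


def kmeanspp_init(boxes, k, rng):
    """K-means++ seeding: sample new seeds with probability ~ squared distance."""
    centroids = [rng.choice(boxes)]
    while len(centroids) < k:
        # Squared (1 - IoU) distance from each box to its nearest seed.
        d2 = [min(1.0 - iou_wh(b, c) for c in centroids) ** 2 for b in boxes]
        if sum(d2) == 0:
            # Every box coincides with a seed; fall back to a uniform pick.
            centroids.append(rng.choice(boxes))
        else:
            centroids.append(rng.choices(boxes, weights=d2, k=1)[0])
    return centroids


def cluster_anchors(boxes, k, iters=100, seed=0):
    """Cluster (w, h) pairs into k anchor sizes, sorted by area."""
    rng = random.Random(seed)
    centroids = kmeanspp_init(boxes, k, rng)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for b in boxes:
            # Assign each box to the centroid with the highest IoU.
            best = max(range(k), key=lambda i: iou_wh(b, centroids[i]))
            clusters[best].append(b)
        updated = []
        for i, members in enumerate(clusters):
            if members:
                mean_w = sum(w for w, _ in members) / len(members)
                mean_h = sum(h for _, h in members) / len(members)
                updated.append((mean_w, mean_h))
            else:
                updated.append(centroids[i])  # keep centroid of an empty cluster
        if updated == centroids:
            break  # assignments have stabilized
        centroids = updated
    return sorted(centroids, key=lambda c: c[0] * c[1])


def mean_best_iou(boxes, anchors):
    """Average IoU between each box and its best-matching anchor."""
    return sum(max(iou_wh(b, a) for a in anchors) for b in boxes) / len(boxes)


# Hypothetical box sizes in pixels, not taken from the paper's dataset.
boxes = [(20, 22), (24, 20), (18, 19), (60, 58), (64, 66),
         (55, 61), (120, 125), (110, 118), (130, 122)]
anchors = cluster_anchors(boxes, k=3)
print("anchors (w, h):", anchors)
print("mean best IoU:", mean_best_iou(boxes, anchors))
```

The mean best IoU gives a rough measure of how well a set of anchors fits the labeled boxes. A genetic algorithm such as the one the abstract refers to could use a score of this kind as its fitness function, although the source does not say which objective the authors used.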