|国家科技期刊平台
首页|期刊导航|智慧农业(中英文)|复杂场景下害虫目标检测算法:YOLOv8-Extend

复杂场景下害虫目标检测算法:YOLOv8-ExtendOACSTPCD

Crop Pest Target Detection Algorithm in Complex Scenes:YOLOv8-Extend

中文摘要英文摘要

[目的/意义]实现复杂的自然环境下农作物害虫的识别检测,改变当前农业生产过程中依赖于专家人工感官识别判定的现状,提升害虫检测效率和准确率具有重要意义.针对农作物害虫目标检测具有目标小、与农作物拟态、检测准确率低、算法推理速度慢等问题,本研究提出一种基于改进YOLOv8的复杂场景下农作物害虫目标检测算法.[方法]首先通过引入GSConv提高模型的感受野,部分Conv更换为轻量化的幻影卷积(Ghost Convo-lution),采用HorBlock捕捉更长期的特征依赖关系,Concat更换为BiFPN(Bi-directional Feature Pyramid Network)更加丰富的特征融合,使用VoVGSCSP模块提升微小目标检测,同时引入CBAM(Convolutional Block Attention Module)注意力机制来强化田间虫害目标特征.然后使用Wise-IoU损失函数更多地关注普通质量样本,提高网络模型的泛化能力和整体性能.之后,对改进后的YOLOv8-Extend模型与YOLOv8原模型、YOLOv5、YOLOv8-GSCONV、YOLOv8-BiFPN、YOLOv8-CBAM进行对比,验证模型检测准确度和精度.最后将模型移植到边缘设备进行推理验证,在实际应用场景中验证模型的有效性.[结果和讨论]YOLOv8-Extend在对比实验中均取得良好的表现,其中与原模型对比实验中,精确率、召回率、mAP@0.5和mAP@0.5∶0.95评价指标分别提升2.6%、3.6%、2.4%和7.2%,表现突出,具有更好的检测效果.改进前后的模型分别运行在边缘计算设备JETSON ORIN NX 16 GB上并通过TensorRT加速相比,mAP@0.5提升4.6%,达到57.6 FPS,满足实时性检测要求.在复杂农业场景中YOLOv8-Extend模型具有更好的适应性,在实际采集数据中微小害虫与生长环境相似的害虫检测方面有明显优势,在困难数据检测方面准确率提高了11.9%.[结论]本研究提出的YOLOv8改进模型有效提高了检测精度和识别率同时保持了较高的运行效率,能够部署在边缘终端计算设备上实现农作物害虫的实时检测,也为其他小目标智能检测和模型结构优化提供参考和帮助.

[Objective]It is of great significance to improve the efficiency and accuracy of crop pest detection in complex natural environments,and to change the current reliance on expert manual identification in the agricultural production process.Targeting the problems of small target size,mimicry with crops,low detection accuracy,and slow algorithm reasoning speed in crop pest detection,a complex scene crop pest target detection algorithm named YOLOv8-Entend was proposed in this research. [Methods]Firstly,the GSConv was introduecd to enhance the model's receptive field,allowing for global feature aggregation.This mechanism enables feature aggregation at both node and global levels simultaneously,obtaining local features from neighboring nodes through neighbor sampling and aggregation operations,enhancing the model's receptive field and semantic understanding abili-ty.Additionally,some Convs were replaced with lightweight Ghost Convolutions and HorBlock was utilized to capture longer-term feature dependencies.The recursive gate convolution employed gating mechanisms to remember and transmit previous information,capturing long-term correlations.Furthermore,Concat was replaced with BiFPN for richer feature fusion.The bidirectional fusion of depth features from top to bottom and from bottom to top enhances the transmission of feature information acrossed different network layers.Utilizing the VoVGSCSP module,feature maps of different scales were connected to create longer feature map vectors,increas-ing model diversity and enhancing small object detection.The convolutional block attention module(CBAM)attention mechanism was introduced to strengthen features of field pests and reduce background weights caused by complexity.Next,the Wise IoU dynam-ic non-monotonic focusing mechanism was implemented to evaluate the quality of anchor boxes using"outlier"instead of IoU.This mechanism also included a gradient gain allocation strategy,which reduced the competitiveness of high-quality anchor frames and minimizes harmful gradients from low-quality examples.This approach allowed WIoU to concentrate on anchor boxes of average quality,improving the network model's generalization ability and overall performance.Subsequently,the improved YOLOv8-Extend model was compared with the original YOLOv8 model,YOLOv5,YOLOv8-GSCONV,YOLOv8-BiFPN,and YOLOv8-CBAM to validate the accuracy and precision of model detection.Finally,the model was deployed on edge devices for inference verification to confirm its effectiveness in practical application scenarios. [Results and Discussions]The results indicated that the improved YOLOv8-Extend model achieved notable improvements in accuracy,recall,mAP@0.5,and mAP@0.5:0.95 evaluation indices.Specifically,there were increases of 2.6%,3.6%,2.4%and 7.2%,respec-tively,showcasing superior detection performance.YOLOv8-Extend and YOLOv8 run respectively on the edge computing device JETSON ORIN NX 16 GB and were accelerated by TensorRT,mAP@0.5 improved by 4.6%,FPS reached 57.6,meeting real-time de-tection requirements.The YOLOv8-Extend model demonstrated better adaptability in complex agricultural scenarios and exhibited clear advantages in detecting small pests and pests sharing similar growth environments in practical data collection.The accuracy in detecting challenging data saw a notable increased of 11.9%.Through algorithm refinement,the model showcased improved capabili-ty in extracting and focusing on features in crop pest target detection,addressing issues such as small targets,similar background tex-tures,and challenging feature extraction. [Conclusions]The YOLOv8-Extend model introduced in this study significantly boosts detection accuracy and recognition rates while upholding high operational efficiency.It is suitable for deployment on edge terminal computing devices to facilitate real-time detec-tion of crop pests,offering technological advancements and methodologies for the advancement of cost-effective terminal-based auto-matic pest recognition systems.This research can serve as a valuable resource and aid in the intelligent detection of other small tar-gets,as well as in optimizing model structures.

张荣华;白雪;樊江川

京航创智(北京)科技有限公司,北京 102404,中国国家农业信息化工程技术研究中心,北京 100097,中国||数字植物北京市重点实验室,北京 100097,中国

植物保护学

YOLOv8害虫检测注意力机制边缘计算CBAMBiFPNVoVGSCSPGSConv

YOLOv8pest detectionattention mechanismedge computingCBAMBiFPNVoVGSCSPGSConv

《智慧农业(中英文)》 2024 (002)

49-61 / 13

北京市科技新星计划(Z211100002121065,Z20220484202);"十四五"国家重点研发计划项目(2022YFD2002302-02) Beijing Nova Program(Z211100002121065,Z20220484202);National Key Research and Development Program(2022YFD2002302-02)

10.12133/j.smartag.SA202311007

评论