| 注册
首页|期刊导航|工程科学学报|基于多注意力的改进YOLOv5s小目标检测算法

基于多注意力的改进YOLOv5s小目标检测算法

马鸽 李洪伟 严梓维 刘志杰 赵志甲

工程科学学报2024,Vol.46Issue(9):1647-1658,12.
工程科学学报2024,Vol.46Issue(9):1647-1658,12.DOI:10.13374/j.issn2095-9389.2024.01.18.003

基于多注意力的改进YOLOv5s小目标检测算法

Improved small target detection algorithm based on multiattention and YOLOv5s for traffic sign recognition

马鸽 1李洪伟 1严梓维 2刘志杰 3赵志甲1

作者信息

  • 1. 广州大学机械与电气工程学院,广州 510006
  • 2. 广州市公用事业技师学院,广州 510199
  • 3. 北京科技大学智能科学与技术学院,北京100083
  • 折叠

摘要

Abstract

Traffic sign detection and recognition facilitates real-time monitoring and interpretation of various traffic signs on the road, such as those indicating speed limits, prohibition of overtaking, and navigation cues. This has substantial applications for autonomous driving and decision-making systems. Consequently, designing accurate and efficient algorithms for the automatic recognition of traffic signs is crucial in the intelligent transportation field. However, targets that need to be detected by traffic sign recognition applications are mostly small-sized, causing challenges regarding their automatic recognition. The YOLOv5s model, characterized by its minimal depth and narrowest feature map, has gained widespread popularity for executing detection owing to its features of being lightweight and easily portable. Furthermore, the YOLOv5s model uses an anchor-based prediction approach that uses anchor boxes of different sizes and shapes to regress and classify various targets. This method generates dense anchor boxes and enables the model to directly perform object classification and bounding box regression, thereby enhancing its target recall capability. Therefore, the anchor-based Yolov5s method has been applied to traffic sign detection; however, it suffers from issues such as false positives and missed detection. Detection of small targets continues to be a challenging aspect in current traffic sign recognition technology due to the following: small targets carry less information; detection of small targets requires high precision in positioning; and environmental noise may overwhelm the detection of small targets. To overcome the abovementioned issues, such as missed detection, false positives, and low detection accuracy, this study proposes a model called STD-YOLOv5s that is specifically designed for small target detection. First, by increasing the number of upsampling and prediction output layers, this model obtains abundant location information. This can enhance the global understanding of images and solve the issue of insufficient information associated with small targets. Second, the CA attention mechanism is added after each C3 module, whereas the Swin-T attention mechanism module is added before each output layer, increasing the model's ability to capture multilayer feature information and consequently improving its performance of small target detection. Finally, the accuracy of target localization is ensured using the SIoU penalty function, which considers the target shape and spatial relationships, thereby increasing the model's ability to capture the positional relationships among targets of different sizes in the image. The STD-YOLOv5s model was validated using the TT100K dataset by ablation and comparison experiments. Experimental results indicate that the proposed model not only maintains the lightweight nature and high detection speed of the YOLOv5s model but also achieves improvements in precision, recall, and average precision.

关键词

小目标检测/交通标志识别/注意力机制/YOLOv5s/深度学习

Key words

small target detection/traffic sign recognition/attention mechanism/YOLOv5s/deep learning

分类

信息技术与安全科学

引用本文复制引用

马鸽,李洪伟,严梓维,刘志杰,赵志甲..基于多注意力的改进YOLOv5s小目标检测算法[J].工程科学学报,2024,46(9):1647-1658,12.

基金项目

国家自然科学基金资助项目(NO.62173102,U20A20225) (NO.62173102,U20A20225)

工程科学学报

OA北大核心CSTPCD

2095-9389

访问量0
|
下载量0
段落导航相关论文