Modern Electronics Technique (现代电子技术), 2024, Vol. 47, Issue 19: 131-138 (8 pages). DOI: 10.16652/j.issn.1004-373x.2024.19.020
Research on a Transformer edge detection algorithm based on a pyramid structure
Duan Xuyan (段续延) 1, Yu Fuxing (于复兴) 2, Suo Yina (索依娜) 2
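Author information
- 1. College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, Hebei, China
- 2. College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, Hebei, China; Hebei Key Laboratory of Industrial Intelligent Perception, Tangshan 063210, Hebei, China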
Abstract
To address the difficulty of extracting multi-scale features and their low utilization in edge detection for complex images, a pyramid-structured Transformer edge detection model is proposed. The model adopts the pyramid vision transformer (PVT) network, which is good at modeling global long-range dependencies, as its feature extraction backbone in place of a traditional convolutional neural network (CNN), so as to improve the utilization of multi-scale features. A dedicated module for modeling and transferring contextual knowledge is designed to mine more discriminative information about salient edges and to take full account of cross-layer contextual feature interaction. A multi-scale feature enhancement module (MSFEM) based on the attention mechanism (AM) is designed to predict edges by fully exploiting the multi-level, multi-scale feature information of the objects under detection, which increases the model's edge detection accuracy. Moreover, the model's feature summation and concatenation operations do not occupy video memory or main memory, which speeds up inference. Extensive experiments were carried out on two public datasets, BSDS500 and BIPED. The ODS (optimal dataset scale) value for edge detection reached 0.796 on BSDS500 and 0.846 on BIPED. The experimental results show that the proposed algorithm outperforms the benchmark model.
Keywords
edge detection / Transformer / multi-scale feature extraction / CNN / PVT / multi-scale feature enhancement
Classification
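Information technology and security science
Cite this article
Duan Xuyan, Yu Fuxing, Suo Yina. Research on a Transformer edge detection algorithm based on a pyramid structure [J]. Modern Electronics Technique, 2024, 47(19): 131-138.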