浙江电力2024,Vol.43Issue(4):121-128,8.DOI:10.19585/j.zjdl.202404013
基于体素注意力网络的电力设备目标检测模型
An object detection model for power equipment based on SVGA-Net
摘要
Abstract
Convolutional neural networks(CNNs)struggle to efficiently capture contextual information of power equipment such as arresters and GIS inlet casings due to their limited receptive fields,thereby affecting detection performance.To address this issue,the paper introduces a Transformer-based voxel-graph attention network.Local attention and dilated attention mechanisms are proposed to respectively capture short-range and long-range feature correlations within image volume pixels,effectively expanding the attention scope while keeping computational costs unchanged.Additionally,submanifold voxel modules and sparse voxel modules are designed to extract feature information from non-empty voxel positions and empty voxel positions,respectively.Finally,through comparative analysis with mainstream models on the general datasets Waymo and KITTI,as well as on an image dataset from a transmission and transformation area in Yunnan Province,the superior performance of the proposed model in detect-ing power equipment is demonstrated.关键词
注意力网络/目标检测/几何流形/体积像素Key words
attention network/object detection/geometric manifold/volume pixel引用本文复制引用
陈勇,李松,晋伟平,谢珉,杨永昆..基于体素注意力网络的电力设备目标检测模型[J].浙江电力,2024,43(4):121-128,8.基金项目
云南电网有限责任公司科技项目(YNKJXM20210149) (YNKJXM20210149)