首页|期刊导航|计算机工程与应用|融合空间与通道重构卷积和注意力的轻量型动物姿态估计

融合空间与通道重构卷积和注意力的轻量型动物姿态估计

宰清鹏徐杨

计算机工程与应用2025，Vol.61Issue(6)：282-294,13.

计算机工程与应用2025，Vol.61Issue(6)：282-294,13.DOI:10.3778/j.issn.1002-8331.2311-0057

融合空间与通道重构卷积和注意力的轻量型动物姿态估计

Lightweight Animal Pose Estimation with Integrated Spatial and Channel Reconstructive Con-volutions and Attention

宰清鹏 ¹徐杨²

作者信息

1. 贵州大学大数据与信息工程学院,贵阳 550025
2. 贵州大学大数据与信息工程学院,贵阳 550025||贵阳铝镁设计研究院有限公司,贵阳 550009
折叠

摘要

Abstract

The importance of animal pose estimation in fields such as behavioral ecology,animal health monitoring,and wildlife conservation has been increasingly emphasized.However,current mainstream algorithms for animal pose estima-tion tend to prioritize accuracy,leading to a continuous increase in network complexity and computational cost,which limits their application on mobile devices and embedded platforms.In response to this issue,this paper proposes a multi-scale animal pose estimation network called SPANet,which combines spatial and channel-reconstructing convolutions with pyramid split attention.Firstly,the bottleneck layer EPSAneck of the high-resolution network is redesigned by incor-porating pyramid split attention and coordinate attention mechanisms.This redesign not only reduces the computational cost caused by excessive use of large convolutional kernels but also enhances the ability of network to extract useful features.Secondly,the SCCAblock foundational module is introduced,which is based on spatial and channel-reconstructing convolutions as well as coordinate attention mechanisms.This module significantly reduces computational redundancy and memory access while enhancing information exchange between channels and spatial dimensions.Lastly,the fusion method of network output features is re-designed using deconvolution modules to further improve the accuracy of the network.Experimental results demonstrate that compared to the high-resolution network,the proposed network model achieves an average precision improvement of 1.8 percentage points on the AP10K test set,while reducing the floating-point operations by 48.5%and the number of model parameters by 67.0%.On the AnimalPose dataset,the floating-point operations are reduced by 49.5%,and the number of model parameters is reduced by 67.0%.The experimental data indi-cate that the proposed network model achieves a small-range improvement in prediction accuracy while reducing the complexity of the model.

关键词

动物姿态估计/轻量型/高分辨率/注意力机制/空间与通道重构卷积

Key words

animal pose estimation/lightweight/high-resolution/attention mechanism/spatial and channel reconstruc-tion convolution

分类

信息技术与安全科学

引用本文复制引用

宰清鹏,徐杨..融合空间与通道重构卷积和注意力的轻量型动物姿态估计[J].计算机工程与应用,2025,61(6):282-294,13.

基金项目

贵州省科技计划项目(黔科合支撑[2023]一般326). （黔科合支撑[2023]一般326）

计算机工程与应用

OA北大核心

ISSN：1002-8331

访问量9

下载量0

段落导航