首页|期刊导航|计算机工程与科学|结合混合注意力与多尺度特征的人体姿态估计

结合混合注意力与多尺度特征的人体姿态估计

谷学静栗燕茹杨蓝潇

计算机工程与科学2026，Vol.48Issue(3)：531-539,9.

计算机工程与科学2026，Vol.48Issue(3)：531-539,9.DOI:10.3969/j.issn.1007-130X.2026.03.015

结合混合注意力与多尺度特征的人体姿态估计

Human pose estimation combining mixed attention and multi-scale feature

谷学静 ¹栗燕茹 ²杨蓝潇²

作者信息

1. 华北理工大学电气工程学院,河北唐山 063210||唐山市数字媒体工程技术研究中心,河北唐山 063000
2. 唐山市数字媒体工程技术研究中心,河北唐山 063000||华北理工大学人工智能学院,河北唐山 063210
折叠

摘要

Abstract

To solve the problem of low accuracy of multi-person pose estimation in occlusion scenes,a human pose estimation model named DAW-YOLOPose,which combines mixed attention mechanism and multi-scale sequence feature is proposed.Firstly,the mixed local channel attention(MLCA)mech-anism is used to improve the backbone network of YOLOv8Pose,effectively capturing and transmitting spatial and channel information without increasing the number of model parameters,so as to improve the feature expression effect of the network.Secondly,a new multi-scale sequence feature fusion net-work is proposed to enhance the extraction ability of multi-scale feature information and integrate feature maps of different scales.Finally,the gradient gain allocation strategy of Wise-IoU v3 loss function is used to improve the ability to distinguish high-quality anchor frames and reduce the negative impact of low-quality samples on model training.The experimental results on MSCOCO dataset show that,com-pared with YOLOv8Pose,DAW-YOLOPose improves the mAP@0.5,mAP@0.5:0.95 and recall by 2.7 percentage points,1.4 percentage points and 1.9 percentage points respectively,achieving a better estimation effect.

关键词

YOLOPose/人体姿态估计/注意力机制/多尺度序列特征/损失函数

Key words

YOLOPose/human pose estimation/attention mechanism/multi-scale sequence feature/loss function

分类

信息技术与安全科学

引用本文复制引用

谷学静,栗燕茹,杨蓝潇..结合混合注意力与多尺度特征的人体姿态估计[J].计算机工程与科学,2026,48(3):531-539,9.

基金项目

唐山市科技创新团队培养计划(18130221A) （18130221A）

计算机工程与科学

ISSN：1007-130X

访问量0

下载量0

段落导航