| 注册
首页|期刊导航|计算机应用研究|多分支骨架特征输入下的歧义行为识别

多分支骨架特征输入下的歧义行为识别

王超亚 韩华 王春媛 田瑾

计算机应用研究2025,Vol.42Issue(11):3497-3505,9.
计算机应用研究2025,Vol.42Issue(11):3497-3505,9.DOI:10.19734/j.issn.1001-3695.2025.03.0056

多分支骨架特征输入下的歧义行为识别

Ambiguous action recognition under multi-input-branch skeleton feature

王超亚 1韩华 1王春媛 1田瑾1

作者信息

  • 1. 上海工程技术大学电子电气工程学院,上海 201620
  • 折叠

摘要

Abstract

Skeleton-based human action recognition is a critical research topic in computer vision,focusing on extracting and learning discriminative skeletal features to achieve high-precision action classification.However,the presence of ambiguous ac-tions severely impacts classification accuracy.To address these challenges,this paper introduced two key innovations based on data optimization and computational complexity reduction as well as spatio-temporal feature refinement.Firstly,it employed a multiple-input-branches architecture in the early stage of data processing to facilitate early feature fusion,enabling the model to learn complementary information across different modalities more effectively.This design enhanced computational efficiency while improving the model's generalization ability.Secondly,to improve the recognition of highly similar actions,it proposed an ambig-uous-feature-refinement module to extract distinctive spatio-temporal features.This mechanism enhanced the model's sensitivity to action details,thereby achieving more refined spatio-temporal feature modeling.Finally,it evaluated the proposed ambiguous action recognition under multi-input-branch skeleton feature model(GCN+)on two large-scale public datasets,D60 and D120,covering four single-modal settings as well as their fused modalities.Experimental results demonstrate that:in single-modal set-tings,GCN+outperforms the baseline models,achieving a 2.6 percentage points accuracy improvement under the X-Sub eva-luation protocol on the D120 dataset,indicating superior robustness in recognizing actions across different subjects in complex en-vironments.In fused-modal settings,GCN+achieves a 3.2 percentage points increase in X-Sub accuracy and a 3.0 percentage points increase in X-Set accuracy on the D120 dataset,further validating its applicability in large-scale data scenarios and its out-standing performance in cross-subject and cross-view action recognition tasks.Overall,the experimental results confirm that GCN+exhibits strong generalization capability,high computational efficiency,and exceptional performance in ambiguous highly simi-lar actions,providing an effective and robust solution for skeleton-based action recognition in complex environments.

关键词

骨架识别/图卷积神经网络/多分支输入/歧义行为/模糊特征细化

Key words

skeleton recognition/graph convolution network/multiple-input-branches/ambiguous action/ambiguous fea-ture refinement

分类

计算机与自动化

引用本文复制引用

王超亚,韩华,王春媛,田瑾..多分支骨架特征输入下的歧义行为识别[J].计算机应用研究,2025,42(11):3497-3505,9.

基金项目

国家自然科学基金项目(62103257) (62103257)

上海市自然科学基金项目(22ZR1426200) (22ZR1426200)

计算机应用研究

OA北大核心

1001-3695

访问量2
|
下载量0
段落导航相关论文