| 注册
首页|期刊导航|湖北民族大学学报(自然科学版)|基于时间序列分配和多频率分量压缩的人体行为识别

基于时间序列分配和多频率分量压缩的人体行为识别

佘本杰 朱彦敏 宋健 华健

湖北民族大学学报(自然科学版)2024,Vol.42Issue(3):368-374,7.
湖北民族大学学报(自然科学版)2024,Vol.42Issue(3):368-374,7.DOI:10.13501/j.cnki.42-1908/n.2024.06.025

基于时间序列分配和多频率分量压缩的人体行为识别

Human Action Recognition Based on Time Series Allocation and Multi-frequency Component Compression

佘本杰 1朱彦敏 2宋健 1华健1

作者信息

  • 1. 安徽理工大学 计算机科学与工程学院,安徽 淮南 232001
  • 2. 安徽理工大学 机械工程学院,安徽 淮南 232001
  • 折叠

摘要

Abstract

In order to address the issues of time-series feature extraction and feature compression in three dimensional convolutional neural network(3D-CNN)-based human action recognition,an improved network stronger 3D-CNN-based approach for skeleton-based action recognition(S-PoseConv3D)was proposed.The network effectively improved the performance of the original PoseConv3D by introducing the time series allocation strategy(TSAS)and the multi-frequency component feature compression fusion(M3F).TSAS dynamically adjusted the weights of different time series in the feature extraction process,and calculating the weights of the time series to enhance the model′s ability to capture the spatial-temporal features of human behaviors.Meanwhile,the M3F utilized the discrete cosine transform to compress and fuse the multi-frequency components in the feature map,thereby retaining the spatial and temporal features of human behaviors and reducing feature loss due to global pooling.The network was experimented on the dataset built on top of gymnastic videos(FineGYM)and Nanyang university of technology skeleton behavior recognition dataset 60 classification extended subset(NTU60-XSub),and the results showed that the improved network improved the mean top-1 accuracy by 3.93%on the FineGYM dataset compared to the original PoseConv3D.It also demonstrated significant performance improvement on the NTU60-XSub dataset,which indicated higher accuracy and robustness,and it was well-suited for application in the field of human behavior recognition.This network can be applied in such fields as security monitoring and human-computer interaction.

关键词

行为识别/三维卷积神经网络/时空特征融合/注意力机制/离散余弦变换

Key words

action recognition/3D-CNN/spatial-temporal feature fusion/attention mechanisms/discrete cosine transformations

分类

信息技术与安全科学

引用本文复制引用

佘本杰,朱彦敏,宋健,华健..基于时间序列分配和多频率分量压缩的人体行为识别[J].湖北民族大学学报(自然科学版),2024,42(3):368-374,7.

基金项目

国家自然科学基金项目(52374155) (52374155)

安徽理工大学研究生创新基金项目(2023cx2132). (2023cx2132)

湖北民族大学学报(自然科学版)

2096-7594

访问量0
|
下载量0
段落导航相关论文