面向骨架手势识别的全局时空可变形网络OACSTPCD
Global Spatio-Temporal Deformable Network for Skeleton-Based Gesture Recognition
基于骨架序列进行手势识别关键在于如何融合时空信息提取可分辨性强的特征.该文提出关键点聚焦模块,通过全局上下文建模和不受限于固定形式的卷积方式,网络可以跨越多帧和不相关的关键点,在全局范围内自适应地聚合与手势动作密切相关的关键点信息,提取手势的时空特征.实验表明该方法在ChaLearn2013和SHREC数据集上得到的准确率可以达到94.88%和95.23%,优于现有方法.此外,该方法在处理噪声数据和动态手势方面稳定性更好.
The key of gesture recognition based on skeleton sequence is how to fuse spatio-temporal information and extract discriminate features.This paper proposes a key point focusing module.Through the global context modeling and the convolution method not limited to the fixed form,the network can span multiple frames and irrelevant key points,adaptively aggregate key point information closely related to gesture actions in the global scope,and extract the spatio-temporal characteristics of gesture.Experiments on Chalearn2013 and SHREC datasets show that the accuracy of our proposed method can reach 94.88% and 95.23%,and the method outperforms state-of-the-art methods.In addition,the method has better stability in dealing with noisy data and dynamic gestures.
石东子;林宏辉;刘一江;张鑫
华南理工大学电子与信息学院,广州 510640华南理工大学电子与信息学院,广州 510640||人工智能与数字经济广东省实验室,广州 510640
计算机与自动化
手势识别特征提取可变形卷积骨架序列全局信息
gesture recognitionfeatures extractiondeformable convolutionskeleton sequenceglobal information
《电子科技大学学报》 2024 (001)
60-66 / 7
中央高校基本科研业务费交叉学科研究项目(2022ZYGXZR104);广东省数字孪生人重点实验室项目(2022B1212010004)
评论