数据采集与处理 (Journal of Data Acquisition and Processing), 2024, Vol. 39, Issue 5: 1192-1203 (12 pages). DOI: 10.16337/j.1004-9037.2024.05.011
基于多核扩展卷积的无监督视频行人重识别
Unsupervised Video Person Re-identification Based on Multiple Kernel Dilated Convolution
Abstract
Person re-identification aims to identify specific individuals across surveillance cameras, but pose variations, occlusions, and background noise often lead to insufficient feature extraction. This paper proposes an unsupervised video-based person re-identification method that uses multiple kernel dilated convolution to represent individual differences and features more comprehensively and accurately. First, a pre-trained ResNet50 is employed as the encoder. To further enhance the encoder's feature extraction capability, a multiple kernel dilated convolution module is introduced. Enlarging the receptive field of the convolutional kernels allows the network to capture both local and global feature information more effectively, giving a more complete description of a person's appearance. Next, a decoder restores the high-level semantic information to a more fundamental feature representation, which strengthens the representation and improves performance under complex imaging conditions. Finally, a multi-scale feature fusion module at the decoder output merges features from adjacent layers, reducing the semantic gaps between feature channel layers and producing more robust feature representations. Offline experiments on three mainstream datasets show that the proposed method achieves significant improvements in both accuracy and robustness.
Keywords
person re-identification / multiple kernel dilated convolution / unsupervised learning / feature extraction / attention mechanism
Classification
Information technology and security science
Cite this article
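刘仲民, 张长凯, 胡文瑾. Unsupervised Video Person Re-identification Based on Multiple Kernel Dilated Convolution [J]. 数据采集与处理 (Journal of Data Acquisition and Processing), 2024, 39(5): 1192-1203 (12 pages).
Funding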
National Natural Science Foundation of China (62061042)
Open Fund Project of the Key Laboratory of Gansu Advanced Control for Industrial Processes (2022KX10)
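
Note on the receptive-field claim in the abstract: the following is a minimal sketch, not the authors' module, of why running several dilation rates in parallel mixes local and wider context. It uses the standard relation that a kernel of size k with dilation d spans k + (k - 1)(d - 1) input positions. The function names, the 1-D plain-list setting, and the dilation rates are all assumptions made for illustration; the paper's module operates on 2-D feature maps inside a network, and its actual kernel sizes and rates are not stated in the abstract.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Input span covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation):
    """Unpadded 1-D dilated cross-correlation on plain Python lists."""
    span = effective_kernel_size(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        # Kernel taps are placed `dilation` positions apart in the input.
        out.append(sum(w * signal[start + i * dilation]
                       for i, w in enumerate(kernel)))
    return out


def multi_branch(signal, kernel, dilations):
    """Apply the same kernel at several dilation rates (hypothetical branches)."""
    return {d: dilated_conv1d(signal, kernel, d) for d in dilations}


if __name__ == "__main__":
    # Illustrative rates only; they are not taken from the paper.
    for d in (1, 2, 4):
        print(d, effective_kernel_size(3, d))
```

With a 3-tap kernel, the illustrative rates 1, 2, and 4 cover 3, 5, and 9 input positions without adding weights, which is the sense in which parallel dilated branches see both fine detail and broader context.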