电子学报2023,Vol.51Issue(10):2925-2935,11.DOI:10.12263/DZXB.20221190
基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法
Cross-Modal Pedestrian Re-identification Pre-training Method Based on Catastrophic Forgetting and Combination Superimposed Erasure
摘要
Abstract
To meet the need of building a 24-hour full-time video surveillance system,cross-modal pedestrian recog-nition based on visible light and near-infrared is widely concerned by industry and academia.However,most of the current cross-modal pedestrian recognition tasks attempt to use pre-trained models on ImageNet to learn modal commonalities in ad-vance,but there are large modal differences between ImageNet and cross-modal pedestrian data,in the pre-training process,the color information is taken as one of the distinguishing features,which leads to the common features learned in the pre-training is not suitable for the information representation of the colorless infrared image.This paper proposes a self-super-vised cross-modal pedestrian recognition pre-training method based on catastrophic forgetting and combined superposition erasure.Firstly,the pre-training data are filtered by using the proposed catastrophic forgetting score,the aim is to reduce the domain gap between the pre-training data and the follow-up task data,and further reduce the training time of the model.Secondly,aiming at the key distinguishing feature extraction in traditional cross-modal identification,this paper designs a strong channel data enhancement strategy by erasing and combining the R,G and B channels at the channel level,multi-type samples with different colors are generated,which makes the model focus on texture information instead of color infor-mation.Finally,a self-supervised learning framework for cross-modal tasks is constructed based on the cross-modal data filtering index and channel enhancement strategy.The experimental results show that the ResNet50 network trained by the proposed pre-training method is superior to the current self-supervised pre-training methods when migrating to a large num-ber of cross-modal pedestrian recognition methods,on the basis of AGW,Rank1 and mAP were increased by 8.02%and 5.81%respectively.关键词
自监督/行人重识别/跨模态/预训练/灾难性遗忘/组合叠加擦除Key words
self-supervised/pedestrian re-identification/cross-modality/pre-training/catastrophic forgetting/strong channel combination分类
信息技术与安全科学引用本文复制引用
孙锐,谢瑞瑞,张磊,张旭东,高隽..基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法[J].电子学报,2023,51(10):2925-2935,11.基金项目
国家自然科学基金面上项目(No.61876057) (No.61876057)
安徽省自然科学基金(No.2208085MF158) (No.2208085MF158)
安徽省重点研发计划-科技强警专项项目(No.202004d07020012)National Natural Science Foundation of China(No.61876057) (No.202004d07020012)
National Natural Science Foun-dation of Anhui(No.2208085MF158) (No.2208085MF158)
Key Research and Development Plan of Anhui Province(No.202004d07020012) (No.202004d07020012)