首页|期刊导航|电子学报|基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

孙锐谢瑞瑞张磊张旭东高隽

电子学报2023，Vol.51Issue(10)：2925-2935,11.

电子学报2023，Vol.51Issue(10)：2925-2935,11.DOI:10.12263/DZXB.20221190

基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

Cross-Modal Pedestrian Re-identification Pre-training Method Based on Catastrophic Forgetting and Combination Superimposed Erasure

孙锐 ¹谢瑞瑞 ¹张磊 ¹张旭东 ¹高隽¹

作者信息

1. 合肥工业大学计算机与信息学院,安徽合肥 230601||工业安全与应急技术安徽省重点实验室,安徽合肥 230009
折叠

摘要

Abstract

To meet the need of building a 24-hour full-time video surveillance system,cross-modal pedestrian recog-nition based on visible light and near-infrared is widely concerned by industry and academia.However,most of the current cross-modal pedestrian recognition tasks attempt to use pre-trained models on ImageNet to learn modal commonalities in ad-vance,but there are large modal differences between ImageNet and cross-modal pedestrian data,in the pre-training process,the color information is taken as one of the distinguishing features,which leads to the common features learned in the pre-training is not suitable for the information representation of the colorless infrared image.This paper proposes a self-super-vised cross-modal pedestrian recognition pre-training method based on catastrophic forgetting and combined superposition erasure.Firstly,the pre-training data are filtered by using the proposed catastrophic forgetting score,the aim is to reduce the domain gap between the pre-training data and the follow-up task data,and further reduce the training time of the model.Secondly,aiming at the key distinguishing feature extraction in traditional cross-modal identification,this paper designs a strong channel data enhancement strategy by erasing and combining the R,G and B channels at the channel level,multi-type samples with different colors are generated,which makes the model focus on texture information instead of color infor-mation.Finally,a self-supervised learning framework for cross-modal tasks is constructed based on the cross-modal data filtering index and channel enhancement strategy.The experimental results show that the ResNet50 network trained by the proposed pre-training method is superior to the current self-supervised pre-training methods when migrating to a large num-ber of cross-modal pedestrian recognition methods,on the basis of AGW,Rank1 and mAP were increased by 8.02％and 5.81％respectively.

关键词

自监督/行人重识别/跨模态/预训练/灾难性遗忘/组合叠加擦除

Key words

self-supervised/pedestrian re-identification/cross-modality/pre-training/catastrophic forgetting/strong channel combination

分类

信息技术与安全科学

引用本文复制引用

孙锐,谢瑞瑞,张磊,张旭东,高隽..基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法[J].电子学报,2023,51(10):2925-2935,11.

基金项目

国家自然科学基金面上项目(No.61876057) （No.61876057）

安徽省自然科学基金(No.2208085MF158) （No.2208085MF158）

安徽省重点研发计划-科技强警专项项目(No.202004d07020012)National Natural Science Foundation of China(No.61876057) （No.202004d07020012）

National Natural Science Foun-dation of Anhui(No.2208085MF158) （No.2208085MF158）

Key Research and Development Plan of Anhui Province(No.202004d07020012) （No.202004d07020012）

电子学报

OA北大核心CSCDCSTPCD

ISSN：0372-2112

访问量0

下载量0

段落导航