| 注册
首页|期刊导航|电子学报|基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

孙锐 谢瑞瑞 张磊 张旭东 高隽

电子学报2023,Vol.51Issue(10):2925-2935,11.
电子学报2023,Vol.51Issue(10):2925-2935,11.DOI:10.12263/DZXB.20221190

基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法

Cross-Modal Pedestrian Re-identification Pre-training Method Based on Catastrophic Forgetting and Combination Superimposed Erasure

孙锐 1谢瑞瑞 1张磊 1张旭东 1高隽1

作者信息

  • 1. 合肥工业大学计算机与信息学院,安徽合肥 230601||工业安全与应急技术安徽省重点实验室,安徽合肥 230009
  • 折叠

摘要

Abstract

To meet the need of building a 24-hour full-time video surveillance system,cross-modal pedestrian recog-nition based on visible light and near-infrared is widely concerned by industry and academia.However,most of the current cross-modal pedestrian recognition tasks attempt to use pre-trained models on ImageNet to learn modal commonalities in ad-vance,but there are large modal differences between ImageNet and cross-modal pedestrian data,in the pre-training process,the color information is taken as one of the distinguishing features,which leads to the common features learned in the pre-training is not suitable for the information representation of the colorless infrared image.This paper proposes a self-super-vised cross-modal pedestrian recognition pre-training method based on catastrophic forgetting and combined superposition erasure.Firstly,the pre-training data are filtered by using the proposed catastrophic forgetting score,the aim is to reduce the domain gap between the pre-training data and the follow-up task data,and further reduce the training time of the model.Secondly,aiming at the key distinguishing feature extraction in traditional cross-modal identification,this paper designs a strong channel data enhancement strategy by erasing and combining the R,G and B channels at the channel level,multi-type samples with different colors are generated,which makes the model focus on texture information instead of color infor-mation.Finally,a self-supervised learning framework for cross-modal tasks is constructed based on the cross-modal data filtering index and channel enhancement strategy.The experimental results show that the ResNet50 network trained by the proposed pre-training method is superior to the current self-supervised pre-training methods when migrating to a large num-ber of cross-modal pedestrian recognition methods,on the basis of AGW,Rank1 and mAP were increased by 8.02%and 5.81%respectively.

关键词

自监督/行人重识别/跨模态/预训练/灾难性遗忘/组合叠加擦除

Key words

self-supervised/pedestrian re-identification/cross-modality/pre-training/catastrophic forgetting/strong channel combination

分类

信息技术与安全科学

引用本文复制引用

孙锐,谢瑞瑞,张磊,张旭东,高隽..基于灾难性遗忘及组合叠加擦除的跨模态行人重识别预训练方法[J].电子学报,2023,51(10):2925-2935,11.

基金项目

国家自然科学基金面上项目(No.61876057) (No.61876057)

安徽省自然科学基金(No.2208085MF158) (No.2208085MF158)

安徽省重点研发计划-科技强警专项项目(No.202004d07020012)National Natural Science Foundation of China(No.61876057) (No.202004d07020012)

National Natural Science Foun-dation of Anhui(No.2208085MF158) (No.2208085MF158)

Key Research and Development Plan of Anhui Province(No.202004d07020012) (No.202004d07020012)

电子学报

OA北大核心CSCDCSTPCD

0372-2112

访问量0
|
下载量0
段落导航相关论文