| 注册
首页|期刊导航|计算机工程与应用|CNN-Transformer特征融合多目标跟踪算法

CNN-Transformer特征融合多目标跟踪算法

张英俊 白小辉 谢斌红

计算机工程与应用2024,Vol.60Issue(2):180-190,11.
计算机工程与应用2024,Vol.60Issue(2):180-190,11.DOI:10.3778/j.issn.1002-8331.2211-0028

CNN-Transformer特征融合多目标跟踪算法

Multi-Object Tracking Algorithm Based on CNN-Transformer Feature Fusion

张英俊 1白小辉 1谢斌红1

作者信息

  • 1. 太原科技大学 计算机科学与技术学院,太原 030024
  • 折叠

摘要

Abstract

In convolutional neural network(CNN),convolution can efficiently extract local features of the object,but it is difficult to capture global representation;in the visual Transformer,the attention mechanism can capture long-distance fea-ture dependency,but will ignore local feature details.To solve the above problems,a multi-object tracking algorithm CTMOT(CNN transformer multi-object tracking)based on CNN-Transformer hybrid network for feature extraction and fusion is proposed.Firstly,the backbone network is adopted based on CNN and Transformer to extract the local and global features of the image respectively.Secondly,two way bridge module(TBM)is used to fully integrate two features.Then,the fused features are input to two parallel decoders for processing.Finally,the detection box and the tracking box output-ted by the decoder are matched to obtain final tracking result and complete the multi-target tracking task.Evaluated on MOT17,MOT20,KITTI and UA-DETRAC multi-object tracking datasets,the MOTA indicators of CTMOT algorithm have reached 76.4%,66.3%,92.36%and 88.57%respectively.It is equivalent to the SOTA method on the MOT dataset,and achieves the SOTA effect on the KITTI dataset.At the same time,the MOTP and IDs indicators have reached the SOTA effect on all datasets.In addition,since the object detection and correlation are completed at the same time,the object tracking can be carried out end-to-end,and the tracking speed can reach 35 FPS,which shows that CTMOT algorithm achieves a good balance in the real-time and accuracy of tracking,and has great potential.

关键词

多目标跟踪/Transformer/特征融合

Key words

multi-object tracking/Transformer/feature fusion

分类

信息技术与安全科学

引用本文复制引用

张英俊,白小辉,谢斌红..CNN-Transformer特征融合多目标跟踪算法[J].计算机工程与应用,2024,60(2):180-190,11.

基金项目

山西省基础研究计划项目(20210302123216) (20210302123216)

吕梁市引进高层次科技人才重点研发项目(2022RC08). (2022RC08)

计算机工程与应用

OA北大核心CSTPCD

1002-8331

访问量0
|
下载量0
段落导航相关论文