| 注册
首页|期刊导航|自动化学报|基于语义概念关联的参考多目标跟踪方法

基于语义概念关联的参考多目标跟踪方法

LIN Jia-Cheng CHEN Jia-Jun LI Zhi-Yong WANG Yao-Nan

自动化学报2025,Vol.51Issue(12):2664-2678,15.
自动化学报2025,Vol.51Issue(12):2664-2678,15.DOI:10.16383/j.aas.c250118

基于语义概念关联的参考多目标跟踪方法

Semantic Conceptual Association-Based Method for Referring Multi-Object Tracking

LIN Jia-Cheng 1CHEN Jia-Jun 2LI Zhi-Yong 3WANG Yao-Nan4

作者信息

  • 1. College of Computer Science and Electronic Engineering,Hu-nan University,Changsha 410082
  • 2. School of Artificial Intelli-gence and Robotics,Hunan University,Changsha 410012
  • 3. College of Computer Science and Electronic Engineering,Hu-nan University,Changsha 410082||School of Artificial Intelli-gence and Robotics,Hunan University,Changsha 410012
  • 4. School of Artificial Intelli-gence and Robotics,Hunan University,Changsha 410012||Na-tional Engineering Research Center for Robot Visual Perception and Control Technology,Hunan University,Changsha 410082
  • 折叠

摘要

Abstract

Referring multi-object tracking(RMOT)is a task that jointly leverages language and visual modalities for object localization and tracking,aiming to accurately identify and continuously track specific objects in video frames according to natural language prompts.Although existing RMOT methods have achieved notable progress,their modeling of the conceptual granularity of language expressions remains limited,leading to insufficient semant-ic parsing when handling complex descriptions.To address this issue,we propose a semantic concept association-based RMOT framework,termed SCATrack.The framework introduces two key modules,namely the sharing se-mantic concept(SSC)module and the semantic concept auxiliary generation(SCG)module,to enhance the model's capability of deeply understanding language expressions,thereby improving both the continuity and robustness of tracking.Specifically,the SSC module performs semantic concept partitioning over the language expressions,en-abling the model to effectively distinguish different expressions conveying the same semantics and similar expres-sions across distinct semantics,which strengthens object discrimination under multi-granularity input conditions.The SCG module adopts a feature masking and generation mechanism to guide the model in learning representa-tions of multi-granularity language concepts,thereby improving its robustness and discriminative ability in com-plex language scenarios.Experimental results on two widely used benchmark datasets demonstrate that the pro-posed SCATrack significantly improves tracking performance in RMOT tasks,validating its effectiveness and su-periority.

关键词

参考多目标跟踪/多模态融合/对比学习/目标跟踪/提示场景理解

Key words

Referring multi-object tracking/multi-modal fusion/contrastive learning/object tracking/prompt scene understanding

引用本文复制引用

LIN Jia-Cheng,CHEN Jia-Jun,LI Zhi-Yong,WANG Yao-Nan..基于语义概念关联的参考多目标跟踪方法[J].自动化学报,2025,51(12):2664-2678,15.

基金项目

国家自然科学基金(U23A20341),湖南省重大科技攻关计划(2025QK1005)资助 Supported by National Natural Science Foundation of China(U23A20341)and Major Scientific and Technological Research Plan of Hunan Province(2025QK1005) (U23A20341)

自动化学报

OA北大核心

0254-4156

访问量0
|
下载量0
段落导航相关论文