A Cross-modal Video-Text Retrieval Model Based on the Cross-attention Mechanism

王盛 宋向辉 胡世雄 梁营力 孙晓亮

安全、健康和环境, 2025, Vol. 25, Issue 3: 20-26, 7. DOI: 10.3969/j.issn.1672-7932.2025.03.003

王盛 (1), 宋向辉 (2), 胡世雄 (3), 梁营力 (4), 孙晓亮 (5)

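Author information

  • 1. Information Engineering University (信息工程大学), Zhengzhou, Henan 450001
  • 2. Research Institute of Highway, Ministry of Transport (交通运输部公路科学研究院), Beijing 100088
  • 3. Huanghe Jiaotong University (黄河交通学院), Jiaozuo, Henan 454950
  • 4. 河南省中工设计研究院集团有限公司, Zhengzhou, Henan 450018
  • 5. 北京中交国通智能交通系统技术有限公司, Beijing 100082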

Abstract

In safety planning for dangerous goods transportation, accurately identifying the causes of traffic accidents is essential. Existing methods usually rely on combined analysis of traffic surveillance video and text data such as traffic accident reports, but the accuracy and efficiency of cross-modal data retrieval are not high. A cross-modal retrieval model based on the cross-attention mechanism is therefore proposed to improve cross-modal retrieval performance in the analysis of dangerous goods transport accidents. The model integrates traffic surveillance video with text data such as accident reports, and uses the cross-attention mechanism to extract the correspondence between video and text, thereby improving retrieval accuracy and efficiency. The model architecture comprises data preprocessing, feature extraction, the cross-attention mechanism, multi-modal feature fusion, fine-grained similarity calculation, and loss-function optimization. Experimental results show that, in retrieval tasks on the dangerous goods transport dataset, the proposed model outperforms the best benchmark model (HiT) by 1.3% on Recall@1 and Recall@5, and is significantly better than existing cross-modal retrieval methods such as CLIP. Ablation experiments further verify the key role of the cross-attention mechanism in improving retrieval accuracy and efficiency. This study provides strong support for safety planning and accident prevention in dangerous goods transportation.
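The abstract names cross-attention, similarity calculation, and Recall@K but gives no formulas, so the following is only a minimal sketch under stated assumptions, not the authors' model. It assumes standard scaled dot-product attention with text tokens as queries and video frames as keys and values, mean pooling, cosine similarity, and exactly one ground-truth video per text query. All function names (`cross_attention`, `score`, `recall_at_k`) are hypothetical and chosen for illustration.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention (assumed form).

    Each query vector (e.g. a text token) attends over the keys/values
    of the other modality (e.g. video frames) and receives a weighted
    average of the values.
    """
    scale = math.sqrt(len(keys[0]))
    dim = len(values[0])
    attended = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys])
        attended.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]
        )
    return attended


def mean_pool(vectors):
    """Average a list of vectors into one vector."""
    n = len(vectors)
    return [sum(v[j] for v in vectors) / n for j in range(len(vectors[0]))]


def cosine(a, b):
    """Cosine similarity of two non-zero vectors."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)))


def score(text_tokens, video_frames):
    """Text-to-video similarity: attend text over video, pool, compare.

    Mean pooling and cosine similarity are assumptions of this sketch.
    """
    attended = cross_attention(text_tokens, video_frames, video_frames)
    return cosine(mean_pool(text_tokens), mean_pool(attended))


def recall_at_k(sim, k):
    """Recall@K for text-to-video retrieval.

    sim[i][j] is the similarity of text query i to video j; video i is
    assumed to be the single correct match for query i.
    """
    hits = 0
    for i, row in enumerate(sim):
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(sim)
```

For example, with the similarity matrix `[[0.9, 0.1, 0.2], [0.8, 0.3, 0.5], [0.1, 0.2, 0.7]]`, queries 0 and 2 rank their own video first while query 1 ranks it third, so `recall_at_k(sim, 1)` is 2/3 and `recall_at_k(sim, 3)` is 1.0.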

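Keywords (Chinese)

dangerous goods transportation/cross-modal retrieval/traffic surveillance/cross-attention mechanism/accident analysis/task planning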

Key words

dangerous goods transportation/cross-modal retrieval/traffic surveillance/cross-attention mechanism/accident analysis/route planning

Classification

Computer Science and Automation

Cite this article

王盛, 宋向辉, 胡世雄, 梁营力, 孙晓亮. A Cross-modal Video-Text Retrieval Model Based on the Cross-attention Mechanism [J]. 安全、健康和环境, 2025, 25(3): 20-26, 7.

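Funding

National Natural Science Foundation of China, General Program (62272480): Research on Key Technologies for Visual Analysis of Network Asset Graphs of the Black and Gray Market (黑灰产网络资产图谱可视分析关键技术研究).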

安全、健康和环境, ISSN 1672-7932
