安全、健康和环境2025,Vol.25Issue(3):20-26,7.DOI:10.3969/j.issn.1672-7932.2025.03.003
一种基于交叉注意力机制的跨模态视频-文本检索模型
A Cross-modal Video-Text Retrieval Model Based on the Cross-attention Mechanism
摘要
Abstract
In the task of safety planning of dangerous goods transportation,it is very important to accurately identify the cause of traffic accidents.The existing methods usually rely on the combination analysis of traffic ac-cident report,traffic surveillance video and other text data,but the accuracy and efficiency of cross-modal data retrieval are not high.Therefore,a cross-modal retrieval model based on cross-attention mechanism was pro-posed to improve the performance of cross-modal data retrieval in the process of dangerous goods transport acci-dent analysis.The model integrated text data such as traffic surveillance video and accident report,and used cross-attention mechanism to extract the corresponding relationship between video and text effectively,so as to improve the accuracy and efficiency of retrieval.The model architecture included data preprocessing,feature extraction,cross-attention mechanism,multi-modal feature fusion,fine similarity calculation and optimization loss function.The experimental results showed that the proposed model outperformed the best benchmark model(HiT)by 1.3%in the retrieval tasks on the dangerous goods transport dataset Recall@1 and Recall@5,which was significantly better than the existing cross-modal data retrieval methods such as CLIP.The ablation experi-ment further verified the key role of cross-attention mechanism in improving retrieval accuracy and efficiency.This study provided strong support for the safety planning and accident prevention of dangerous goods transporta-tion.关键词
危险品运输/跨模态检索/交通监控/交叉注意力机制/事故分析/任务规划Key words
dangerous goods transportation/cross-modal retrieval/traffic surveillance/cross-attention mechanism/accident analyze/route planning分类
计算机与自动化引用本文复制引用
王盛,宋向辉,胡世雄,梁营力,孙晓亮..一种基于交叉注意力机制的跨模态视频-文本检索模型[J].安全、健康和环境,2025,25(3):20-26,7.基金项目
国家自然科学基金(面上项目)(62272480),黑灰产网络资产图谱可视分析关键技术研究. (面上项目)