首页|期刊导航|火力与指挥控制|基于细粒度图文对齐的多模态事件抽取方法

基于细粒度图文对齐的多模态事件抽取方法

曹健威孙英杰李凌寒曾维新胡艳丽

火力与指挥控制2025，Vol.50Issue(4)：135-140,149,7.

火力与指挥控制2025，Vol.50Issue(4)：135-140,149,7.DOI:10.3969/j.issn.1002-0640.2025.04.019

基于细粒度图文对齐的多模态事件抽取方法

Multimodal Event Extraction Method Based on Fine-grained Image-text Alignment

曹健威 ¹孙英杰 ¹李凌寒 ¹曾维新 ²胡艳丽¹

作者信息

1. 国防科技大学信息系统工程全国重点实验室,长沙 410073
2. 国防科技大学大数据与决策实验室,长沙 410073
折叠

摘要

Abstract

Multimodal event extraction aims to extract structured multimodal event information from image-text data,the core challenge of this task lies in bridging the gap between different modalities and establishing cross-modal associations.A multimodal event extraction method based on fine-grained image-text alignment is proposed,which consists of two stages:single modal information extraction and multimodal information fusion.First,textual event extraction and visual entity extraction models are employed to perform single modal information extraction,obtaining fine-grained event information from each modality.Subsequently,a multimodal pre-training model is used for fine-grained image-text alignment,to obtain multimodal event information.Experiments conducted on a multimodal event extraction dataset validate its effectiveness.

关键词

多模态事件抽取/图文对齐/多模态预训练模型/信息抽取/事件抽取

Key words

multimodal event extraction/image-text alignment/multimodal pre-trained model/information extraction/event extraction

分类

信息技术与安全科学

引用本文复制引用

曹健威,孙英杰,李凌寒,曾维新,胡艳丽..基于细粒度图文对齐的多模态事件抽取方法[J].火力与指挥控制,2025,50(4):135-140,149,7.

基金项目

国家自然科学基金资助项目(72471237) （72471237）

(72371245) （72371245）

火力与指挥控制

OA北大核心

ISSN：1002-0640

访问量4

下载量0

段落导航