| 注册
首页|期刊导航|智能系统学报|面向智能座舱的多源混合模态数据集及层次化融合分类方法

面向智能座舱的多源混合模态数据集及层次化融合分类方法

赵荣峰 卢宝莉 唐小江 胡敏 李卫军 宁欣

智能系统学报2026,Vol.21Issue(1):83-94,12.
智能系统学报2026,Vol.21Issue(1):83-94,12.DOI:10.11992/tis.202507024

面向智能座舱的多源混合模态数据集及层次化融合分类方法

Multi-source hybrid-modality dataset and hierarchical fusion classification method for intelligent cockpits

赵荣峰 1卢宝莉 2唐小江 2胡敏 3李卫军 4宁欣1

作者信息

  • 1. 中国科学院半导体研究所人工智能与高速电路实验室,北京 100083||中国科学院大学材料科学与光电技术学院,北京 100049
  • 2. 中国科学院半导体研究所人工智能与高速电路实验室,北京 100083
  • 3. 北京中科睿途科技有限公司,北京 100096
  • 4. 中国科学院半导体研究所人工智能与高速电路实验室,北京 100083||中国科学院大学集成电路学院,北京 100049
  • 折叠

摘要

Abstract

The scarcity of open-source data for intelligent cockpits in the driving domain is characterized by limited modality dimensions,insufficient annotations,and restricted scene diversity.To address these challenges,a multi-source hybrid-modality dataset has been constructed.This dataset incorporates RGB,depth,and infrared visual data,along with structured textual data detailing vehicle information and driving scenarios.A dual-layer annotation scheme is applied to capture ten behavior categories.Leveraging this dataset,a hierarchical multi-modal fusion framework is proposed to en-hance feature extraction via cross-modal information exchange and semantically guided fusion mechanisms.Experi-ments on video classification tasks reveal significant improvements in environmental understanding when combining RGB data with additional modalities.Using the full range of modalities leads to a 15.75%increase in accuracy com-pared to using only RGB data.These results validate the effectiveness of the multi-source hybrid-modality dataset in ad-vancing intelligent cockpit systems.

关键词

智能座舱/数据集/多模态融合/视觉多模态/行为分类/危险行为/行为识别/多源数据

Key words

intelligent cockpit/dataset/multimodal fusion/visual multimodality/behavior classification/dangerous be-havior/behavior recognition/multi-source data

分类

信息技术与安全科学

引用本文复制引用

赵荣峰,卢宝莉,唐小江,胡敏,李卫军,宁欣..面向智能座舱的多源混合模态数据集及层次化融合分类方法[J].智能系统学报,2026,21(1):83-94,12.

基金项目

北京市自然科学基金-小米创新联合基金(L233036). (L233036)

智能系统学报

1673-4785

访问量0
|
下载量0
段落导航相关论文