| 注册
首页|期刊导航|广东工业大学学报|双模态迭代交叉注意力融合集成框架

双模态迭代交叉注意力融合集成框架

蔡志宏 曾安 潘丹 叶嘉宇

广东工业大学学报2026,Vol.43Issue(2):1-11,11.
广东工业大学学报2026,Vol.43Issue(2):1-11,11.DOI:10.12052/gdutxb.250002

双模态迭代交叉注意力融合集成框架

Bimodal Iterative Cross-attention Fusion Ensemble Framework

蔡志宏 1曾安 1潘丹 2叶嘉宇1

作者信息

  • 1. 广东工业大学 计算机学院,广东 广州 510006
  • 2. 广东技术师范大学 电子与信息学院,广东 广州 510665
  • 折叠

摘要

Abstract

Alzheimer's disease(AD),as a progressive neurodegenerative disorder,presents significant challenges in early diagnosis and clinical intervention.In medical imaging,structural magnetic resonance imaging(sMRI)captures brain atrophy and structural alterations through high-resolution anatomical imaging,while fluorodeoxyglucose positron emission tomography(FDG-PET)effectively reflects functional changes by monitoring cerebral glucose metabolism.These two modalities hold complementary value in detecting AD-related pathological brain changes.However,existing multimodal AD classification models are limited by suboptimal feature fusion,insufficient inter-modal information interaction,and feature distribution discrepancies,hindering their diagnostic utility.To address these issues,a bimodal iterative cross-attention fusion ensemble framework(BICAFEF)is proposed.This framework comprises base classifiers and a meta-classifier.The base classifiers employ ResNet modules to extract features from sMRI and FDG-PET image patches.A spatial feature shrinking(SFS)module,integrating convolutional operations and adaptive aggregation pooling,is designed to reduce inter-modal redundancy and emphasize discriminative features.Additionally,an iterative cross-attention mechanism is constructed to dynamically capture and reinforce global dependencies and complementary information across modalities through multi-round iterations,thereby resolving the challenge of insufficiently exploiting inter-modal synergies and enhancing AD classification performance.To further improve whole-brain classification accuracy,the framework incorporates a meta-classifier to screen and ensemble base classifiers by discarding those with accuracy below 75%,retaining high-performance classifiers to boost robustness and precision.Visualization analyses validate the framework's focus on critical brain regions,demonstrating its capability to effectively identify AD-related pathological areas in sMRI and PET modalities.Experimental results show that the framework achieves a five-fold classification accuracy(ACC)of 94.3%,sensitivity(SEN)of 92.6%,specificity(SPE)of 96.3%,AUC of 97.5%,and Matthews correlation coefficient(MCC)of 88.7%in AD vs.healthy control(HC)classification,outperforming state-of-the-art multimodal frameworks.

关键词

阿尔茨海默症/多模态融合/迭代学习/交叉注意力机制/分类

Key words

Alzheimer's disease(AD)/multimodal fusion/iterative learning/cross-attention mechanism/classification

分类

信息技术与安全科学

引用本文复制引用

蔡志宏,曾安,潘丹,叶嘉宇..双模态迭代交叉注意力融合集成框架[J].广东工业大学学报,2026,43(2):1-11,11.

基金项目

国家自然科学基金资助项目(61976058) (61976058)

广州市科技计划项目(202103000034,202206010007,202002020090) (202103000034,202206010007,202002020090)

广东省科技计划项目(2021A1515012300,2019A050510041,2021B0101220006) (2021A1515012300,2019A050510041,2021B0101220006)

广东工业大学学报

1007-7162

访问量1
|
下载量0
段落导航相关论文