首页|期刊导航|液晶与显示|基于全局双组注意力的红外与可见光图像融合

基于全局双组注意力的红外与可见光图像融合

ZHAO Yang YANG Wengui GAO Cuiyun

液晶与显示2025，Vol.40Issue(12)：1840-1852,13.

液晶与显示2025，Vol.40Issue(12)：1840-1852,13.DOI:10.37188/CJLCD.2025-0195

基于全局双组注意力的红外与可见光图像融合

Infrared and visible image fusion based on global dual-group attention

ZHAO Yang ¹YANG Wengui ¹GAO Cuiyun¹

作者信息

1. School of Electronic and Information Engineering,Anhui Jianzhu University,Hefei 230601,China
折叠

摘要

Abstract

In complex scenarios,infrared and visible image fusion models often struggle to fully extract the characteristics of overall macro-structures(from infrared images)and local micro-details(from visible images),as well as to achieve synergy between these elements,which degrades fusion quality.To address this problem,this paper proposes a collaborative fusion principle based on scale specialization and designs a new fusion model based on an autoencoder architecture.The encoder and decoder of the model adopt a convolutional neural network(CNN)architecture.The model utilizes the global dual-group attention mechanism:after grouping feature maps by length and width to extract information,the generated inter-group channel attention map can achieve weighting of the feature maps,thereby generating new feature maps containing more large-scale global structural information.The model utilises a convolution mechanism with multi-scale pooling and dilation,using receptive fields of different sizes and implementing global average and median pooling operations,to extract small-scale local features in the image.The model utilizes a decoder to integrate the large-scale structure and small-scale details of densely connected layers and skip connections,enabling them to synergistically fuse and reconstruct the fused image.The experimental results demonstrate that,on the MSRS and TNO datasets,compared to the best results of other methods,the information entropy,mean gradient,and edge intensity were improved by 0.95%,6.28%,and 6.19%,and then by 1.75%,13.51%,and 11.75%respectively.Spatial frequency increased by 4.61%on the MSRS dataset,second only to the MDLSR-RFM method on the TNO dataset.These results validate the improvement in the quality of merged images in complex scenarios,as well as the increased robustness and generalization of the model.

关键词

红外与可见光图像融合/图像增强/全局双组注意力/空洞卷积

Key words

infrared and visible image fusion/image enhancement/global dual-group attention/dilated convolution

分类

信息技术与安全科学

引用本文复制引用

ZHAO Yang,YANG Wengui,GAO Cuiyun..基于全局双组注意力的红外与可见光图像融合[J].液晶与显示,2025,40(12):1840-1852,13.

基金项目

安徽省高校自然科学基金(No.KJ2019A0765) （No.KJ2019A0765）

安徽省古建筑智能感知与高维建模国际联合研究中心主任基金(No.GJZZX2024ZR02)Supported by Natural Science Foundation of Anhui Province Higher Education Institutions(No.KJ2019A0765) （No.GJZZX2024ZR02）

Anhui International Joint Research Center for Ancient Architecture Intellisencing and Multi-Dimensional Modeling(No.GJZZX2024ZR02) （No.GJZZX2024ZR02）

液晶与显示

OA北大核心

ISSN：1007-2780

访问量1

下载量0

段落导航