GLMAFuse: A Dual-Stream Infrared and Visible Image Fusion Framework Integrating Local and Global Features with Multi-Scale Attention

被引：0

作者：

Li, Fu ^{[1
,2
,3
]}

Gu, Yanghai ^{[4
]}

Zhao, Ming ^{[1
]}

Chen, Deji ^{[1
,3
]}

Wang, Quan ^{[1
]}

机构：

[1] Wuxi Univ, Sch Internet Things Engn, Wuxi 214105, Peoples R China

[2] Wuxi Univ, Jiangsu Engn Res Ctr Hyperconvergence Applicat & S, Wuxi 214105, Peoples R China

[3] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai 201804, Peoples R China

[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci & Technol, Nanjing 210044, Peoples R China

来源：

ELECTRONICS | 2024年 / 13卷 / 24期

关键词：

image fusion; global and local feature; multi-scale; dual-stream; attention mechanism; NETWORK; NEST;

D O I：

10.3390/electronics13245002

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Integrating infrared and visible-light images facilitates a more comprehensive understanding of scenes by amalgamating dual-sensor data derived from identical environments. Traditional CNN-based fusion techniques are predominantly confined to local feature emphasis due to their inherently limited receptive fields. Conversely, Transformer-based models tend to prioritize global information, which can lead to a deficiency in feature diversity and detail retention. Furthermore, methods reliant on single-scale feature extraction are inadequate for capturing extensive scene information. To address these limitations, this study presents GLMAFuse, an innovative dual-stream encoder-decoder network, which utilizes a multi-scale attention mechanism to harmoniously integrate global and local features. This framework is designed to maximize the extraction of multi-scale features from source images while effectively synthesizing local and global information across all layers. We introduce the global-aware and local embedding (GALE) module to adeptly capture and merge global structural attributes and localized details from infrared and visible imagery via a parallel dual-branch architecture. Additionally, the multi-scale attention fusion (MSAF) module is engineered to optimize attention weights at the channel level, facilitating an enhanced synergy between high-frequency edge details and global backgrounds. This promotes effective interaction and fusion of dual-modal features. Extensive evaluations using standard datasets demonstrate that GLMAFuse surpasses the existing leading methods in both qualitative and quantitative assessments, highlighting its superior capability in infrared and visible image fusion. On the TNO and MSRS datasets, our method achieves outstanding performance across multiple metrics, including EN (7.15, 6.75), SD (46.72, 47.55), SF (12.79, 12.56), MI (2.21, 3.22), SCD (1.75, 1.80), VIF (0.79, 1.08), Qbaf (0.58, 0.71), and SSIM (0.99, 1.00). These results underscore its exceptional proficiency in infrared and visible image fusion.

引用

页数：30

共 50 条

[41] DeepFake detection method based on multi-scale interactive dual-stream network
Cheng, Ziyuan
Wang, Yiyang
Wan, Yongjing
Jiang, Cuiling
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
[42] Multi-Scale Enhanced Dual-Stream Network for Facial Attribute Editing Localization
Huang, Jinkun
Luo, Weiqi
Huang, Wenmin
Xi, Ziyi
Wei, Kangkang
Huang, Jiwu
DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 151 - 165
[43] Infrared and Visible Image Fusion using Multi-Scale Decomposition and Visual Saliency Map
Chen, Yunfan
Xie, Han
Yeo, Donghoon
Shin, Hyunchul
2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 243 - 244
[44] An infrared and visible image fusion method based on multi-scale transformation and norm optimization
Li, Guofa
Lin, Yongjie
Qu, Xingda
INFORMATION FUSION, 2021, 71 : 109 - 129
[45] UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion
Wang, Zhishe
Wang, Junyao
Wu, Yuanyuan
Xu, Jiawei
Zhang, Xiaoqin
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3360 - 3374
[46] Infrared and Visible Image Fusion Using Multi-scale Decomposition and Partial Differential Equations
Trivedi G.
Sanghvi R.
International Journal of Applied and Computational Mathematics, 2024, 10 (4)
[47] Infrared and visible image fusion enhancement technology based on multi-scale directional analysis
Zhou Xin
Liu Rui-an
Chen Fin
PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4035 - 4037
[48] Hyperspectral unmixing algorithm based on channel multi-scale dual-stream autoencode
Gan, Yuquan
Wang, Yong
Yi, Chen
Wang, Quan
Zhang, Ji
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025,
[49] Image Inpainting with EMMA Attention and Multi-scale Fusion
Wei, Yun
Wang, Lulu
Wu, Kaijun
Shan, Hongquan
Tian, Bin
Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2024, 51 (12): : 87 - 97
[50] Research on Image Segmentation Method Based on Multi-Scale Feature Fusion and Dual Attention
Wang, Zhihong
Wang, Chaoying
Li, Jianxin
Wu, Tianxiang
Li, Jiajun
Huang, Hongxing
Jiang, Lai
Journal of Computers (Taiwan), 2024, 35 (06) : 45 - 54

← 1 2 3 4 5 →