GLMAFuse: A Dual-Stream Infrared and Visible Image Fusion Framework Integrating Local and Global Features with Multi-Scale Attention

被引:0
|
作者
Li, Fu [1 ,2 ,3 ]
Gu, Yanghai [4 ]
Zhao, Ming [1 ]
Chen, Deji [1 ,3 ]
Wang, Quan [1 ]
机构
[1] Wuxi Univ, Sch Internet Things Engn, Wuxi 214105, Peoples R China
[2] Wuxi Univ, Jiangsu Engn Res Ctr Hyperconvergence Applicat & S, Wuxi 214105, Peoples R China
[3] Tongji Univ, Minist Educ, Key Lab Embedded Syst & Serv Comp, Shanghai 201804, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci & Technol, Nanjing 210044, Peoples R China
来源
ELECTRONICS | 2024年 / 13卷 / 24期
关键词
image fusion; global and local feature; multi-scale; dual-stream; attention mechanism; NETWORK; NEST;
D O I
10.3390/electronics13245002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating infrared and visible-light images facilitates a more comprehensive understanding of scenes by amalgamating dual-sensor data derived from identical environments. Traditional CNN-based fusion techniques are predominantly confined to local feature emphasis due to their inherently limited receptive fields. Conversely, Transformer-based models tend to prioritize global information, which can lead to a deficiency in feature diversity and detail retention. Furthermore, methods reliant on single-scale feature extraction are inadequate for capturing extensive scene information. To address these limitations, this study presents GLMAFuse, an innovative dual-stream encoder-decoder network, which utilizes a multi-scale attention mechanism to harmoniously integrate global and local features. This framework is designed to maximize the extraction of multi-scale features from source images while effectively synthesizing local and global information across all layers. We introduce the global-aware and local embedding (GALE) module to adeptly capture and merge global structural attributes and localized details from infrared and visible imagery via a parallel dual-branch architecture. Additionally, the multi-scale attention fusion (MSAF) module is engineered to optimize attention weights at the channel level, facilitating an enhanced synergy between high-frequency edge details and global backgrounds. This promotes effective interaction and fusion of dual-modal features. Extensive evaluations using standard datasets demonstrate that GLMAFuse surpasses the existing leading methods in both qualitative and quantitative assessments, highlighting its superior capability in infrared and visible image fusion. On the TNO and MSRS datasets, our method achieves outstanding performance across multiple metrics, including EN (7.15, 6.75), SD (46.72, 47.55), SF (12.79, 12.56), MI (2.21, 3.22), SCD (1.75, 1.80), VIF (0.79, 1.08), Qbaf (0.58, 0.71), and SSIM (0.99, 1.00). These results underscore its exceptional proficiency in infrared and visible image fusion.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] DeepFake detection method based on multi-scale interactive dual-stream network
    Cheng, Ziyuan
    Wang, Yiyang
    Wan, Yongjing
    Jiang, Cuiling
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104
  • [42] Multi-Scale Enhanced Dual-Stream Network for Facial Attribute Editing Localization
    Huang, Jinkun
    Luo, Weiqi
    Huang, Wenmin
    Xi, Ziyi
    Wei, Kangkang
    Huang, Jiwu
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 151 - 165
  • [43] Infrared and Visible Image Fusion using Multi-Scale Decomposition and Visual Saliency Map
    Chen, Yunfan
    Xie, Han
    Yeo, Donghoon
    Shin, Hyunchul
    2018 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2018, : 243 - 244
  • [44] An infrared and visible image fusion method based on multi-scale transformation and norm optimization
    Li, Guofa
    Lin, Yongjie
    Qu, Xingda
    INFORMATION FUSION, 2021, 71 : 109 - 129
  • [45] UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion
    Wang, Zhishe
    Wang, Junyao
    Wu, Yuanyuan
    Xu, Jiawei
    Zhang, Xiaoqin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3360 - 3374
  • [46] Infrared and Visible Image Fusion Using Multi-scale Decomposition and Partial Differential Equations
    Trivedi G.
    Sanghvi R.
    International Journal of Applied and Computational Mathematics, 2024, 10 (4)
  • [47] Infrared and visible image fusion enhancement technology based on multi-scale directional analysis
    Zhou Xin
    Liu Rui-an
    Chen Fin
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 4035 - 4037
  • [48] Hyperspectral unmixing algorithm based on channel multi-scale dual-stream autoencode
    Gan, Yuquan
    Wang, Yong
    Yi, Chen
    Wang, Quan
    Zhang, Ji
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2025,
  • [49] Image Inpainting with EMMA Attention and Multi-scale Fusion
    Wei, Yun
    Wang, Lulu
    Wu, Kaijun
    Shan, Hongquan
    Tian, Bin
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2024, 51 (12): : 87 - 97
  • [50] Research on Image Segmentation Method Based on Multi-Scale Feature Fusion and Dual Attention
    Wang, Zhihong
    Wang, Chaoying
    Li, Jianxin
    Wu, Tianxiang
    Li, Jiajun
    Huang, Hongxing
    Jiang, Lai
    Journal of Computers (Taiwan), 2024, 35 (06) : 45 - 54