An infrared and visible image fusion network based on multi-scale feature cascades and non-local attention

被引:1
|
作者
Xu, Jing [1 ,3 ]
Liu, Zhenjin [1 ,3 ]
Fang, Ming [2 ,3 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun, Peoples R China
[2] Changchun Univ Sci & Technol, Sch Artificial Intelligence, Changchun, Peoples R China
[3] Changchun Univ Sci & Technol, Zhongshan Inst, Machine Vis & Unmanned Syst Lab, Zhongshan, Peoples R China
关键词
convolutional neural nets; feature extraction; image fusion; image reconstruction; QUALITY ASSESSMENT; NEST;
D O I
10.1049/ipr2.13088
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, research on infrared and visible image fusion has mainly focused on deep learning-based approaches, particularly deep neural networks with auto-encoder architectures. However, these approaches suffer from problems such as insufficient feature extraction capability and inefficient fusion strategies. Therefore, this paper introduces a novel image fusion network to address the limitations of infrared and visible image fusion networks with auto-encoder architectures. In the designed network, the encoder employs a multi-branch cascade structure, and these convolution branches with different kernel sizes provide the encoder with an adaptive receptive field to extract multi-scale features. In addition, the fusion layer incorporates a non-local attention module that is inspired by the self-attention mechanism. With its global receptive field, this module is used to build a non-local attention fusion network, which works together with the l1${l}_1$-norm spatial fusion strategy to extract, split, filter, and fuse global and local features. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. This paper introduces a novel infrared and visible image fusion network to address the limitations of auto-encoder fusion networks. In the designed network, the encoder employs a multi-branch cascade structure with convolution kernels of different sizes to extract multi-scale features, and the fusion layer incorporates a non-local attention module alongside a spatial feature fusion strategy for both global and local feature fusion. Comparative experiments on the TNO and MSRS datasets demonstrate that the proposed method outperforms other state-of-the-art fusion approaches. image
引用
收藏
页码:2114 / 2125
页数:12
相关论文
共 50 条
  • [41] UNFusion: A Unified Multi-Scale Densely Connected Network for Infrared and Visible Image Fusion
    Wang, Zhishe
    Wang, Junyao
    Wu, Yuanyuan
    Xu, Jiawei
    Zhang, Xiaoqin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (06) : 3360 - 3374
  • [42] MFANet: Multi-scale feature fusion network with attention mechanism
    Wang, Gaihua
    Gan, Xin
    Cao, Qingcheng
    Zhai, Qianyu
    VISUAL COMPUTER, 2023, 39 (07): : 2969 - 2980
  • [43] Dual-Attention-Based Feature Aggregation Network for Infrared and Visible Image Fusion
    Tang, Zhimin
    Xiao, Guobao
    Guo, Junwen
    Wang, Shiping
    Ma, Jiayi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [44] MFANet: Multi-scale feature fusion network with attention mechanism
    Gaihua Wang
    Xin Gan
    Qingcheng Cao
    Qianyu Zhai
    The Visual Computer, 2023, 39 : 2969 - 2980
  • [45] MSAt-GAN: a generative adversarial network based on multi-scale and deep attention mechanism for infrared and visible light image fusion
    Li, Junwu
    Li, Binhua
    Jiang, Yaoxi
    Cai, Weiwei
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (06) : 4753 - 4781
  • [46] MSAt-GAN: a generative adversarial network based on multi-scale and deep attention mechanism for infrared and visible light image fusion
    Junwu Li
    Binhua Li
    Yaoxi Jiang
    Weiwei Cai
    Complex & Intelligent Systems, 2022, 8 : 4753 - 4781
  • [47] Image fusion method based on multi-scale non-local mean filter and shear direction filter
    Wang F.
    Cheng Y.-M.
    Kongzhi yu Juece/Control and Decision, 2017, 32 (12): : 2183 - 2189
  • [48] A Novel Multi-scale Feature Fusion Based Network for Hyperspectral and Multispectral Image Fusion
    Dong, Shuai
    Huang, Shaoguang
    Zhang, Jinhan
    Zhang, Hongyan
    PATTERN RECOGNITION AND COMPUTER VISION, PT XIII, PRCV 2024, 2025, 15043 : 530 - 544
  • [49] Infrared and Visible Image Fusion Based on Multi-scale Network with Dual-channel Information Cross Fusion Block
    Yang, Yong
    Kong, Xiangkai
    Huang, Shuying
    Wan, Weiguo
    Liu, Jiaxiang
    Zhang, Wang
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [50] A Non-Local Attention Feature Fusion Network for Multiscale Object Detection
    Wu, Xuke
    Xiong, Gang
    Tian, Bin
    Song, Bing
    Lu, Bo
    Liu, Sheng
    Zhu, Fenghua
    IEEE JOURNAL OF RADIO FREQUENCY IDENTIFICATION, 2022, 6 : 733 - 738