Infrared and visible image fusion based on a two-stage class conditioned auto-encoder network

Cited by: 7
Authors
Cao, Yanpeng [1, 2]
Luo, Xing [1, 2]
Tong, Xi [1, 2]
Yang, Jiangxin [1, 2]
Cao, Yanlong [1, 2]
Affiliations
[1] Zhejiang Univ, Sch Mech Engn, State Key Lab Fluid Power Transmiss & Control, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Sch Mech Engn, Key Lab Adv Mfg Technol Zhejiang Prov, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Infrared imaging; Image fusion; Conditional learning; Auto-encoder; PERFORMANCE; ALGORITHM; NEST
DOI
10.1016/j.neucom.2023.126248
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing auto-encoder-based infrared and visible image fusion methods typically use a shared encoder to extract features from different modalities and adopt a handcrafted fusion strategy to fuse the extracted features into an intermediate representation before the decoder. In this paper, we present a novel two-stage class conditioned auto-encoder framework for high-quality multispectral fusion tasks. In the first training stage, we introduce a class embedding sub-branch to the encoder network to model the characteristics of different modalities and adaptively scale the intermediate features based on the input modality. Moreover, we design a cross-transfer residual block to promote the flow of content and texture information in the encoder and generate more representative features. In the second training stage, we insert a learnable fusion module between the pre-trained class conditioned encoder and decoder to replace the handcrafted fusion strategy. Specific intensity and gradient loss functions are used to tune the model so that distinctive deep features are fused in a data-driven manner. With the class conditioned auto-encoder and the two-stage training strategy, our proposed TS-ClassFuse better preserves distinctive information and features from the source images and reduces the training difficulty of simultaneously extracting informative features and determining the optimal fusion scheme. Experimental results verify the effectiveness of our method in both qualitative and quantitative evaluations. © 2023 Elsevier B.V. All rights reserved.
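As a rough illustration of two ideas named in the abstract (scaling intermediate features by input modality, and tuning with intensity and gradient losses), the following NumPy sketch may help. It is not the authors' implementation: the function names, the per-class embedding table, the forward-difference gradient, and the element-wise maximum targets are all assumptions chosen for illustration, and the abstract does not specify any of them.

```python
import numpy as np

def class_conditioned_scale(features, class_id, embedding):
    """Scale each channel of `features` (C, H, W) by a per-modality vector.

    `embedding` is a hypothetical (num_classes, C) table; in a real network
    it would be learned, e.g. class 0 = infrared, class 1 = visible.
    """
    scale = embedding[class_id]                 # shape (C,)
    return features * scale[:, None, None]     # broadcast over H and W

def gradient_magnitude(img):
    """L1 gradient magnitude from forward differences (assumed operator)."""
    gx = np.zeros_like(img)
    gy = np.zeros_like(img)
    gx[:, :-1] = img[:, 1:] - img[:, :-1]      # horizontal differences
    gy[:-1, :] = img[1:, :] - img[:-1, :]      # vertical differences
    return np.abs(gx) + np.abs(gy)

def fusion_loss(fused, ir, vis, weight=1.0):
    """Intensity + gradient loss against element-wise maximum targets.

    Using max(ir, vis) as the target is an assumption borrowed from common
    practice in fusion losses, not something stated in the abstract.
    """
    target_int = np.maximum(ir, vis)
    target_grad = np.maximum(gradient_magnitude(ir), gradient_magnitude(vis))
    l_int = np.mean(np.abs(fused - target_int))
    l_grad = np.mean(np.abs(gradient_magnitude(fused) - target_grad))
    return l_int + weight * l_grad
```

In this sketch the loss is zero when both sources are identical and the fused image equals them, and it grows as the fused image loses intensity or edge content relative to the stronger source.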
Pages: 13