Infrared and visible image fusion based on a two-stage class conditioned auto-encoder network

Cited by: 7
Authors
Cao, Yanpeng [1 ,2 ]
Luo, Xing [1 ,2 ]
Tong, Xi [1 ,2 ]
Yang, Jiangxin [1 ,2 ]
Cao, Yanlong [1 ,2 ]
Affiliations
[1] Zhejiang Univ, Sch Mech Engn, State Key Lab Fluid Power Transmiss & Control, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Sch Mech Engn, Key Lab Adv Mfg Technol Zhejiang Prov, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Infrared imaging; Image fusion; Conditional learning; Auto-encoder; PERFORMANCE; ALGORITHM; NEST;
DOI
10.1016/j.neucom.2023.126248
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Existing auto-encoder based infrared and visible image fusion methods typically use a shared encoder to extract features from different modalities and adopt a handcrafted fusion strategy to fuse the extracted features into an intermediate representation before the decoder. In this paper, we present a novel two-stage class conditioned auto-encoder framework for high-quality multispectral fusion tasks. In the first training stage, we introduce a class embedding sub-branch to the encoder network to model the characteristics of different modalities and adaptively scale the intermediate features based on the input modality. Moreover, we design a cross-transfer residual block to promote the flow of content and texture information in the encoder and generate more representative features. In the second training stage, we insert a learnable fusion module between the pre-trained class conditioned encoder and decoder to replace the handcrafted fusion strategy. Specific intensity and gradient loss functions are used to tune the model so that distinctive deep features are fused in a data-driven manner. With the class conditioned auto-encoder and the two-stage training strategy, our proposed TS-ClassFuse better preserves distinctive information and features from the source images and reduces the training difficulty of simultaneously extracting informative features and determining the optimal fusion scheme. Experimental results verify the effectiveness of our method in terms of both qualitative and quantitative evaluations. (c) 2023 Elsevier B.V. All rights reserved.
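The abstract does not give the exact form of the class embedding sub-branch or of the intensity and gradient losses, so the following is only a minimal NumPy sketch under stated assumptions, not the authors' implementation. It assumes (1) the class embedding is a lookup table that maps a modality index to a per-channel scale vector, and (2) the intensity and gradient terms are L1 distances to the element-wise maximum of the two source images and of their gradient magnitudes, a formulation common in fusion work but not confirmed by this record. All names (`class_conditioned_scale`, `fusion_loss`, `grad_weight`) are hypothetical.

```python
import numpy as np


def class_conditioned_scale(features, modality_id, embedding_table):
    """Scale each channel of `features` (C, H, W) by a per-modality vector.

    Assumption: `embedding_table` has shape (num_modalities, C) and stands in
    for a learned class embedding; row 0 could be infrared, row 1 visible.
    """
    scale = embedding_table[modality_id]      # shape (C,)
    return features * scale[:, None, None]    # broadcast over H and W


def gradient_magnitude(img):
    """Forward-difference gradient magnitude |dx| + |dy| of a 2-D image."""
    gx = np.zeros_like(img)
    gy = np.zeros_like(img)
    gx[:, :-1] = img[:, 1:] - img[:, :-1]     # horizontal differences
    gy[:-1, :] = img[1:, :] - img[:-1, :]     # vertical differences
    return np.abs(gx) + np.abs(gy)


def fusion_loss(fused, ir, vis, grad_weight=1.0):
    """Assumed loss: L1 intensity term plus weighted L1 gradient term."""
    intensity = np.mean(np.abs(fused - np.maximum(ir, vis)))
    target_grad = np.maximum(gradient_magnitude(ir), gradient_magnitude(vis))
    gradient = np.mean(np.abs(gradient_magnitude(fused) - target_grad))
    return intensity + grad_weight * gradient


if __name__ == "__main__":
    table = np.array([[1.0, 2.0], [0.5, 0.25]])   # 2 modalities, 2 channels
    feats = np.ones((2, 3, 3))
    print(class_conditioned_scale(feats, 1, table)[:, 0, 0])

    rng = np.random.default_rng(0)
    ir, vis = rng.random((4, 4)), rng.random((4, 4))
    print(fusion_loss(np.maximum(ir, vis), ir, vis))
```

In a two-stage setup like the one described, the scaling step would belong to the encoder trained in the first stage, while a loss of this kind would only drive the learnable fusion module added in the second stage.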
Pages: 13
Related papers
50 in total
  • [21] Interactive Image Segmentation Based on Fusion of Two-Stage Feature and Transformer Encoder
    Feng, Jun
    Zhang, Tian
    Shi, Yichen
    Wang, Hui
    Hu, Jingjing
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (06) : 831 - 843
  • [22] Infrared-Visible Image Fusion Using Dual-Branch Auto-Encoder With Invertible High-Frequency Encoding
    Liu, Honglin
    Mao, Qirong
    Dong, Ming
    Zhan, Yongzhao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2675 - 2688
  • [23] VISIBLE AND INFRARED IMAGE FUSION USING ENCODER-DECODER NETWORK
    Ataman, Ferhat Can
    Bozdagi Akar, Gozde
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021 : 1779 - 1783
  • [24] A METHOD FOR FACE FUSION BASED ON VARIATIONAL AUTO-ENCODER
    Li, Xiang
    Wen, Jin-Mei
    Chen, An-Long
    Chen, Bo
    2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018 : 77 - 80
  • [25] Image Retrieval System based on a Binary Auto-Encoder and a Convolutional Neural Network
    Ferreyra-Ramirez, Andres
    Rodriguez-Martinez, Eduardo
    Aviles-Cruz, Carlos
    Lopez-Saca, Fidel
    IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (11) : 1925 - 1932
  • [26] TCPMFNet: An infrared and visible image fusion network with composite auto encoder and transformer-convolutional parallel mixed fusion strategy
    Yi, Shi
    Jiang, Gang
    Liu, Xi
    Li, Junjie
    Chen, Ling
    INFRARED PHYSICS & TECHNOLOGY, 2022, 127
  • [27] Deep neural network for halftone image classification based on sparse auto-encoder
    Zhang, Yan
    Zhang, Erhu
    Chen, Wanjun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 : 245 - 255
  • [28] Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval
    Rafiei, Mehdi
    Iosifidis, Alexandros
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023
  • [29] A Big Network Traffic Data Fusion Approach Based on Fisher and Deep Auto-Encoder
    Tao, Xiaoling
    Kong, Deyan
    Wei, Yi
    Wang, Yong
    INFORMATION, 2016, 7 (02)
  • [30] A dual-encoder network based on multi-layer feature fusion for infrared and visible image fusion
    Huang, Shuying
    Wu, Xueqiang
    Yang, Yong
    Wan, Weiguo
    Wang, Xiaozheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (10) : 4511 - 4520