Infrared and visible image fusion based on a two-stage class conditioned auto-encoder network

被引：7

作者：

Cao, Yanpeng ^{[1
,2
]}

Luo, Xing ^{[1
,2
]}

Tong, Xi ^{[1
,2
]}

Yang, Jiangxin ^{[1
,2
]}

Cao, Yanlong ^{[1
,2
]}

机构：

[1] Zhejiang Univ, Sch Mech Engn, State Key Lab Fluid Power Transmiss & Control, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Sch Mech Engn, Key Lab Adv Mfg Technol Zhejiang Prov, Hangzhou 310027, Peoples R China

来源：

NEUROCOMPUTING | 2023年 / 544卷

基金：

中国国家自然科学基金;

关键词：

Infrared imaging; Image fusion; Conditional learning; Auto; -encoder; PERFORMANCE; ALGORITHM; NEST;

D O I：

10.1016/j.neucom.2023.126248

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing auto-encoder based infrared and visible image fusion methods typically utilize a shared encoder to extract features from different modalities and adopt a handcrafted fusion strategy to fuse the extracted features into intermediate representation before the decoder part. In this paper, we present a novel two -stage class conditioned auto-encoder framework for high-quality multispectral fusion tasks. In the first training stage, we introduce a class embedding sub-branch to the encoder network for modeling the characteristics of different modalities and adaptively scaling the intermediate features based on the input modality. Moreover, we design a cross-transfer residual block to promote the content and texture infor-mation flow in the encoder for generating more representative features. In the second training stage, we insert a learnable fusion module between the pre-trained class conditioned encoder and decoder parts to replace the handcrafted fusion strategy. Specific intensity and gradient loss functions are utilized to tune the model for the fusion of distinctive deep features in a data-driven manner. With the important designs including the class conditioned auto-encoder and the two-stage training strategy, our proposed TS-ClassFuse can better preserve distinctive information/features from the source images and decrease the training difficulty for simultaneously extracting informative features and determining the optimal fusion scheme. Experimental results verify the effectiveness of our method in terms of both qualitative and quantitative evaluations.(c) 2023 Elsevier B.V. All rights reserved.

引用

页数：13

共 50 条

[21] Interactive Image Segmentation Based on Fusion of Two-Stage Feature and Transformer Encoder
Feng, Jun
Zhang, Tian
Shi, Yichen
Wang, Hui
Hu, Jingjing
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2024, 36 (06): : 831 - 843
[22] Infrared-Visible Image Fusion Using Dual-Branch Auto-Encoder With Invertible High-Frequency Encoding
Liu, Honglin
Mao, Qirong
Dong, Ming
Zhan, Yongzhao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (03) : 2675 - 2688
[23] VISIBLE AND INFRARED IMAGE FUSION USING ENCODER-DECODER NETWORK
Ataman, Ferhat Can
Bozdagi Akar, Gozde
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1779 - 1783
[24] A METHOD FOR FACE FUSION BASED ON VARIATIONAL AUTO-ENCODER
Li, Xiang
Wen, Jin-Mei
Chen, An-Long
Chen, Bo
2018 15TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2018, : 77 - 80
[25] Image Retrieval System based on a Binary Auto-Encoder and a Convolutional Neural Network
Ferreyra-Ramirez, Andres
Rodriguez-Martinez, Eduardo
Aviles-Cruz, Carlos
Lopez-Saca, Fidel
IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (11) : 1925 - 1932
[26] TCPMFNet: An infrared and visible image fusion network with composite auto encoder and transformer-convolutional parallel mixed fusion strategy
Yi, Shi
Jiang, Gang
Liu, Xi
Li, Junjie
Chen, Ling
INFRARED PHYSICS & TECHNOLOGY, 2022, 127
[27] Deep neural network for halftone image classification based on sparse auto-encoder
Zhang, Yan
Zhang, Erhu
Chen, Wanjun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 50 : 245 - 255
[28] Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval
Rafiei, Mehdi
Iosifidis, Alexandros
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[29] A Big Network Traffic Data Fusion Approach Based on Fisher and Deep Auto-Encoder
Tao, Xiaoling
Kong, Deyan
Wei, Yi
Wang, Yong
INFORMATION, 2016, 7 (02)
[30] A dual-encoder network based on multi-layer feature fusion for infrared and visible image fusion
Huang, Shuying
Wu, Xueqiang
Yang, Yong
Wan, Weiguo
Wang, Xiaozheng
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (10) : 4511 - 4520

← 1 2 3 4 5 →