Commonality Feature Representation Learning for Unsupervised Multimodal Change Detection

被引:0
|
作者
Liu, Tongfei [1 ]
Zhang, Mingyang [2 ]
Gong, Maoguo [2 ]
Zhang, Qingfu [3 ]
Jiang, Fenlong [2 ]
Zheng, Hanhong [2 ]
Lu, Di [2 ]
机构
[1] Shaanxi Univ Sci & Technol, Shaanxi Joint Lab Artificial Intelligence, Xian 710021, Peoples R China
[2] Xidian Univ, Key Lab Collaborat Intelligent Syst, Minist Educ, Xian 710071, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Image reconstruction; Training; Data mining; Autoencoders; Representation learning; Image sensors; Electronic mail; Decoding; Clustering algorithms; Multimodal change detection; unsupervised change detection; heterogeneous images; representation learning; commonality feature; REMOTE-SENSING IMAGES; HETEROGENEOUS IMAGES; NETWORK; GRAPH; REGRESSION;
D O I
10.1109/TIP.2025.3539461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The main challenge of multimodal change detection (MCD) is that multimodal bitemporal images (MBIs) cannot be compared directly to identify changes. To overcome this problem, this paper proposes a novel commonality feature representation learning (CFRL) and constructs a CFRL-based unsupervised MCD framework. The CFRL is composed of a Siamese-based encoder and two decoders. First, the Siamese-based encoder can map original MBIs in the same feature space for extracting the representative features of each modality. Then, the two decoders are used to reconstruct the original MBIs by regressing themselves, respectively. Meanwhile, we swap the decoders to reconstruct the pseudo-MBIs to conduct modality alignment. Subsequently, all reconstructed images are input to the Siamese-based encoder again to map them in a same feature space, by which representative features are obtained. On this basis, latent commonality features between MBIs can be extracted by minimizing the distance between these representative features. These latent commonality features are comparable and can be used to identify changes. Notably, the proposed CFRL can be performed simultaneously in two modalities corresponding to MBIs. Therefore, two change magnitude images (CMIs) can be generated simultaneously by measuring the difference between the commonality features of MBIs. Finally, a simple threshold algorithm or a clustering algorithm can be employed to divide CMIs into binary change maps. Extensive experiments on six publicly available MCD datasets show that the proposed CFRL-based framework can achieve superior performance compared with other state-of-the-art approaches.
引用
收藏
页码:1219 / 1233
页数:15
相关论文
共 50 条
  • [1] Unsupervised Multimodal Change Detection Based on Structural Relationship Graph Representation Learning
    Chen, Hongruixuan
    Yokoya, Naoto
    Wu, Chen
    Du, Bo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] Multimodal pretraining for unsupervised protein representation learning
    Nguyen, Viet Thanh Duy
    Hy, Truong Son
    BIOLOGY METHODS & PROTOCOLS, 2024, 9 (01):
  • [3] Unsupervised Feature Recommendation using Representation Learning
    Datta, Anish
    Bandyopadhyay, Soma
    Sachan, Shruti
    Pal, Arpan
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1591 - 1595
  • [4] Instance Similarity Learning for Unsupervised Feature Representation
    Wang, Ziwei
    Wang, Yunsong
    Wu, Ziyi
    Lu, Jiwen
    Zhou, Jie
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 10316 - 10325
  • [5] COMPACT FEATURE REPRESENTATION FOR UNSUPERVISED OOD DETECTION
    Saha, Sudipan
    Gawlikowski, Jakob
    Nandy, Jay
    Zhu, Xiao Xiang
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3143 - 3146
  • [6] Unsupervised Multimodal Feature Learning for Semantic Image Segmentation
    Pei, Deli
    Liu, Huaping
    Liu, Yulong
    Sun, Fuchun
    2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [7] Unsupervised deep learning for geometric feature detection and multilevel-multimodal image registration
    Lajili, Mohamed
    Belhachmi, Zakaria
    Moakher, Maher
    Theljani, Anis
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 7878 - 7896
  • [8] Style Transformation-Based SpatialSpectral Feature Learning for Unsupervised Change Detection
    Liu, Ganchao
    Yuan, Yuan
    Zhang, Yuelin
    Dong, Yongsheng
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [9] Unsupervised Video Representation Learning by Bidirectional Feature Prediction
    Behrmann, Nadine
    Gall, Juergen
    Noroozi, Mehdi
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1669 - 1678
  • [10] Coupled dictionary learning for unsupervised change detection between multimodal remote sensing images
    Ferraris, Vinicius
    Dobigeon, Nicolas
    Cavalcanti, Yanna
    Oberlin, Thomas
    Chabert, Marie
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2019, 189