Cross-Modal Contrastive Learning for Remote Sensing Image Classification

被引:15
|
作者
Feng, Zhixi [1 ]
Song, Liangliang [1 ]
Yang, Shuyuan [1 ]
Zhang, Xinyu [1 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Cross-modal contrastive learning (CMCL); multimodal remote sensing image (MRSI) classification; self-supervised; LIDAR DATA; FUSION;
D O I
10.1109/TGRS.2023.3296703
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, multimodal remote sensing image (MRSI) classification has attracted increasing attention from researchers. However, the classification of MRSI with limited labeled instances is still a challenging task. In this article, a novel self-supervised cross-modal contrastive learning (CMCL) method is proposed for MRSI classification. Joint intramodal contrastive learning (IMCL) and CMCL are used to better mine multimodal feature representations during pretraining, and the IMCL and CMCL objectives are jointly optimized, whereby it encourages the learned representation to be semantically consistent within and between modalities simultaneously. Moreover, a simple but effective hybrid cross-modal fusion module (HCFM) is designed in the fine-tuning stage, which could better compactly integrate complementary information across these modalities for more accurate classification. Extensive experiments are taken on four benchmark datasets (i.e., Houston 2013, Augsburg, Germany; Trento, Italy; and Berlin, Germany), and the results show that the proposed method outperforms state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Cross-Modal Compositional Learning for Multilabel Remote Sensing Image Classification
    Guo, Jie
    Jiao, Shuchang
    Sun, Hao
    Song, Bin
    Chi, Yuhao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 5810 - 5823
  • [2] Variance Consistency Learning: Enhancing Cross-Modal Knowledge Distillation for Remote Sensing Image Classification
    Song, Huaxiang
    Zhou, Yong
    Liu, Wanbo
    Zhao, Di
    Liu, Qun
    Liu, Jinling
    Annals of Emerging Technologies in Computing, 2024, 8 (04) : 56 - 76
  • [3] UNSUPERVISED CONTRASTIVE HASHING FOR CROSS-MODAL RETRIEVAL IN REMOTE SENSING
    Mikriukov, Georgii
    Ravanbakhsh, Mahdyar
    Demir, Begum
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4463 - 4467
  • [4] Cross-Modal Contrastive Learning With Spatiotemporal Context for Correlation-Aware Multiscale Remote Sensing Image Retrieval
    Zhu, Lilu
    Wang, Yang
    Hu, Yanfeng
    Su, Xiaolu
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [5] A fusion-based contrastive learning model for cross-modal remote sensing retrieval
    Li, Haoran
    Xiong, Wei
    Cui, Yaqi
    Xiong, Zhenyu
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (09) : 3359 - 3386
  • [6] Mining Contrastive Relations Between Cross-Modal Features for Zero-Shot Remote Sensing Image Scene Classification
    Liu, Chun
    Ma, Suqiang
    Li, Zheng
    Yang, Wei
    Han, Zhigang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [7] Masking-Based Cross-Modal Remote Sensing Image-Text Retrieval via Dynamic Contrastive Learning
    Zhao, Zuopeng
    Miao, Xiaoran
    He, Chen
    Hu, Jianfeng
    Min, Bingbing
    Gao, Yumeng
    Liu, Ying
    Pharksuwan, Kanyaphakphachsorn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15
  • [8] A Cross-modal image retrieval method based on contrastive learning
    Zhou, Wen
    JOURNAL OF OPTICS-INDIA, 2024, 53 (03): : 2098 - 2107
  • [9] Cross-Modal Contrastive Learning for Text-to-Image Generation
    Zhang, Han
    Koh, Jing Yu
    Baldridge, Jason
    Lee, Honglak
    Yang, Yinfei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 833 - 842
  • [10] A Cross-modal image retrieval method based on contrastive learning
    Zhou, Wen
    JOURNAL OF OPTICS-INDIA, 2023, 53 (3): : 2098 - 2107