Contrastive Tokens and Label Activation for Remote Sensing Weakly Supervised Semantic Segmentation

Cited by: 2
Authors
Hu, Zaiyi [1]
Gao, Junyu [1, 2]
Yuan, Yuan [1]
Li, Xuelong [3]
Affiliations
[1] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] China Telecom Corp Ltd, Inst Artificial Intelligence TeleAI, Beijing 100033, Peoples R China
Keywords
Remote sensing; Semantic segmentation; Training; Task analysis; Semantics; Convolutional neural networks; Transformers; Deep learning; remote sensing images; vision transformer (ViT); weakly supervised semantic segmentation (WSSS)
DOI
10.1109/TGRS.2024.3385747
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
In recent years, there has been remarkable progress in weakly supervised semantic segmentation (WSSS), with vision transformer (ViT) architectures emerging as a natural fit for such tasks because their global attention captures comprehensive object information. However, directly applying ViT to WSSS introduces challenges. The characteristics of ViT can lead to an oversmoothing problem, particularly in dense scenes of remote sensing images, which significantly compromises the effectiveness of class activation maps (CAMs) and makes segmentation harder. Moreover, existing methods often adopt multistage strategies, adding complexity and reducing training efficiency. To overcome these challenges, we present Contrastive Token and Foreground Activation (CTFA), a comprehensive ViT-based framework for WSSS of remote sensing images. The method includes a contrastive token learning module (CTLM) that incorporates both patch-wise and class-wise token learning to enhance model performance. In patch-wise learning, we leverage the semantic diversity preserved in the intermediate layers of ViT: we derive a relation matrix from these layers and use it to supervise the final output tokens, thereby improving the quality of the CAM. In class-wise learning, we enforce consistent representations between global and local tokens, revealing more complete object regions. Additionally, by activating foreground features in the generated pseudo labels using a dual-branch decoder, we further improve CAM generation. Our approach demonstrates outstanding results across three well-established datasets, providing a more efficient and streamlined solution for WSSS. Code will be available at: https://github.com/ZaiyiHu/CTFA.
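The patch-wise idea in the abstract (a relation matrix derived from intermediate-layer tokens supervising the final tokens) can be illustrated with a minimal sketch. This is not the authors' implementation: the cosine-similarity relation, the `threshold` value, the loss form, and the names `relation_matrix` and `patch_contrast_loss` are all assumptions made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity between two token vectors (lists of floats)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v + 1e-8)


def relation_matrix(tokens, threshold=0.5):
    """Binary pairwise relation from intermediate-layer patch tokens.

    Assumption: two patches are "related" (1) when their cosine
    similarity exceeds a hypothetical threshold, otherwise 0.
    """
    n = len(tokens)
    return [
        [1 if cosine(tokens[i], tokens[j]) > threshold else 0 for j in range(n)]
        for i in range(n)
    ]


def patch_contrast_loss(final_tokens, relation):
    """Assumed loss: pull related final tokens together, push others apart."""
    n = len(final_tokens)
    total, count = 0.0, 0
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            sim = cosine(final_tokens[i], final_tokens[j])
            if relation[i][j] == 1:
                total += 1.0 - sim       # related pair: reward high similarity
            else:
                total += max(0.0, sim)   # unrelated pair: penalize similarity
            count += 1
    return total / count if count else 0.0


# Toy data: three patches, the first two semantically alike.
intermediate = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]
oversmoothed = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]  # all final tokens collapse
separated = [[1.0, 0.0], [1.0, 0.1], [0.0, 1.0]]     # diversity preserved

relation = relation_matrix(intermediate)
loss_oversmoothed = patch_contrast_loss(oversmoothed, relation)
loss_separated = patch_contrast_loss(separated, relation)
print(loss_oversmoothed > loss_separated)
```

In this toy setup the collapsed (oversmoothed) final tokens incur a higher loss than tokens that keep the intermediate-layer diversity, which is the behavior the abstract attributes to the relation-matrix supervision.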
Pages: 1 / 11
Number of pages: 11