Contrastive Tokens and Label Activation for Remote Sensing Weakly Supervised Semantic Segmentation

被引:2
|
作者
Hu, Zaiyi [1 ]
Gao, Junyu [1 ,2 ]
Yuan, Yuan [1 ]
Li, Xuelong [3 ]
机构
[1] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[2] Shanghai Artificial Intelligence Lab, Shanghai 200232, Peoples R China
[3] China Telecom Corp Ltd, Inst Artificial Intelligence TeleAI, Beijing 100033, Peoples R China
关键词
Remote sensing; Semantic segmentation; Training; Task analysis; Semantics; Convolutional neural networks; Transformers; Deep learning; remote sensing images; vision transformer (ViT); weakly supervised semantic segmentation (WSSS);
D O I
10.1109/TGRS.2024.3385747
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In recent years, there has been remarkable progress in weakly supervised semantic segmentation (WSSS), with vision transformer (ViT) architectures emerging as a natural fit for such tasks due to their inherent ability to leverage global attention for comprehensive object information perception. However, directly applying ViT to WSSS tasks can introduce challenges. The characteristics of ViT can lead to an oversmoothing problem, particularly in dense scenes of remote sensing images, significantly compromising the effectiveness of class activation maps (CAMs) and posing challenges for segmentation. Moreover, existing methods often adopt multistage strategies, adding complexity and reducing training efficiency. To overcome these challenges, a comprehensive framework Contrastive Token and Foreground Activation (CTFA) based on the ViT architecture for WSSS of remote sensing images is presented. Our proposed method includes a contrastive token learning module (CTLM), incorporating both patch-wise and class-wise token learning to enhance model performance. In patch-wise learning, we leverage the semantic diversity preserved in intermediate layers of ViT and derive a relation matrix from these layers and employ it to supervise the final output tokens, thereby improving the quality of CAM. In class-wise learning, we ensure the consistency of representation between global and local tokens, revealing more entire object regions. Additionally, by activating foreground features in the generated pseudo label using a dual-branch decoder, we further promote the improvement of CAM generation. Our approach demonstrates outstanding results across three well-established datasets, providing a more efficient and streamlined solution for WSSS. Code will be available at: https://github.com/ZaiyiHu/CTFA.
引用
收藏
页码:1 / 11
页数:11
相关论文
共 50 条
  • [21] A multi-strategy contrastive learning framework for weakly supervised semantic segmentation
    Yuan, Kunhao
    Schaefer, Gerald
    Lai, Yu-Kun
    Wang, Yifan
    Liu, Xiyao
    Guan, Lin
    Fang, Hui
    PATTERN RECOGNITION, 2023, 137
  • [22] Contrastive and consistent feature learning for weakly supervised object localization and semantic segmentation
    Ki, Minsong
    Uh, Youngjung
    Lee, Wonyoung
    Byun, Hyeran
    NEUROCOMPUTING, 2021, 445 : 244 - 254
  • [23] Weakly Supervised Deep Learning for Segmentation of Remote Sensing Imagery
    Wang, Sherrie
    Chen, William
    Xie, Sang Michael
    Azzari, George
    Lobell, David B.
    REMOTE SENSING, 2020, 12 (02)
  • [24] Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation
    Qin, Jie
    Wu, Jie
    Xiao, Xuefeng
    Li, Lujun
    Wang, Xingang
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2117 - 2125
  • [25] Class Activation Map Calibration for Weakly Supervised Semantic Segmentation
    Wang, Jian
    Dai, Tianhong
    Zhao, Xinqiao
    Garcia-Fernandez, Angel F.
    Lim, Eng Gee
    Xiao, Jimin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11668 - 11681
  • [26] Semantic Attention and Structured Model for Weakly Supervised Instance Segmentation in Optical and SAR Remote Sensing Imagery
    Chen, Man
    Xu, Kun
    Chen, Enping
    Zhang, Yao
    Xie, Yifei
    Hu, Yahao
    Pan, Zhisong
    REMOTE SENSING, 2023, 15 (21)
  • [27] WEAKLY SUPERVISED SEMANTIC SEGMENTATION OF REMOTE SENSING IMAGES FOR TREE SPECIES CLASSIFICATION BASED ON EXPLANATION METHODS
    Ahlswede, Steve
    Madam, Nimisha Thekke
    Schulz, Christian
    Kleinschmit, Birgit
    Demir, Begum
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 4847 - 4850
  • [28] A Novel Weakly Supervised Remote Sensing Landslide Semantic Segmentation Method: Combining CAM and cycleGAN Algorithms
    Zhou, Yongxiu
    Wang, Honghui
    Yang, Ronghao
    Yao, Guangle
    Xu, Qiang
    Zhang, Xiaojuan
    REMOTE SENSING, 2022, 14 (15)
  • [29] A Creative Weak Supervised Semantic Segmentation for Remote Sensing Images
    Wang, Zhibao
    Chang, Huan
    Bai, Lu
    Chen, Liangfu
    Bi, Xiuli
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [30] ICA-Net: improving class activation for weakly supervised semantic segmentation via joint contrastive and simulation learning
    YE Zhuang
    LIU Ruyu
    SUN Bo
    Optoelectronics Letters, 2025, 21 (03) : 188 - 192