Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation

被引:1
|
作者
Du, Ye [1 ]
Fu, Zehua [2 ]
Liu, Qingjie [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
基金
中国国家自然科学基金;
关键词
Cams; Semantic segmentation; Training; Feature extraction; Adaptation models; Task analysis; Semantics; weakly supervised learning; domain adaptation; pseudo-labeling; MODEL; FRAMEWORK; ALIGNMENT; NETWORK;
D O I
10.1109/TIP.2024.3444190
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent attention has been devoted to the pursuit of learning semantic segmentation models exclusively from image tags, a paradigm known as image-level Weakly Supervised Semantic Segmentation (WSSS). Existing attempts adopt the Class Activation Maps (CAMs) as priors to mine object regions yet observe the imbalanced activation issue, where only the most discriminative object parts are located. In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. For this purpose, we propose a Pixel-Level Domain Adaptation (PLDA) method to encourage the model in learning pixel-wise domain-invariant features. Specifically, a multi-head domain classifier trained adversarially with the feature extraction is introduced to promote the emergence of pixel features that are invariant with respect to the shift between the source (i.e., the discriminative object parts) and the target (i.e., the non-discriminative object parts) domains. In addition, we come up with a Confident Pseudo-Supervision strategy to guarantee the discriminative ability of each pixel for the segmentation task, which serves as a complement to the intra-image domain adversarial training. Our method is conceptually simple, intuitive and can be easily integrated into existing WSSS methods. Taking several strong baseline models as instances, we experimentally demonstrate the effectiveness of our approach under a wide range of settings.
引用
收藏
页码:4654 / 4669
页数:16
相关论文
共 50 条
  • [21] Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks
    Bousmalis, Konstantinos
    Silberman, Nathan
    Dohan, David
    Erhan, Dumitru
    Krishnan, Dilip
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 95 - 104
  • [22] Landslide detection based on pixel-level contrastive learning for semi-supervised semantic segmentation in wide areas
    Lv, Jichao
    Zhang, Rui
    Wu, Renzhe
    Bao, Xin
    Liu, Guoxiang
    LANDSLIDES, 2025, 22 (04) : 1087 - 1105
  • [23] EfficientFusion: simple and efficient learning with pixel-level fusion for semantic segmentation
    Liu, Ping
    Tian, Shuaijie
    Gao, Yu
    Xie, Yuting
    Hao, Shufeng
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [24] Pixel-Level Domain Transfer
    Yoo, Donggeun
    Kim, Namil
    Park, Sunggyun
    Paek, Anthony S.
    Kweon, In So
    COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 517 - 532
  • [25] Enhancing pixel-level crack segmentation with visual mamba and convolutional networks
    Han, Chengjia
    Yang, Handuo
    Yang, Yaowen
    AUTOMATION IN CONSTRUCTION, 2024, 168
  • [26] Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation
    Wang, Zhonghao
    Wei, Yunchao
    Feris, Rogerio
    Xiong, Jinjun
    Hwu, Wen-mei
    Huang, Thomas S.
    Shi, Honghui
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 4043 - 4047
  • [27] Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank
    Alonso, Inigo
    Sabater, Alberto
    Ferstl, David
    Montesano, Luis
    Murillo, Ana C.
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8199 - 8208
  • [28] Semi-supervised Domain Adaptation based on Dual-level Domain Mixing for Semantic Segmentation
    Chen, Shuaijun
    Jia, Xu
    He, Jianzhong
    Shi, Yongjie
    Liu, Jianzhuang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 11013 - 11022
  • [29] Hybrid Pixel-Level Crack Segmentation for Ballastless Track Slab Using Digital Twin Model and Weakly Supervised Style Transfer
    Hu, Wenbo
    Wang, Weidong
    Liu, Xianhua
    Peng, Jun
    Wang, Sicheng
    Ai, Chengbo
    Qiu, Shi
    Wang, Wenjuan
    Wang, Jin
    Zaheer, Qasim
    Wang, Lichang
    STRUCTURAL CONTROL & HEALTH MONITORING, 2024, 2024
  • [30] Pixel-Level and Feature-Level Domain Adaptation for Heterogeneous SAR Target Recognition
    Chen, Zhuo
    Zhao, Lingjun
    He, Qishan
    Kuang, Gangyao
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19