Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation

被引:1
|
作者
Du, Ye [1 ]
Fu, Zehua [2 ]
Liu, Qingjie [1 ,2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China
基金
中国国家自然科学基金;
关键词
Cams; Semantic segmentation; Training; Feature extraction; Adaptation models; Task analysis; Semantics; weakly supervised learning; domain adaptation; pseudo-labeling; MODEL; FRAMEWORK; ALIGNMENT; NETWORK;
D O I
10.1109/TIP.2024.3444190
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent attention has been devoted to the pursuit of learning semantic segmentation models exclusively from image tags, a paradigm known as image-level Weakly Supervised Semantic Segmentation (WSSS). Existing attempts adopt the Class Activation Maps (CAMs) as priors to mine object regions yet observe the imbalanced activation issue, where only the most discriminative object parts are located. In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. For this purpose, we propose a Pixel-Level Domain Adaptation (PLDA) method to encourage the model in learning pixel-wise domain-invariant features. Specifically, a multi-head domain classifier trained adversarially with the feature extraction is introduced to promote the emergence of pixel features that are invariant with respect to the shift between the source (i.e., the discriminative object parts) and the target (i.e., the non-discriminative object parts) domains. In addition, we come up with a Confident Pseudo-Supervision strategy to guarantee the discriminative ability of each pixel for the segmentation task, which serves as a complement to the intra-image domain adversarial training. Our method is conceptually simple, intuitive and can be easily integrated into existing WSSS methods. Taking several strong baseline models as instances, we experimentally demonstrate the effectiveness of our approach under a wide range of settings.
引用
收藏
页码:4654 / 4669
页数:16
相关论文
共 50 条
  • [31] Supervised Domain Adaptation for Automated Semantic Segmentation of the Atrial Cavity
    Saiz-Vivo, Marta
    Colomer, Adrian
    Fonfria, Carles
    Marti-Bonmati, Luis
    Naranjo, Valery
    ENTROPY, 2021, 23 (07)
  • [32] Semi-Supervised Pixel-Level Scene Text Segmentation by Mutually Guided Network
    Wang, Chuan
    Zhao, Shan
    Zhu, Li
    Luo, Kunming
    Guo, Yanwen
    Wang, Jue
    Liu, Shuaicheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8212 - 8221
  • [33] Weakly Supervised RBM for Semantic Segmentation
    Li, Yong
    Liu, Jing
    Wang, Yuhang
    Lu, Hanqing
    Ma, Songde
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1888 - 1894
  • [34] A Survey of Weakly -supervised Semantic Segmentation
    Zhu, Kaiyin
    Xiong, Neal N.
    Lu, Mingming
    2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 10 - 15
  • [35] Single annotated pixel based weakly supervised semantic segmentation under driving scenes
    Li, Xi
    Ma, Huimin
    Yi, Sheng
    Chen, Yanxian
    Ma, Hongbing
    PATTERN RECOGNITION, 2021, 116
  • [36] Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation
    Lee, Minhyun
    Lee, Seungho
    Lee, Jongwuk
    Shim, Hyunjung
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12341 - 12357
  • [37] Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation
    Lee, Seungho
    Lee, Minhyun
    Lee, Jongwuk
    Shim, Hyunjung
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5491 - 5501
  • [38] Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
    Chen, Hongjun
    Wang, Jinbao
    Chen, Hong Cai
    Zhen, Xiantong
    Zheng, Feng
    Ji, Rongrong
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6900 - 6909
  • [39] Pixel-Level Domain Adaptation for Real-to-Sim Object Pose Estimation
    Qian, Kun
    Duan, Yanhui
    Luo, Chaomin
    Zhao, Yongqiang
    Jing, Xingshuo
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1618 - 1627
  • [40] A Pavement Crack Translator for Data Augmentation and Pixel-Level Detection Based on Weakly Supervised Learning
    Zhong, Jingtao
    Ma, Yuetan
    Zhang, Miaomiao
    Xiao, Rui
    Cheng, Guantao
    Huang, Baoshan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13350 - 13363