Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation

被引：1

作者：

Du, Ye ^{[1
]}

Fu, Zehua ^{[2
]}

Liu, Qingjie ^{[1
,2
]}

机构：

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024年 / 33卷

基金：

中国国家自然科学基金;

关键词：

Cams; Semantic segmentation; Training; Feature extraction; Adaptation models; Task analysis; Semantics; weakly supervised learning; domain adaptation; pseudo-labeling; MODEL; FRAMEWORK; ALIGNMENT; NETWORK;

D O I：

10.1109/TIP.2024.3444190

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent attention has been devoted to the pursuit of learning semantic segmentation models exclusively from image tags, a paradigm known as image-level Weakly Supervised Semantic Segmentation (WSSS). Existing attempts adopt the Class Activation Maps (CAMs) as priors to mine object regions yet observe the imbalanced activation issue, where only the most discriminative object parts are located. In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. For this purpose, we propose a Pixel-Level Domain Adaptation (PLDA) method to encourage the model in learning pixel-wise domain-invariant features. Specifically, a multi-head domain classifier trained adversarially with the feature extraction is introduced to promote the emergence of pixel features that are invariant with respect to the shift between the source (i.e., the discriminative object parts) and the target (i.e., the non-discriminative object parts) domains. In addition, we come up with a Confident Pseudo-Supervision strategy to guarantee the discriminative ability of each pixel for the segmentation task, which serves as a complement to the intra-image domain adversarial training. Our method is conceptually simple, intuitive and can be easily integrated into existing WSSS methods. Taking several strong baseline models as instances, we experimentally demonstrate the effectiveness of our approach under a wide range of settings.

引用

页码：4654 / 4669

页数：16

共 50 条

[31] Supervised Domain Adaptation for Automated Semantic Segmentation of the Atrial Cavity
Saiz-Vivo, Marta
Colomer, Adrian
Fonfria, Carles
Marti-Bonmati, Luis
Naranjo, Valery
ENTROPY, 2021, 23 (07)
[32] Semi-Supervised Pixel-Level Scene Text Segmentation by Mutually Guided Network
Wang, Chuan
Zhao, Shan
Zhu, Li
Luo, Kunming
Guo, Yanwen
Wang, Jue
Liu, Shuaicheng
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8212 - 8221
[33] Weakly Supervised RBM for Semantic Segmentation
Li, Yong
Liu, Jing
Wang, Yuhang
Lu, Hanqing
Ma, Songde
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1888 - 1894
[34] A Survey of Weakly -supervised Semantic Segmentation
Zhu, Kaiyin
Xiong, Neal N.
Lu, Mingming
2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 10 - 15
[35] Single annotated pixel based weakly supervised semantic segmentation under driving scenes
Li, Xi
Ma, Huimin
Yi, Sheng
Chen, Yanxian
Ma, Hongbing
PATTERN RECOGNITION, 2021, 116
[36] Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation
Lee, Minhyun
Lee, Seungho
Lee, Jongwuk
Shim, Hyunjung
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12341 - 12357
[37] Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation
Lee, Seungho
Lee, Minhyun
Lee, Jongwuk
Shim, Hyunjung
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5491 - 5501
[38] Seminar Learning for Click-Level Weakly Supervised Semantic Segmentation
Chen, Hongjun
Wang, Jinbao
Chen, Hong Cai
Zhen, Xiantong
Zheng, Feng
Ji, Rongrong
Shao, Ling
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6900 - 6909
[39] Pixel-Level Domain Adaptation for Real-to-Sim Object Pose Estimation
Qian, Kun
Duan, Yanhui
Luo, Chaomin
Zhao, Yongqiang
Jing, Xingshuo
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (03) : 1618 - 1627
[40] A Pavement Crack Translator for Data Augmentation and Pixel-Level Detection Based on Weakly Supervised Learning
Zhong, Jingtao
Ma, Yuetan
Zhang, Miaomiao
Xiao, Rui
Cheng, Guantao
Huang, Baoshan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13350 - 13363

← 1 2 3 4 5 →