Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation

Cited by: 1

Authors:

Du, Ye [1]

Fu, Zehua [2]

Liu, Qingjie [1,2]

Affiliations:

[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Beihang Univ, Hangzhou Innovat Inst, Hangzhou 310051, Peoples R China

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Funding:

National Natural Science Foundation of China;

Keywords:

Cams; Semantic segmentation; Training; Feature extraction; Adaptation models; Task analysis; Semantics; weakly supervised learning; domain adaptation; pseudo-labeling; MODEL; FRAMEWORK; ALIGNMENT; NETWORK;

DOI:

10.1109/TIP.2024.3444190

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent attention has been devoted to learning semantic segmentation models exclusively from image tags, a paradigm known as image-level Weakly Supervised Semantic Segmentation (WSSS). Existing approaches adopt Class Activation Maps (CAMs) as priors to mine object regions, yet suffer from the imbalanced activation issue, where only the most discriminative object parts are located. In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. To this end, we propose a Pixel-Level Domain Adaptation (PLDA) method to encourage the model to learn pixel-wise domain-invariant features. Specifically, a multi-head domain classifier trained adversarially with the feature extractor is introduced to promote the emergence of pixel features that are invariant with respect to the shift between the source (i.e., the discriminative object parts) and the target (i.e., the non-discriminative object parts) domains. In addition, we propose a Confident Pseudo-Supervision strategy to guarantee the discriminative ability of each pixel for the segmentation task, which serves as a complement to the intra-image domain adversarial training. Our method is conceptually simple, intuitive, and can be easily integrated into existing WSSS methods. Using several strong baseline models as examples, we experimentally demonstrate the effectiveness of our approach under a wide range of settings.

Pages: 4654 - 4669

Page count: 16

50 records in total

[21] Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks
Bousmalis, Konstantinos
Silberman, Nathan
Dohan, David
Erhan, Dumitru
Krishnan, Dilip
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017 : 95 - 104
[22] Landslide detection based on pixel-level contrastive learning for semi-supervised semantic segmentation in wide areas
Lv, Jichao
Zhang, Rui
Wu, Renzhe
Bao, Xin
Liu, Guoxiang
LANDSLIDES, 2025, 22 (04) : 1087 - 1105
[23] EfficientFusion: simple and efficient learning with pixel-level fusion for semantic segmentation
Liu, Ping
Tian, Shuaijie
Gao, Yu
Xie, Yuting
Hao, Shufeng
MULTIMEDIA SYSTEMS, 2024, 30 (06)
[24] Pixel-Level Domain Transfer
Yoo, Donggeun
Kim, Namil
Park, Sunggyun
Paek, Anthony S.
Kweon, In So
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 517 - 532
[25] Enhancing pixel-level crack segmentation with visual mamba and convolutional networks
Han, Chengjia
Yang, Handuo
Yang, Yaowen
AUTOMATION IN CONSTRUCTION, 2024, 168
[26] Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation
Wang, Zhonghao
Wei, Yunchao
Feris, Rogerio
Xiong, Jinjun
Hwu, Wen-mei
Huang, Thomas S.
Shi, Honghui
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020 : 4043 - 4047
[27] Semi-Supervised Semantic Segmentation with Pixel-Level Contrastive Learning from a Class-wise Memory Bank
Alonso, Inigo
Sabater, Alberto
Ferstl, David
Montesano, Luis
Murillo, Ana C.
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021 : 8199 - 8208
[28] Semi-supervised Domain Adaptation based on Dual-level Domain Mixing for Semantic Segmentation
Chen, Shuaijun
Jia, Xu
He, Jianzhong
Shi, Yongjie
Liu, Jianzhuang
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021 : 11013 - 11022
[29] Hybrid Pixel-Level Crack Segmentation for Ballastless Track Slab Using Digital Twin Model and Weakly Supervised Style Transfer
Hu, Wenbo
Wang, Weidong
Liu, Xianhua
Peng, Jun
Wang, Sicheng
Ai, Chengbo
Qiu, Shi
Wang, Wenjuan
Wang, Jin
Zaheer, Qasim
Wang, Lichang
STRUCTURAL CONTROL & HEALTH MONITORING, 2024, 2024
[30] Pixel-Level and Feature-Level Domain Adaptation for Heterogeneous SAR Target Recognition
Chen, Zhuo
Zhao, Lingjun
He, Qishan
Kuang, Gangyao
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
