Boundary-refined prototype generation: A general end-to-end paradigm for semi-supervised semantic segmentation

Cited by: 0

Authors:

Dong, Junhao [1]

Meng, Zhu [1]

Liu, Delong [1]

Liu, Jiaxuan [1]

Zhao, Zhicheng [1,2]

Su, Fei [1,2]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[2] Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2024 / Vol. 137

Keywords:

Semantic segmentation; Semi-supervised learning; Prototype-based contrastive learning; Mean teacher

DOI:

10.1016/j.engappai.2024.109021

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Semi-supervised semantic segmentation has attracted increasing attention in computer vision; it aims to leverage unlabeled data through latent supervision. To this end, prototype-based classification has been introduced and has achieved considerable success. However, current approaches isolate prototype generation from the main training framework, resulting in a workflow that is not end-to-end. Furthermore, most methods apply K-Means clustering directly to features to generate prototypes, which places the prototypes close to category semantic centers but neglects a clear delineation of class boundaries. To address these problems, we propose a novel end-to-end boundary-refined prototype generation (BRPG) method. Specifically, we perform online clustering on sampled features to incorporate prototype generation into the overall training framework. In addition, to sharpen the classification boundaries, we sample and cluster high- and low-confidence features separately based on confidence estimation, which yields prototypes closer to the class boundaries. Moreover, we propose an adaptive prototype optimization strategy that increases the number of prototypes for categories with scattered feature distributions, further refining the class boundaries. Extensive experiments demonstrate the robustness and scalability of our method across diverse datasets, segmentation networks, and semi-supervised frameworks; it outperforms state-of-the-art approaches on three benchmark datasets: PASCAL VOC 2012, Cityscapes, and MS COCO. The code is available at https://github.com/djh-dzxw/BRPG.
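
The confidence-split clustering idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (see the repository linked above for that); the function names `kmeans` and `boundary_aware_prototypes`, the confidence threshold of 0.8, and the cluster counts `k_high` and `k_low` are all assumptions chosen for illustration. The sketch only shows how clustering high- and low-confidence features separately gives each class some prototypes that sit away from its semantic center; it does not cover online clustering during training or the adaptive prototype optimization strategy.

```python
import numpy as np


def kmeans(points, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns an array of at most k centroids."""
    rng = np.random.default_rng(seed)
    k = min(k, len(points))  # cannot have more clusters than points
    centroids = points[rng.choice(len(points), size=k, replace=False)].copy()
    for _ in range(n_iter):
        # Squared Euclidean distance from every point to every centroid.
        dists = ((points[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
        assign = dists.argmin(axis=1)
        for j in range(k):
            members = points[assign == j]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    return centroids


def boundary_aware_prototypes(features, labels, confidence,
                              threshold=0.8, k_high=2, k_low=2):
    """Cluster high- and low-confidence features of each class separately.

    threshold, k_high and k_low are illustrative values, not taken from
    the paper. Returns a dict mapping class id -> (n_prototypes, D) array.
    """
    prototypes = {}
    for c in np.unique(labels):
        in_class = labels == c
        groups = [
            (features[in_class & (confidence >= threshold)], k_high),
            (features[in_class & (confidence < threshold)], k_low),
        ]
        # Skip a group that has no samples for this class.
        parts = [kmeans(group, k) for group, k in groups if len(group) > 0]
        prototypes[int(c)] = np.concatenate(parts, axis=0)
    return prototypes


# Toy usage: 20 feature vectors of dimension 3, two classes,
# alternating high (0.9) and low (0.5) confidence.
features = np.arange(60, dtype=float).reshape(20, 3)
labels = np.array([0] * 10 + [1] * 10)
confidence = np.tile([0.9, 0.5], 10)
protos = boundary_aware_prototypes(features, labels, confidence)
for class_id, p in protos.items():
    print(class_id, p.shape)
```

In this sketch the low-confidence group stands in for features near a decision boundary, under the assumption that low prediction confidence correlates with proximity to another class.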

Pages: 20
