Boundary-refined prototype generation: A general end-to-end paradigm for semi-supervised semantic segmentation

被引:0
|
作者
Dong, Junhao [1 ]
Meng, Zhu [1 ]
Liu, Delong [1 ]
Liu, Jiaxuan [1 ]
Zhao, Zhicheng [1 ,2 ]
Su, Fei [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[2] Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China
关键词
Semantic segmentation; Semi-supervised learning; Prototype-based contrastive learning; Mean teacher;
D O I
10.1016/j.engappai.2024.109021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised semantic segmentation has attracted increasing attention in computer vision, aiming to leverage unlabeled data through latent supervision. To achieve this goal, prototype-based classification has been introduced and achieved lots of success. However, the current approaches isolate prototype generation from the main training framework, presenting a non-end-to-end workflow. Furthermore, most methods directly perform the K-Means clustering on features to generate prototypes, resulting in their proximity to category semantic centers, while overlooking the clear delineation of class boundaries. To address the above problems, we propose a novel end-to-end boundary-refined prototype generation (BRPG) method. Specifically, we perform online clustering on sampled features to incorporate the prototype generation into the whole training framework. In addition, to enhance the classification boundaries, we sample and cluster high- and low-confidence features separately based on confidence estimation, facilitating the generation of prototypes closer to the class boundaries. Moreover, an adaptive prototype optimization strategy is proposed to increase the number of prototypes for categories with scattered feature distributions, which further refines the class boundaries. Extensive experiments demonstrate the remarkable robustness and scalability of our method across diverse datasets, segmentation networks, and semi-supervised frameworks, outperforming the state-of-the-art approaches on three benchmark datasets: PASCAL VOC 2012, Cityscapes and MS COCO. The code is available at https://github.com/djh-dzxw/BRPG.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] End-to-end Boundary Exploration forWeakly-supervised Semantic Segmentation
    Chen, Jianjun
    Fang, Shancheng
    Xie, Hongtao
    Zha, Zhengjun
    Hu, Yue
    Tan, Jianlong
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2381 - 2390
  • [2] Semi-Supervised End-to-End Speech Recognition
    Karita, Shigeki
    Watanabe, Shinji
    Iwata, Tomoharu
    Ogawa, Atsunori
    Delcroix, Marc
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2 - 6
  • [3] Towards End-to-End Semi-supervised Table Detection with Semantic Aligned Matching Transformer
    Shehzadi, Tahira
    Sarode, Shalini
    Stricker, Didier
    Afzal, Muhammad Zeshan
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 295 - 318
  • [4] Semi-Supervised End-to-End Learning for Integrated Sensing and Communications
    Mateos-Ramos, Jose Miguel
    Chatelier, Baptiste
    Hager, Christian
    Keskin, Musa Furkan
    Le Magoarou, Luc
    Wymeersch, Henk
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 132 - 138
  • [5] GrowingNet: An end-to-end growing network for semi-supervised learning
    Zhang, Qifei
    Yu, Xiaomo
    COMPUTER COMMUNICATIONS, 2020, 151 : 208 - 215
  • [6] ACTIVEMATCH: END-TO-END SEMI-SUPERVISED ACTIVE REPRESENTATION LEARNING
    Yuan, Xinkai
    Li, Zilinghan
    Wang, Gaoang
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1136 - 1140
  • [7] End-to-End Semi-supervised Learning for Differentiable Particle Filters
    Wen, Hao
    Chen, Xiongjie
    Papagiannis, Georgios
    Hu, Conghui
    Li, Yunpeng
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5825 - 5831
  • [8] End-to-End Semi-Supervised Learning for Video Action Detection
    Kumar, Akash
    Rawat, Yogesh Singh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14680 - 14690
  • [9] Semi-supervised ASR by End-to-end Self-training
    Chen, Yang
    Wang, Weiran
    Wang, Chao
    INTERSPEECH 2020, 2020, : 2787 - 2791
  • [10] End-to-End Semi-Supervised Object Detection with Soft Teacher
    Xu, Mengde
    Zhang, Zheng
    Hu, Han
    Wang, Jianfeng
    Wang, Lijuan
    Wei, Fangyun
    Bai, Xiang
    Liu, Zicheng
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3040 - 3049