Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression

被引:0
|
作者
Liu, Xinyi [1 ]
Huang, Guoheng [1 ]
Yuan, Xiaochen [2 ]
Zheng, Zewen [1 ]
Zhong, Guo [3 ]
Chen, Xuhang [4 ]
Pun, Chi-Man [5 ]
机构
[1] Guangdong Univ Technol, Guangzhou, Peoples R China
[2] Macao Polytech Univ, Macau, Peoples R China
[3] Guangdong Univ Foreign Studies, Guangzhou, Peoples R China
[4] Huizhou Univ, Huizhou, Peoples R China
[5] Univ Macau, Macau, Peoples R China
来源
VISUAL COMPUTER | 2025年 / 41卷 / 04期
关键词
Weakly Supervised Semantic Segmentation; Class Activation Mapping; Uncertainty estimation; Attention mechanism;
D O I
10.1007/s00371-024-03574-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU.
引用
收藏
页码:2891 / 2906
页数:16
相关论文
共 50 条
  • [31] Uncertainty-guided mutual consistency learning for semi-supervised medical image segmentation
    Zhang, Yichi
    Jiao, Rushi
    Liao, Qingcheng
    Li, Dongyang
    Zhang, Jicong
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 138
  • [32] Uncertainty-guided transformer for brain tumor segmentation
    Chen, Zan
    Peng, Chenxu
    Guo, Wenlong
    Xie, Lei
    Wang, Shanshan
    Zhuge, Qichuan
    Wen, Caiyun
    Feng, Yuanjing
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (12) : 3289 - 3301
  • [33] Hybrid Suppression and Attention with Online Augmentation for Weakly Supervised Semantic Segmentation
    Tseng, Li-An
    Guo, Jing-Ming
    Lin, Zi-Han
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 101 - 102
  • [34] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Zhai, Wei
    Wu, Pingyu
    Zhu, Kai
    Cao, Yang
    Wu, Feng
    Zha, Zheng-Jun
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 750 - 775
  • [35] Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation
    Wei Zhai
    Pingyu Wu
    Kai Zhu
    Yang Cao
    Feng Wu
    Zheng-Jun Zha
    International Journal of Computer Vision, 2024, 132 (3) : 750 - 775
  • [36] Clustering-Guided Class Activation for Weakly Supervised Semantic Segmentation
    Kim, Yeong Woo
    Kim, Wonjun
    IEEE ACCESS, 2024, 12 : 4871 - 4880
  • [37] Weakly Supervised Semantic Segmentation Via Progressive Patch Learning
    Li, Jinlong
    Jie, Zequn
    Wang, Xu
    Zhou, Yu
    Wei, Xiaolin
    Ma, Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1686 - 1699
  • [38] Saliency as Pseudo-Pixel Supervision for Weakly and Semi-Supervised Semantic Segmentation
    Lee, Minhyun
    Lee, Seungho
    Lee, Jongwuk
    Shim, Hyunjung
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12341 - 12357
  • [39] Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation
    Lee, Seungho
    Lee, Minhyun
    Lee, Jongwuk
    Shim, Hyunjung
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5491 - 5501
  • [40] Comprehensive mining of information in Weakly Supervised Semantic Segmentation: Saliency semantics and edge semantics
    Wang, Shaohui
    Shao, Youjia
    Tian, Na
    Zhao, Wencang
    NEURAL NETWORKS, 2024, 169 : 75 - 82