Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression

被引:0
|
作者
Liu, Xinyi [1 ]
Huang, Guoheng [1 ]
Yuan, Xiaochen [2 ]
Zheng, Zewen [1 ]
Zhong, Guo [3 ]
Chen, Xuhang [4 ]
Pun, Chi-Man [5 ]
机构
[1] Guangdong Univ Technol, Guangzhou, Peoples R China
[2] Macao Polytech Univ, Macau, Peoples R China
[3] Guangdong Univ Foreign Studies, Guangzhou, Peoples R China
[4] Huizhou Univ, Huizhou, Peoples R China
[5] Univ Macau, Macau, Peoples R China
来源
VISUAL COMPUTER | 2025年 / 41卷 / 04期
关键词
Weakly Supervised Semantic Segmentation; Class Activation Mapping; Uncertainty estimation; Attention mechanism;
D O I
10.1007/s00371-024-03574-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU.
引用
收藏
页码:2891 / 2906
页数:16
相关论文
共 50 条
  • [41] Weakly supervised semantic segmentation via self-supervised destruction learning
    Li, Jinlong
    Jie, Zequn
    Wang, Xu
    Zhou, Yu
    Ma, Lin
    Jiang, Jianmin
    NEUROCOMPUTING, 2023, 561
  • [42] Uncertainty-Guided Voxel-Level Supervised Contrastive Learning for Semi-Supervised Medical Image Segmentation
    Hua, Yu
    Shu, Xin
    Wang, Zizhou
    Zhang, Lei
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (04)
  • [43] UNCERTAINTY-GUIDED ROBUST TRAINING FOR MEDICAL IMAGE SEGMENTATION
    Li, Yan
    Chen, Xiaoyi
    Quan, Li
    Zhang, Ni
    2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1471 - 1475
  • [44] UTFNet: Uncertainty-Guided Trustworthy Fusion Network for RGB-Thermal Semantic Segmentation
    Wang, Qingwang
    Yin, Cheng
    Song, Haochen
    Shen, Tao
    Gu, Yanfeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [45] Weakly supervised fine-grained semantic segmentation via spatial correlation-guided learning
    Dong, Zihao
    Fang, Tiyu
    Li, Jinping
    Shao, Xiuli
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236
  • [46] Weakly Supervised RBM for Semantic Segmentation
    Li, Yong
    Liu, Jing
    Wang, Yuhang
    Lu, Hanqing
    Ma, Songde
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1888 - 1894
  • [47] A Survey of Weakly -supervised Semantic Segmentation
    Zhu, Kaiyin
    Xiong, Neal N.
    Lu, Mingming
    2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 10 - 15
  • [48] Uncertainty-guided dual-views for semi-supervised volumetric medical image segmentation
    Peiris, Himashi
    Hayat, Munawar
    Chen, Zhaolin
    Egan, Gary
    Harandi, Mehrtash
    NATURE MACHINE INTELLIGENCE, 2023, 5 (07) : 724 - +
  • [49] Prompt-Guided Semantic-Aware Distillation for Weakly Supervised Incremental Semantic Segmentation
    Hao, Xuze
    Jiang, Xuhao
    Ni, Wenqian
    Tan, Weimin
    Yan, Bo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 10632 - 10645
  • [50] Uncertainty-guided dual-views for semi-supervised volumetric medical image segmentation
    Himashi Peiris
    Munawar Hayat
    Zhaolin Chen
    Gary Egan
    Mehrtash Harandi
    Nature Machine Intelligence, 2023, 5 : 724 - 738