Weakly supervised semantic segmentation via saliency perception with uncertainty-guided noise suppression

被引：0

作者：

Liu, Xinyi ^{[1
]}

Huang, Guoheng ^{[1
]}

Yuan, Xiaochen ^{[2
]}

Zheng, Zewen ^{[1
]}

Zhong, Guo ^{[3
]}

Chen, Xuhang ^{[4
]}

Pun, Chi-Man ^{[5
]}

机构：

[1] Guangdong Univ Technol, Guangzhou, Peoples R China

[2] Macao Polytech Univ, Macau, Peoples R China

[3] Guangdong Univ Foreign Studies, Guangzhou, Peoples R China

[4] Huizhou Univ, Huizhou, Peoples R China

[5] Univ Macau, Macau, Peoples R China

来源：

VISUAL COMPUTER | 2025年 / 41卷 / 04期

关键词：

Weakly Supervised Semantic Segmentation; Class Activation Mapping; Uncertainty estimation; Attention mechanism;

D O I：

10.1007/s00371-024-03574-1

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Weakly Supervised Semantic Segmentation (WSSS) has become increasingly popular for achieving remarkable segmentation with only image-level labels. Current WSSS approaches extract Class Activation Mapping (CAM) from classification models to produce pseudo-masks for segmentation supervision. However, due to the gap between image-level supervised classification loss and pixel-level CAM generation tasks, the model tends to activate discriminative regions at the image level rather than pursuing pixel-level classification results. Moreover, insufficient supervision leads to unrestricted attention diffusion in the model, further introducing inter-class recognition noise. In this paper, we introduce a framework that employs Saliency Perception and Uncertainty, which includes a Saliency Perception Module (SPM) with Pixel-wise Transfer Loss (SP-PT), and an Uncertainty-guided Noise Suppression method. Specifically, within the SPM, we employ a hybrid attention mechanism to expand the receptive field of the module and enhance its ability to perceive salient object features. Meanwhile, a Pixel-wise Transfer Loss is designed to guide the attention diffusion of the classification model to non-discriminative regions at the pixel-level, thereby mitigating the bias of the model. To further enhance the robustness of CAM for obtaining more accurate pseudo-masks, we propose a noise suppression method based on uncertainty estimation, which applies a confidence matrix to the loss function to suppress the propagation of erroneous information and correct it, thus making the model more robust to noise. We conducted experiments on the PASCAL VOC 2012 and MS COCO 2014, and the experimental results demonstrate the effectiveness of our proposed framework. Code is available at https://github.com/pur-suit/SPU.

引用

页码：2891 / 2906

页数：16

共 50 条

[41] Weakly supervised semantic segmentation via self-supervised destruction learning
Li, Jinlong
Jie, Zequn
Wang, Xu
Zhou, Yu
Ma, Lin
Jiang, Jianmin
NEUROCOMPUTING, 2023, 561
[42] Uncertainty-Guided Voxel-Level Supervised Contrastive Learning for Semi-Supervised Medical Image Segmentation
Hua, Yu
Shu, Xin
Wang, Zizhou
Zhang, Lei
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2022, 32 (04)
[43] UNCERTAINTY-GUIDED ROBUST TRAINING FOR MEDICAL IMAGE SEGMENTATION
Li, Yan
Chen, Xiaoyi
Quan, Li
Zhang, Ni
2021 IEEE 18TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI), 2021, : 1471 - 1475
[44] UTFNet: Uncertainty-Guided Trustworthy Fusion Network for RGB-Thermal Semantic Segmentation
Wang, Qingwang
Yin, Cheng
Song, Haochen
Shen, Tao
Gu, Yanfeng
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
[45] Weakly supervised fine-grained semantic segmentation via spatial correlation-guided learning
Dong, Zihao
Fang, Tiyu
Li, Jinping
Shao, Xiuli
COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236
[46] Weakly Supervised RBM for Semantic Segmentation
Li, Yong
Liu, Jing
Wang, Yuhang
Lu, Hanqing
Ma, Songde
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 1888 - 1894
[47] A Survey of Weakly -supervised Semantic Segmentation
Zhu, Kaiyin
Xiong, Neal N.
Lu, Mingming
2023 IEEE 9TH INTL CONFERENCE ON BIG DATA SECURITY ON CLOUD, BIGDATASECURITY, IEEE INTL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, HPSC AND IEEE INTL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS, 2023, : 10 - 15
[48] Uncertainty-guided dual-views for semi-supervised volumetric medical image segmentation
Peiris, Himashi
Hayat, Munawar
Chen, Zhaolin
Egan, Gary
Harandi, Mehrtash
NATURE MACHINE INTELLIGENCE, 2023, 5 (07) : 724 - +
[49] Prompt-Guided Semantic-Aware Distillation for Weakly Supervised Incremental Semantic Segmentation
Hao, Xuze
Jiang, Xuhao
Ni, Wenqian
Tan, Weimin
Yan, Bo
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 10632 - 10645
[50] Uncertainty-guided dual-views for semi-supervised volumetric medical image segmentation
Himashi Peiris
Munawar Hayat
Zhaolin Chen
Gary Egan
Mehrtash Harandi
Nature Machine Intelligence, 2023, 5 : 724 - 738

← 1 2 3 4 5 →