Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation

Cited by: 0

Authors:

Wang, Weizheng [1]

Zhou, Lei [1]

Wang, Haonan [1]

Affiliation:

[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China

Source:

IMAGE AND VISION COMPUTING | 2024 / Volume 150

Funding:

National Natural Science Foundation of China;

Keywords:

Weakly supervised semantic segmentation; Vision transformer; Multi-scale feature correspondence; Pseudo label retraining strategy;

DOI:

10.1016/j.imavis.2024.105215

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The performance of semantic segmentation using weakly supervised learning has improved significantly in recent years. Weakly supervised semantic segmentation (WSSS) that uses only image-level labels has received widespread attention; it employs Class Activation Maps (CAM) to generate pseudo labels. Compared with the traditional use of pixel-level labels, this technique greatly reduces annotation costs by relying on simpler and more readily available image-level annotations. However, because Convolutional Neural Networks (CNN) have only local perceptual ability, the generated CAM cannot activate the entire object area. Researchers have found that this CNN limitation can be compensated for by using a Vision Transformer (ViT), but ViT in turn introduces an over-smoothing problem. Recent research has made good progress on this issue, yet work on CAM and the related segmentation predictions tends to overlook their intrinsic information and the interrelationships between them. In this paper, we propose a Multi-Scale Feature Correspondence (MSFC) method. MSFC obtains the feature correspondence between CAM and segmentation predictions at different scales and re-extracts useful semantic information from them, enhancing the network's learning of feature information and improving the quality of CAM. Moreover, to further improve segmentation precision, we design a Pseudo Label Retraining Strategy (PLRS). This strategy refines accuracy in local regions and raises the quality of the pseudo labels. Experimental results on the PASCAL VOC 2012 and MS COCO 2014 datasets show that our method achieves impressive performance among end-to-end WSSS methods.
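
For readers unfamiliar with the CAM step mentioned in the abstract, the following is a minimal sketch of how generic class activation maps can be turned into pseudo labels. It is not the paper's MSFC or PLRS, which this record does not describe in detail. The function names, the array shapes, and the thresholds `low` and `high` are assumptions chosen for illustration only.

```python
import numpy as np


def class_activation_maps(features, weights):
    """Generic CAM: weight feature maps by classifier weights.

    features: (C, H, W) feature maps from the last layer (assumed shape).
    weights:  (K, C) classifier weights, one row per class (assumed shape).
    Returns (K, H, W) maps, each scaled to [0, 1].
    """
    cams = np.einsum("kc,chw->khw", weights, features)
    cams = np.maximum(cams, 0.0)  # keep positive evidence only
    peak = cams.max(axis=(1, 2), keepdims=True)
    return cams / np.maximum(peak, 1e-8)  # avoid division by zero


def pseudo_labels(cams, present, low=0.3, high=0.6, ignore_index=255):
    """Threshold CAMs into a pseudo label map.

    present: (K,) boolean image-level labels; absent classes are zeroed.
    low/high are hypothetical thresholds: scores below `low` become
    background (0), scores at or above `high` take the arg-max class
    (1..K), and everything in between is marked `ignore_index`.
    """
    cams = cams * present[:, None, None]
    score = cams.max(axis=0)
    best = cams.argmax(axis=0) + 1  # class ids start at 1
    out = np.full(score.shape, ignore_index, dtype=np.int64)
    out[score < low] = 0
    confident = score >= high
    out[confident] = best[confident]
    return out
```

The three-way split (background, confident foreground, ignored) is one common design choice for this kind of thresholding: uncertain pixels are excluded from the segmentation loss rather than forced into a class.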

Pages: 11
