Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation

Authors
Wang, Weizheng [1]
Zhou, Lei [1]
Wang, Haonan [1]
Affiliations
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Weakly supervised semantic segmentation; Vision transformer; Multi-scale feature correspondence; Pseudo label retraining strategy
DOI
10.1016/j.imavis.2024.105215
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, the performance of semantic segmentation using weakly supervised learning has improved significantly. Weakly supervised semantic segmentation (WSSS) that uses only image-level labels has received widespread attention; it employs Class Activation Maps (CAM) to generate pseudo labels. Compared with the traditional use of pixel-level labels, this technique greatly reduces annotation costs by relying on simpler and more readily available image-level annotations. However, because Convolutional Neural Networks (CNN) perceive only local regions, the generated CAM cannot activate the entire object area. Researchers have found that this CNN limitation can be compensated for by using a Vision Transformer (ViT). ViT, in turn, introduces an over-smoothing problem. Recent research has made good progress on this issue, but work on CAM and its related segmentation predictions tends to overlook their intrinsic information and the interrelationships between them. In this paper, we propose a Multi-Scale Feature Correspondence (MSFC) method. MSFC obtains the feature correspondence between CAM and segmentation predictions at different scales and re-extracts useful semantic information from them, which enhances the network's learning of feature information and improves the quality of CAM. Moreover, to further improve segmentation precision, we design a Pseudo Label Retraining Strategy (PLRS). This strategy refines accuracy in local regions and raises the quality of pseudo labels. Experimental results on the PASCAL VOC 2012 and MS COCO 2014 datasets show that our method achieves impressive performance among end-to-end WSSS methods.
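The abstract's starting point, turning Class Activation Maps into pseudo labels from image-level labels only, can be illustrated with a minimal sketch. This is the generic CAM recipe, not the paper's MSFC or PLRS. The function names, the array shapes, and the 0.4 background threshold are all assumptions made for illustration.

```python
import numpy as np


def class_activation_maps(features, weights):
    """Generic CAM: weight the final feature maps by the classifier weights.

    features: (C, H, W) feature maps from the last layer (assumed shape).
    weights:  (K, C) weights of a linear classifier over K classes.
    Returns (K, H, W) maps, clipped at zero and scaled to [0, 1] per class.
    """
    cams = np.einsum("kc,chw->khw", weights, features)
    cams = np.maximum(cams, 0.0)
    peak = cams.max(axis=(1, 2), keepdims=True)
    return cams / np.maximum(peak, 1e-8)


def pseudo_labels(cams, image_labels, threshold=0.4):
    """Turn CAMs into a pixel-level pseudo label map.

    image_labels: (K,) binary vector of classes present in the image.
    threshold:    hypothetical background cutoff, not taken from the paper.
    Returns an (H, W) integer map: 0 is background, k + 1 is class k.
    """
    # Suppress classes that the image-level label says are absent.
    cams = cams * image_labels[:, None, None]
    best = cams.argmax(axis=0)
    score = cams.max(axis=0)
    return np.where(score >= threshold, best + 1, 0)


# Toy input: two channels, two classes, a 2x2 spatial grid.
features = np.array([[[1.0, 0.0], [0.0, 0.0]],
                     [[0.0, 0.0], [0.0, 1.0]]])
weights = np.eye(2)
cams = class_activation_maps(features, weights)
labels_both = pseudo_labels(cams, np.array([1, 1]))
labels_first = pseudo_labels(cams, np.array([1, 0]))
```

In the toy input each class responds at a single pixel, so masking with the image-level labels decides which of those pixels survive as foreground.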
Pages: 11
Related papers
50 in total
  • [1] Weakly supervised semantic segmentation and optimization algorithm based on multi-scale feature model
    Xiong C.
    Zhi H.
    Tongxin Xuebao/Journal on Communications, 2019, 40 (01) : 163 - 171
  • [2] Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping
    He, Chunming
    Li, Kai
    Zhang, Yachao
    Xu, Guoxia
    Tang, Longxiang
    Zhang, Yulun
    Guo, Zhenhua
    Li, Xiu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [3] Scale-Aware Feature Network for Weakly Supervised Semantic Segmentation
    Xu, Lian
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    IEEE ACCESS, 2020, 8 : 75957 - 75967
  • [4] Pseudo-Label-Free Weakly Supervised Semantic Segmentation Using Image Masking
    Kim, Sangtae
    Luong Trung Nguyen
    Shim, Kyuhong
    Kim, Junhan
    Shim, Byonghyo
    IEEE ACCESS, 2022, 10 : 19401 - 19411
  • [5] Self-supervised Multi-scale Consistency for Weakly Supervised Segmentation Learning
    Valvano, Gabriele
    Leo, Andrea
    Tsaftaris, Sotirios A.
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER, AND AFFORDABLE HEALTHCARE AND AI FOR RESOURCE DIVERSE GLOBAL HEALTH (DART 2021), 2021, 12968 : 14 - 24
  • [6] A Self-Training Framework Based on Multi-Scale Attention Fusion for Weakly Supervised Semantic Segmentation
    Yang, Guoqing
    Zhu, Chuang
    Zhang, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023 : 876 - 881
  • [7] Multi-scale feature similarity-based weakly supervised lymphoma segmentation in PET/CT images
    Huang, Zhengshan
    Guo, Yu
    Zhang, Ning
    Huang, Xian
    Decazes, Pierre
    Becker, Stephanie
    Ruan, Su
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [9] Enhanced Pseudo-Label Generation With Self-Supervised Training for Weakly-Supervised Semantic Segmentation
    Qin, Zhen
    Chen, Yujie
    Zhu, Guosong
    Zhou, Erqiang
    Zhou, Yingjie
    Zhou, Yicong
    Zhu, Ce
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 7017 - 7028
  • [10] Weakly Supervised Remote Sensing Image Semantic Segmentation With Pseudo-Label Noise Suppression
    Lu, Xiao
    Jiang, Zhiguo
    Zhang, Haopeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62