Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation

Cited by: 0
Authors
Wang, Weizheng [1]
Zhou, Lei [1]
Wang, Haonan [1]
Affiliations
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Weakly supervised semantic segmentation; Vision transformer; Multi-scale feature correspondence; Pseudo label retraining strategy;
DOI
10.1016/j.imavis.2024.105215
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recently, the performance of semantic segmentation using weakly supervised learning has significantly improved. Weakly supervised semantic segmentation (WSSS) that uses only image-level labels has received widespread attention; it employs Class Activation Maps (CAM) to generate pseudo labels. Compared to the traditional use of pixel-level labels, this technique greatly reduces annotation costs by relying on simpler and more readily available image-level annotations. However, because Convolutional Neural Networks (CNN) perceive only local regions, the generated CAM cannot activate the entire object area. Researchers have found that this CNN limitation can be compensated for by using a Vision Transformer (ViT). ViT, in turn, introduces an over-smoothing problem. Recent research has made good progress on this issue, but discussions of CAM and its related segmentation predictions tend to overlook their intrinsic information and the interrelationships between them. In this paper, we propose a Multi-Scale Feature Correspondence (MSFC) method. MSFC obtains the feature correspondence of CAM and segmentation predictions at different scales and re-extracts useful semantic information from them, enhancing the network's learning of feature information and improving the quality of CAM. Moreover, to further improve segmentation precision, we design a Pseudo Label Retraining Strategy (PLRS). This strategy refines accuracy in local regions and raises the quality of pseudo labels. Experimental results on the PASCAL VOC 2012 and MS COCO 2014 datasets show that our method achieves impressive performance among end-to-end WSSS methods.
Pages: 11
Related papers
50 in total
  • [21] DenserNet: Weakly Supervised Visual Localization Using Multi-Scale Feature Aggregation
    Liu, Dongfang
    Cui, Yiming
    Yan, Liqi
    Mousas, Christos
    Yang, Baijian
    Chen, Yingjie
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6101 - 6109
  • [22] Dynamic feature regularized loss for weakly supervised semantic segmentation
    Zhang, Bingfeng
    Xiao, Jimin
    Zhao, Yao
    PATTERN RECOGNITION, 2025, 164
  • [23] Atrous convolutional feature network for weakly supervised semantic segmentation
    Xu, Lian
    Xue, Hao
    Bennamoun, Mohammed
    Boussaid, Farid
    Sohel, Ferdous
    NEUROCOMPUTING, 2021, 421 : 115 - 126
  • [24] Atrous convolutional feature network for weakly supervised semantic segmentation
    Xu L.
    Xue H.
    Bennamoun M.
    Boussaid F.
    Sohel F.
    Neurocomputing, 2021, 421 : 115 - 126
  • [25] Adaptive multi-scale feature fusion with spatial translation for semantic segmentation
    Wang, Hongru
    Wang, Haoyu
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (11) : 8337 - 8348
  • [26] MFPNet: A Multi-scale Feature Propagation Network for Lightweight Semantic Segmentation
    Xu, Guoan
    Jia, Wenjing
    Wu, Tao
    Chen, Ligeng
    Gao, Guangwei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT III, 2024, 15018 : 76 - 86
  • [27] Semantic Segmentation Method Based on Residual and Multi-Scale Feature Fusion
    Xiu, Chunbo
    Su, Huan
    Su, Xuemiao
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 2078 - 2083
  • [28] Multi-Scale Feature Aggregation Network for Semantic Segmentation of Land Cover
    Shen, Xu
    Weng, Liguo
    Xia, Min
    Lin, Haifeng
    REMOTE SENSING, 2022, 14 (23)
  • [29] Semantic Segmentation on Remote Sensing Images with Multi-Scale Feature Fusion
    Zhang J.
    Jin Q.
    Wang H.
    Da C.
    Xiang S.
    Pan C.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (09) : 1509 - 1517
  • [30] Beyond Pixels: Semi-supervised Semantic Segmentation with a Multi-scale Patch-Based Multi-label Classifier
    Howlader, Prantik
    Das, Srijan
    Le, Hieu
    Samaras, Dimitris
    COMPUTER VISION - ECCV 2024, PT LXXV, 2025, 15133 : 342 - 360