Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation

被引:0
|
作者
Wang, Weizheng [1 ]
Zhou, Lei [1 ]
Wang, Haonan [1 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China
基金
中国国家自然科学基金;
关键词
Weakly supervised semantic segmentation; Vision transformer; Multi-scale feature correspondence; Pseudo label retraining strategy;
D O I
10.1016/j.imavis.2024.105215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the performance of semantic segmentation using weakly supervised learning has significantly improved. Weakly supervised semantic segmentation (WSSS) that uses only image-level labels has received widespread attention, it employs Class Activation Maps (CAM) to generate pseudo labels. Compared to traditional use of pixel-level labels, this technique greatly reduces annotation costs by utilizing simpler and more readily available image-level annotations. Besides, due to the local perceptual ability of Convolutional Neural Networks (CNN), the generated CAM cannot activate the entire object area. Researchers have found that this CNN limitation can be compensated for by using Vision Transformer (ViT). However, ViT also introduces an over-smoothing problem. Recent research has made good progress in solving this issue, but when discussing CAM and its related segmentation predictions, it is easy to overlook their intrinsic information and the interrelationships between them. In this paper, we propose a Multi-Scale Feature Correspondence (MSFC) method. Our MSFC can obtain the feature correspondence of CAM and segmentation predictions at different scales, reextract useful semantic information from them, enhancing the network's learning of feature information and improving the quality of CAM. Moreover, to further improve the segmentation precision, we design a Pseudo Label Retraining Strategy (PLRS). This strategy refines the accuracy in local regions, elevates the quality of pseudo labels, and aims to enhance segmentation precision. Experimental results on the PASCAL VOC 2012 and MS COCO 2014 datasets show that our method achieves impressive performance among end-to-end WSSS methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Point Cloud Semantic Segmentation Network Based on Multi-Scale Feature Fusion
    Du, Jing
    Jiang, Zuning
    Huang, Shangfeng
    Wang, Zongyue
    Su, Jinhe
    Su, Songjian
    Wu, Yundong
    Cai, Guorong
    SENSORS, 2021, 21 (05) : 1 - 20
  • [42] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (10) : 30073 - 30093
  • [43] Global and Local Multi-scale Feature Fusion for Object Detection and Semantic Segmentation
    Lim, Young-Chul
    Kang, Minsung
    2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 2557 - 2562
  • [44] Semantic segmentation of multi-scale remote sensing images with contextual feature enhancement
    Zhang, Mei
    Liu, Lingling
    Pei, Yongtao
    Xie, Guojing
    Wen, Jinghua
    VISUAL COMPUTER, 2025, 41 (02): : 1303 - 1317
  • [45] Non-target feature filtering for weakly supervised semantic segmentation
    Zhou, Xuesheng
    Li, Yan
    Cao, Guitao
    Cao, Wenming
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [46] Weakly Supervised Semantic Segmentation with a Multi-Image Model
    Vezhnevets, Alexander
    Ferrari, Vittorio
    Buhmann, Joachim M.
    2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2011, : 643 - 650
  • [47] DISCOBOX: Weakly Supervised Instance Segmentation and Semantic Correspondence from Box Supervision
    Lan, Shiyi
    Yu, Zhiding
    Choy, Christopher
    Radhakrishnan, Subhashree
    Liu, Guilin
    Zhu, Yuke
    Davis, Larry S.
    Anandkumar, Anima
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3386 - 3396
  • [48] JMLNet: Joint Multi-Label Learning Network for Weakly Supervised Semantic Segmentation in Aerial Images
    Guo, Rongxin
    Sun, Xian
    Chen, Kaiqiang
    Zhou, Xiao
    Yan, Zhiyuan
    Diao, Wenhui
    Yan, Menglong
    REMOTE SENSING, 2020, 12 (19) : 1 - 18
  • [49] GSTO: Gated Scale-Transfer Operation for Multi-Scale Feature Learning in Semantic Segmentation
    Wang, Zhuoying
    Wang, Yongtao
    Tang, Zhi
    Li, Yangyan
    Chen, Ying
    Ling, Haibin
    Lin, Weisi
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7111 - 7118
  • [50] GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation
    Pu, Mengyang
    Huang, Yaping
    Guan, Qingji
    Zou, Qi
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 483 - 491