Multi-scale feature correspondence and pseudo label retraining strategy for weakly supervised semantic segmentation

被引:0
|
作者
Wang, Weizheng [1 ]
Zhou, Lei [1 ]
Wang, Haonan [1 ]
机构
[1] Changsha Univ Sci & Technol, Sch Comp & Commun Engn, Changsha 410076, Peoples R China
基金
中国国家自然科学基金;
关键词
Weakly supervised semantic segmentation; Vision transformer; Multi-scale feature correspondence; Pseudo label retraining strategy;
D O I
10.1016/j.imavis.2024.105215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the performance of semantic segmentation using weakly supervised learning has significantly improved. Weakly supervised semantic segmentation (WSSS) that uses only image-level labels has received widespread attention, it employs Class Activation Maps (CAM) to generate pseudo labels. Compared to traditional use of pixel-level labels, this technique greatly reduces annotation costs by utilizing simpler and more readily available image-level annotations. Besides, due to the local perceptual ability of Convolutional Neural Networks (CNN), the generated CAM cannot activate the entire object area. Researchers have found that this CNN limitation can be compensated for by using Vision Transformer (ViT). However, ViT also introduces an over-smoothing problem. Recent research has made good progress in solving this issue, but when discussing CAM and its related segmentation predictions, it is easy to overlook their intrinsic information and the interrelationships between them. In this paper, we propose a Multi-Scale Feature Correspondence (MSFC) method. Our MSFC can obtain the feature correspondence of CAM and segmentation predictions at different scales, reextract useful semantic information from them, enhancing the network's learning of feature information and improving the quality of CAM. Moreover, to further improve the segmentation precision, we design a Pseudo Label Retraining Strategy (PLRS). This strategy refines the accuracy in local regions, elevates the quality of pseudo labels, and aims to enhance segmentation precision. Experimental results on the PASCAL VOC 2012 and MS COCO 2014 datasets show that our method achieves impressive performance among end-to-end WSSS methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Mixed-UNet: Refined class activation mapping for weakly-supervised semantic segmentation with multi-scale inference
    Liu, Yang
    Lian, Lijin
    Zhang, Ersi
    Xu, Lulu
    Xiao, Chufan
    Zhong, Xiaoyun
    Li, Fang
    Jiang, Bin
    Dong, Yuhan
    Ma, Lan
    Huang, Qiming
    Xu, Ming
    Zhang, Yongbing
    Yu, Dongmei
    Yan, Chenggang
    Qin, Peiwu
    FRONTIERS IN COMPUTER SCIENCE, 2022, 4
  • [32] M-SEE: A multi-scale encoder enhancement framework for end-to-end Weakly Supervised Semantic Segmentation
    Yang, Ziqian
    Zhao, Xinqiao
    Yao, Chao
    Zhang, Quan
    Xiao, Jimin
    PATTERN RECOGNITION, 2025, 162
  • [33] Multi-Scale Classification and Contrastive Regularization: Weakly Supervised Large-Scale 3D Point Cloud Semantic Segmentation
    Wang, Jingyi
    He, Jingyang
    Liu, Yu
    Chen, Chen
    Zhang, Maojun
    Tan, Hanlin
    REMOTE SENSING, 2024, 16 (17)
  • [34] Learning pseudo labels for semi-and-weakly supervised semantic segmentation
    Wang, Yude
    Zhang, Jie
    Kan, Meina
    Shan, Shiguang
    PATTERN RECOGNITION, 2022, 132
  • [35] Pseudo-mask Matters in Weakly-supervised Semantic Segmentation
    Li, Yi
    Kuang, Zhanghui
    Liu, Liyang
    Chen, Yimin
    Zhang, Wayne
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6944 - 6953
  • [36] Multi-Granular Semantic Mining for Weakly Supervised Semantic Segmentation
    Zhang, Meijie
    Li, Jianwu
    Zhou, Tianfei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 6019 - 6028
  • [37] A multi-scale strategy for deep semantic segmentation with convolutional neural networks
    Zhao, Bonan
    Zhang, Xiaoshan
    Li, Zheng
    Hu, Xianliang
    NEUROCOMPUTING, 2019, 365 : 273 - 284
  • [38] Multi-scale Matching Networks for Semantic Correspondence
    Zhao, Dongyang
    Song, Ziyang
    Ji, Zhenghao
    Zhao, Gangming
    Ge, Weifeng
    Yu, Yizhou
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 3334 - 3344
  • [39] Multi-Scale Low-Discriminative Feature Reactivation for Weakly Supervised Object Localization
    Wang, Bo
    Yuan, Chunfeng
    Li, Bing
    Ding, Xinmiao
    Li, Zeya
    Wu, Ying
    Hu, Weiming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 6050 - 6065
  • [40] MFFLNet: lightweight semantic segmentation network based on multi-scale feature fusion
    Wei Depeng
    Wang Huabin
    Multimedia Tools and Applications, 2024, 83 : 30073 - 30093