Local optimization cropping and boundary enhancement for end-to-end weakly-supervised segmentation network

被引:0
|
作者
Wang, Weizheng [1 ]
Zeng, Chao [1 ]
Wang, Haonan [1 ]
Zhou, Lei [1 ]
机构
[1] Changsha Univ Sci & Technol, Changsha 410000, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Weakly-supervised semantic segmentation; Computer vision; Single-stage; Boundary enhancement; Local optimization cropping; CONVOLUTIONAL NETWORKS;
D O I
10.1016/j.cviu.2024.104260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, the performance of weakly-supervised semantic segmentation(WSSS) has significantly increased. It usually employs image-level labels to generate Class Activation Map (CAM) for producing pseudo-labels, which greatly reduces the cost of annotation. Since CNN cannot fully identify object regions, researchers found that Vision Transformers (ViT) can complement the deficiencies of CNN by better extracting global contextual information. However, ViT also introduces the problem of over-smoothing. Great progress has been made in recent years to solve the over-smoothing problem, yet two issues remain. The first issue is that the high-confidence regions in the network-generated CAM still contain areas irrelevant to the class. The second issue is the inaccuracy of CAM boundaries, which contain a small portion of background regions. As we know, the precision of label boundaries is closely tied to excellent segmentation performance. In this work, to address the first issue, we propose a local optimized cropping module (LOC). By randomly cropping selected regions, we allow the local class tokens to be contrasted with the global class tokens. This method facilitates enhanced consistency between local and global representations. To address the second issue, we design a boundary enhancement module (BE) that utilizes an erasing strategy to re-train the image, increasing the network's extraction of boundary information and greatly improving the accuracy of CAM boundaries, thereby enhancing the quality of pseudo labels. Experiments on the PASCAL VOC dataset show that the performance of our proposed LOC-BE Net outperforms multi-stage methods and is competitive with end-to-end methods. On the PASCAL VOC dataset, our method achieves a CAM mIoU of 74.2% and a segmentation mIoU of 73.1%. On the COCO2014 dataset, our method achieves a CAM mIoU of 43.8% and a segmentation mIoU of 43.4%. Our code has been open sourced: https://github.com/whn786/LOC-BE/tree/main.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation
    Xu, Rongtao
    Wang, Changwei
    Sun, Jiaxi
    Xu, Shibiao
    Meng, Weiliang
    Zhang, Xiaopeng
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3045 - 3053
  • [2] End-to-end weakly-supervised semantic alignment
    Rocco, Ignacio
    Arandjelovic, Relja
    Sivic, Josef
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6917 - 6925
  • [3] Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
    Ru, Lixiang
    Zhan, Yibing
    Yu, Baosheng
    Du, Bo
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16825 - 16834
  • [4] Find it if You Can: End-to-End Adversarial Erasing for Weakly-Supervised Semantic Segmentation
    Stammes, Erik
    Runia, Tom F. H.
    Hofmann, Michael
    Ghafoorian, Mohsen
    THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
  • [5] Liver Segmentation A Weakly End-to-End Supervised Model
    Ouassit, Youssef
    Ardchir, Soufiane
    Moulouki, Reda
    El Ghoumari, Mohammed Yassine
    Azzouazi, Mohamed
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (09) : 77 - 87
  • [6] Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation
    Zhu, Lei
    Zhang, Xinliang
    He, Hangzhou
    Chen, Qian
    Li, Sha
    Zeng, Shuang
    Zhang, Yibao
    Ren, Qiushi
    Lu, Yanye
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [7] End-to-end weakly supervised semantic segmentation with reliable region mining
    Zhang, Bingfeng
    Xiao, Jimin
    Wei, Yunchao
    Huang, Kaizhu
    Luo, Shan
    Zhao, Yao
    PATTERN RECOGNITION, 2022, 128
  • [8] Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning
    Li, Hui
    Sun, Mingjie
    Xiao, Jimin
    Lim, Eng Gee
    Zhao, Yao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5999 - 6012
  • [9] Global Consistency Enhancement Network for Weakly-Supervised Semantic Segmentation
    Jiang, Le
    Yang, Xinhao
    Ma, Liyan
    Li, Zhenglin
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 53 - 65
  • [10] End-to-end weakly supervised semantic segmentation based on superpixel similarity comparison and feature channel optimization
    Wang, Weizheng
    Wang, Haonan
    Zhou, Lei
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118