Local optimization cropping and boundary enhancement for end-to-end weakly-supervised segmentation network

Authors
Wang, Weizheng [1]
Zeng, Chao [1]
Wang, Haonan [1]
Zhou, Lei [1]
Affiliation
[1] Changsha Univ Sci & Technol, Changsha 410000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Deep learning; Weakly-supervised semantic segmentation; Computer vision; Single-stage; Boundary enhancement; Local optimization cropping; CONVOLUTIONAL NETWORKS;
DOI
10.1016/j.cviu.2024.104260
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, the performance of weakly-supervised semantic segmentation (WSSS) has improved significantly. WSSS usually employs image-level labels to generate Class Activation Maps (CAMs), from which pseudo-labels are produced, greatly reducing annotation cost. Because CNNs cannot fully identify object regions, researchers found that Vision Transformers (ViT) can complement CNNs by better extracting global contextual information. However, ViT also introduces the problem of over-smoothing. Great progress has been made in recent years on the over-smoothing problem, yet two issues remain. First, the high-confidence regions in the network-generated CAM still contain areas irrelevant to the class. Second, CAM boundaries are inaccurate and include a small portion of background. The precision of label boundaries is closely tied to segmentation performance. In this work, to address the first issue, we propose a local optimized cropping module (LOC). By randomly cropping selected regions, we allow the local class tokens to be contrasted with the global class tokens, which improves consistency between local and global representations. To address the second issue, we design a boundary enhancement module (BE) that uses an erasing strategy to re-train on the image, strengthening the network's extraction of boundary information and greatly improving the accuracy of CAM boundaries, thereby enhancing the quality of pseudo-labels. Experiments on the PASCAL VOC dataset show that our proposed LOC-BE Net outperforms multi-stage methods and is competitive with end-to-end methods. On the PASCAL VOC dataset, our method achieves a CAM mIoU of 74.2% and a segmentation mIoU of 73.1%. On the COCO2014 dataset, our method achieves a CAM mIoU of 43.8% and a segmentation mIoU of 43.4%. Our code is open source: https://github.com/whn786/LOC-BE/tree/main.
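The abstract reports results as mIoU (mean intersection-over-union). As a reading aid, here is a minimal sketch of how that metric is commonly computed for a single pair of label maps. It is not taken from the authors' repository: the function name `mean_iou`, the nested-list input format, and the `ignore_index=255` convention are assumptions made for illustration, and benchmark evaluation typically accumulates the counts over the whole dataset rather than per image.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over the classes that occur in pred or target.

    pred, target: 2D lists of integer class ids with the same shape.
    Assumes every prediction lies in [0, num_classes); target pixels equal
    to ignore_index (an assumed convention) are skipped.
    """
    inter = [0] * num_classes  # pixels where prediction and label agree on class c
    union = [0] * num_classes  # pixels where prediction or label is class c
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            if t == ignore_index:
                continue
            if p == t:
                inter[p] += 1
                union[p] += 1
            else:
                union[p] += 1
                union[t] += 1
    # Classes absent from both maps have an empty union and are left out.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Hypothetical 2x3 label maps with two classes.
pred = [[0, 0, 1],
        [0, 1, 1]]
target = [[0, 1, 1],
          [0, 1, 1]]
score = mean_iou(pred, target, num_classes=2)
```

In this toy case class 0 has IoU 2/3 and class 1 has IoU 3/4, so `score` is their average, 17/24.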
Pages: 12