Local optimization cropping and boundary enhancement for end-to-end weakly-supervised segmentation network

被引：0

作者：

Wang, Weizheng ^{[1
]}

Zeng, Chao ^{[1
]}

Wang, Haonan ^{[1
]}

Zhou, Lei ^{[1
]}

机构：

[1] Changsha Univ Sci & Technol, Changsha 410000, Peoples R China

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2025年 / 251卷

基金：

中国国家自然科学基金;

关键词：

Deep learning; Weakly-supervised semantic segmentation; Computer vision; Single-stage; Boundary enhancement; Local optimization cropping; CONVOLUTIONAL NETWORKS;

D O I：

10.1016/j.cviu.2024.104260

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, the performance of weakly-supervised semantic segmentation(WSSS) has significantly increased. It usually employs image-level labels to generate Class Activation Map (CAM) for producing pseudo-labels, which greatly reduces the cost of annotation. Since CNN cannot fully identify object regions, researchers found that Vision Transformers (ViT) can complement the deficiencies of CNN by better extracting global contextual information. However, ViT also introduces the problem of over-smoothing. Great progress has been made in recent years to solve the over-smoothing problem, yet two issues remain. The first issue is that the high-confidence regions in the network-generated CAM still contain areas irrelevant to the class. The second issue is the inaccuracy of CAM boundaries, which contain a small portion of background regions. As we know, the precision of label boundaries is closely tied to excellent segmentation performance. In this work, to address the first issue, we propose a local optimized cropping module (LOC). By randomly cropping selected regions, we allow the local class tokens to be contrasted with the global class tokens. This method facilitates enhanced consistency between local and global representations. To address the second issue, we design a boundary enhancement module (BE) that utilizes an erasing strategy to re-train the image, increasing the network's extraction of boundary information and greatly improving the accuracy of CAM boundaries, thereby enhancing the quality of pseudo labels. Experiments on the PASCAL VOC dataset show that the performance of our proposed LOC-BE Net outperforms multi-stage methods and is competitive with end-to-end methods. On the PASCAL VOC dataset, our method achieves a CAM mIoU of 74.2% and a segmentation mIoU of 73.1%. On the COCO2014 dataset, our method achieves a CAM mIoU of 43.8% and a segmentation mIoU of 43.4%. Our code has been open sourced: https://github.com/whn786/LOC-BE/tree/main.

引用

页数：12

共 50 条

[1] Self Correspondence Distillation for End-to-End Weakly-Supervised Semantic Segmentation
Xu, Rongtao
Wang, Changwei
Sun, Jiaxi
Xu, Shibiao
Meng, Weiliang
Zhang, Xiaopeng
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 3045 - 3053
[2] End-to-end weakly-supervised semantic alignment
Rocco, Ignacio
Arandjelovic, Relja
Sivic, Josef
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6917 - 6925
[3] Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers
Ru, Lixiang
Zhan, Yibing
Yu, Baosheng
Du, Bo
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16825 - 16834
[4] Find it if You Can: End-to-End Adversarial Erasing for Weakly-Supervised Semantic Segmentation
Stammes, Erik
Runia, Tom F. H.
Hofmann, Michael
Ghafoorian, Mohsen
THIRTEENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2021), 2021, 11878
[5] Liver Segmentation A Weakly End-to-End Supervised Model
Ouassit, Youssef
Ardchir, Soufiane
Moulouki, Reda
El Ghoumari, Mohammed Yassine
Azzouazi, Mohamed
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2020, 16 (09) : 77 - 87
[6] Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation
Zhu, Lei
Zhang, Xinliang
He, Hangzhou
Chen, Qian
Li, Sha
Zeng, Shuang
Zhang, Yibao
Ren, Qiushi
Lu, Yanye
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[7] End-to-end weakly supervised semantic segmentation with reliable region mining
Zhang, Bingfeng
Xiao, Jimin
Wei, Yunchao
Huang, Kaizhu
Luo, Shan
Zhao, Yao
PATTERN RECOGNITION, 2022, 128
[8] Fully and Weakly Supervised Referring Expression Segmentation With End-to-End Learning
Li, Hui
Sun, Mingjie
Xiao, Jimin
Lim, Eng Gee
Zhao, Yao
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5999 - 6012
[9] Global Consistency Enhancement Network for Weakly-Supervised Semantic Segmentation
Jiang, Le
Yang, Xinhao
Ma, Liyan
Li, Zhenglin
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 53 - 65
[10] End-to-end weakly supervised semantic segmentation based on superpixel similarity comparison and feature channel optimization
Wang, Weizheng
Wang, Haonan
Zhou, Lei
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 118

← 1 2 3 4 5 →