Learning Visual Words for Weakly-Supervised Semantic Segmentation

Cited by: 0

Authors:

Ru, Lixiang [1,2]

Du, Bo [1,2]

Wu, Chen [3]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Sch Comp Sci, Wuhan, Peoples R China

[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

[3] Wuhan Univ, LIESMARS, Wuhan, Peoples R China

Source:

PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021 | 2021

Funding:

National Natural Science Foundation of China; National Key Research and Development Program;

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Current weakly-supervised semantic segmentation (WSSS) methods with image-level labels mainly adopt class activation maps (CAM) to generate the initial pseudo labels. However, CAM usually only identifies the most discriminative object extents, which is attributed to the fact that the network doesn't need to discover the integral object to recognize image-level labels. In this work, to tackle this problem, we proposed to simultaneously learn the image-level labels and local visual word labels. Specifically, in each forward propagation, the feature maps of the input image will be encoded to visual words with a learnable codebook. By enforcing the network to classify the encoded fine-grained visual words, the generated CAM could cover more semantic regions. Besides, we also proposed a hybrid spatial pyramid pooling module that could preserve local maximum and global average values of feature maps, so that more object details and less background were considered. Based on the proposed methods, we conducted experiments on the PASCAL VOC 2012 dataset. Our proposed method achieved 67.2% mIoU on the val set and 67.3% mIoU on the test set, which outperformed recent state-of-the-art methods.
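
The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `encode_visual_words` and `hybrid_pool`, the use of cosine similarity for codebook assignment, the fixed (non-learned) codebook, and the way local maxima and the global average are combined are all assumptions made for illustration only.

```python
import numpy as np


def encode_visual_words(features, codebook):
    """Assign every spatial feature to its most similar codebook entry.

    features: array of shape (H, W, D), a feature map for one image.
    codebook: array of shape (K, D), one row per visual word.
    Cosine similarity is an assumption; the record does not state the measure.
    Returns the (H, W) map of word indices and a (K,) multi-hot vector
    marking which visual words occur in the image.
    """
    h, w, d = features.shape
    flat = features.reshape(-1, d)
    flat = flat / (np.linalg.norm(flat, axis=1, keepdims=True) + 1e-8)
    words = codebook / (np.linalg.norm(codebook, axis=1, keepdims=True) + 1e-8)
    similarity = flat @ words.T              # shape (H*W, K)
    word_ids = similarity.argmax(axis=1)     # nearest word per location
    word_labels = np.zeros(codebook.shape[0], dtype=np.float32)
    word_labels[np.unique(word_ids)] = 1.0   # image-level visual word labels
    return word_ids.reshape(h, w), word_labels


def hybrid_pool(features, grid=2):
    """Combine a global average with local maxima (hypothetical variant).

    The feature map is split into grid x grid cells; the per-cell maxima
    are averaged and added to the global average. Assumes H, W >= grid.
    """
    h, w, _ = features.shape
    global_avg = features.mean(axis=(0, 1))
    row_groups = np.array_split(np.arange(h), grid)
    col_groups = np.array_split(np.arange(w), grid)
    local_max = [
        features[r[0]:r[-1] + 1, c[0]:c[-1] + 1].max(axis=(0, 1))
        for r in row_groups
        for c in col_groups
    ]
    return global_avg + np.mean(local_max, axis=0)
```

In a training setup along the lines the abstract describes, the multi-hot `word_labels` vector would serve as an extra classification target next to the image-level labels, and the codebook would be updated during training rather than fixed as it is here.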

Citation

Pages: 982 - 988

Page count: 7

50 items in total

[1] Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling
Ru, Lixiang
Du, Bo
Zhan, Yibing
Wu, Chen
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (04) : 1127 - 1144
[3] Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning
Wang, Xiang
Liu, Sifei
Ma, Huimin
Yang, Ming-Hsuan
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1736 - 1749
[5] Weakly-Supervised Semantic Segmentation by Learning Label Uncertainty
Neven, Robby
Neven, Davy
De Brabandere, Bert
Proesmans, Marc
Goedeme, Toon
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021 : 1678 - 1686
[6] Weakly-Supervised Semantic Segmentation with Mean Teacher Learning
Tan, Li
Luo, WenFeng
Yang, Meng
INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 324 - 335
[7] A Weakly-Supervised Approach for Semantic Segmentation
Feng, Yanqing
Wang, Lunwen
PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019 : 2311 - 2314
[8] Exclusive Constrained Discriminative Learning for Weakly-Supervised Semantic Segmentation
Ying, Peng
Liu, Jing
Lu, Hanqing
Ma, Songde
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015 : 1251 - 1254
[9] Weakly-supervised Incremental learning for Semantic segmentation with Class Hierarchy
Kim, Hyoseo
Choe, Junsuk
PATTERN RECOGNITION LETTERS, 2024, 182 : 31 - 38
[10] Token Contrast for Weakly-Supervised Semantic Segmentation
Ru, Lixiang
Zheng, Hehang
Zhan, Yibing
Du, Bo
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023 : 3093 - 3102
