Learning Visual Words for Weakly-Supervised Semantic Segmentation

Cited by: 0
Authors
Ru, Lixiang [1, 2]
Du, Bo [1, 2]
Wu, Chen [3]
Affiliations
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Inst Artificial Intelligence, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Wuhan Univ, LIESMARS, Wuhan, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
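Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code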
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Current weakly-supervised semantic segmentation (WSSS) methods that rely on image-level labels mainly use class activation maps (CAM) to generate the initial pseudo labels. However, CAM usually identifies only the most discriminative object regions, because the network does not need to discover the entire object to recognize image-level labels. To tackle this problem, we propose to learn image-level labels and local visual word labels simultaneously. Specifically, in each forward pass, the feature maps of the input image are encoded into visual words with a learnable codebook. By forcing the network to classify the encoded fine-grained visual words, the generated CAM covers more semantic regions. We also propose a hybrid spatial pyramid pooling module that preserves the local maximum and global average values of feature maps, so that more object details and less background are considered. We evaluate the proposed method on the PASCAL VOC 2012 dataset, where it achieves 67.2% mIoU on the val set and 67.3% mIoU on the test set, outperforming recent state-of-the-art methods.
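The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names `encode_visual_words` and `hybrid_pyramid_pool`, the cosine-similarity assignment, the grid sizes, and the fixed (non-learned) codebook are all assumptions made for illustration. The pooling function is one possible reading of "local maximum and global average": take the maximum inside each grid cell, then average the cell maxima.

```python
import numpy as np


def encode_visual_words(features, codebook):
    """Assign each spatial feature vector to its most similar codebook entry.

    features: array of shape (C, H, W); codebook: array of shape (K, C).
    Returns the (H, W) map of word indices and a (K,) multi-hot vector
    marking which visual words occur in the image.
    Assumption: cosine similarity is used for the assignment.
    """
    c, h, w = features.shape
    flat = features.reshape(c, h * w).T  # (H*W, C)
    flat = flat / (np.linalg.norm(flat, axis=1, keepdims=True) + 1e-8)
    code = codebook / (np.linalg.norm(codebook, axis=1, keepdims=True) + 1e-8)
    similarity = flat @ code.T  # (H*W, K)
    words = similarity.argmax(axis=1)
    word_labels = np.zeros(codebook.shape[0], dtype=np.float32)
    word_labels[words] = 1.0  # image-level label: which words are present
    return words.reshape(h, w), word_labels


def hybrid_pyramid_pool(features, grid_sizes=(1, 2, 4)):
    """Max-pool inside each cell of a g x g grid, then average the cell maxima.

    features: array of shape (C, H, W) with H and W at least max(grid_sizes).
    Returns a vector of shape (C * len(grid_sizes),), one block per grid size.
    Assumption: this max-then-average scheme is illustrative only.
    """
    _, h, w = features.shape
    pooled = []
    for g in grid_sizes:
        row_groups = np.array_split(np.arange(h), g)
        col_groups = np.array_split(np.arange(w), g)
        cell_max = np.stack(
            [features[:, r][:, :, cl].max(axis=(1, 2))
             for r in row_groups for cl in col_groups],
            axis=1,
        )  # (C, g*g): local maximum of every cell
        pooled.append(cell_max.mean(axis=1))  # average over all cells
    return np.concatenate(pooled)
```

In this sketch, a grid size of 1 reduces to global max pooling, and a grid as fine as the feature map reduces to global average pooling, so intermediate grid sizes interpolate between the two. In a trained network the codebook would be a learnable parameter and the multi-hot word vector would serve as an extra classification target alongside the image-level labels.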
Pages: 982-988
Number of pages: 7