Sparsely Annotated Semantic Segmentation with Adaptive Gaussian Mixtures

被引:17
|
作者
Wu, Linshan [1 ]
Zhong, Zhun [2 ]
Fang, Leyuan [1 ]
He, Xingxin [1 ]
Liu, Qiang [1 ]
Ma, Jiayi [3 ]
Chen, Hao [4 ]
机构
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Wuhan Univ, Sch Elect Informat, Wuhan, Peoples R China
[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.01483
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparsely annotated semantic segmentation (SASS) aims to learn a segmentation model by images with sparse labels (i.e., points or scribbles). Existing methods mainly focus on introducing low-level affinity or generating pseudo labels to strengthen supervision, while largely ignoring the inherent relation between labeled and unlabeled pixels. In this paper, we observe that pixels that are close to each other in the feature space are more likely to share the same class. Inspired by this, we propose a novel SASS framework, which is equipped with an Adaptive Gaussian Mixture Model (AGMM). Our AGMM can effectively endow reliable supervision for unlabeled pixels based on the distributions of labeled and unlabeled pixels. Specifically, we first build Gaussian mixtures using labeled pixels and their relatively similar unlabeled pixels, where the labeled pixels act as centroids, for modeling the feature distribution of each class. Then, we leverage the reliable information from labeled pixels and adaptively generated GMM predictions to supervise the training of unlabeled pixels, achieving online, dynamic, and robust self-supervision. In addition, by capturing category-wise Gaussian mixtures, AGMM encourages the model to learn discriminative class decision boundaries in an end-to-end contrastive learning manner. Experimental results conducted on the PASCAL VOC 2012 and Cityscapes datasets demonstrate that our AGMM can establish new state-of-the-art SASS performance. Code is available at https://github.com/Luffy03/AGMM-SASS.
引用
收藏
页码:15454 / 15464
页数:11
相关论文
共 50 条
  • [1] SASFormer: Transformers for Sparsely Annotated Semantic Segmentation
    Su, Hui
    Ye, Yue
    Hua, Wei
    Cheng, Lechao
    Song, Mingli
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 390 - 395
  • [2] Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation
    Liang, Zhiyuan
    Wang, Tiancai
    Zhang, Xiangyu
    Sun, Jian
    Shen, Jianbing
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16886 - 16895
  • [3] Learning from sparsely annotated data for semantic segmentation in histopathology images
    Bokhorst, John-Melle
    Pinckaers, Hans
    van Zwam, Peter
    Nagtegaal, Iris
    van der Laak, Jeroen
    Ciompi, Francesco
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 102, 2019, 102 : 84 - 91
  • [4] Semantic segmentation of sparsely annotated 3D point clouds by pseudo-labelling
    Xu, Katie
    Yao, Yasuhiro
    Murasaki, Kazuhiko
    Ando, Shingo
    Sagata, Atsushi
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 463 - 471
  • [5] Pseudo-labelling-aided semantic segmentation on sparsely annotated 3D point clouds
    Yao Y.
    Xu K.
    Murasaki K.
    Ando S.
    Sagata A.
    IPSJ Transactions on Computer Vision and Applications, 2020, 12 (01)
  • [6] Medical volume segmentation by overfitting sparsely annotated data
    Payer, Tristan
    Nizamani, Faraz
    Beer, Meinrad
    Goetz, Michael
    Ropinski, Timo
    JOURNAL OF MEDICAL IMAGING, 2023, 10 (04)
  • [7] A Bayesian framework for background segmentation based on adaptive Gaussian mixtures
    Lee, DS
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL XIV, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING III, 2002, : 76 - 81
  • [8] Automatic Segmentation of Head Anatomical Structures from Sparsely-annotated Images
    Sugino, Takaaki
    Roth, Holger R.
    Eshghi, Mohammad
    Oda, Masahiro
    Chung, Min Suk
    Mori, Kensaku
    2017 IEEE INTERNATIONAL CONFERENCE ON CYBORG AND BIONIC SYSTEMS (CBS), 2017, : 145 - 149
  • [9] Gaussian Dilated Convolution for Semantic Image Segmentation
    Shen, Falong
    Zeng, Gang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 324 - 334
  • [10] Adaptive detection of sparsely distributed target in non-Gaussian clutter
    Jian, T.
    He, Y.
    Su, F.
    Qu, C.
    Ping, D.
    IET RADAR SONAR AND NAVIGATION, 2011, 5 (07): : 780 - 787