Prototype-Based Semantic Segmentation

Cited by: 11
Authors
Zhou, Tianfei [1]
Wang, Wenguan [2]
Affiliations
[1] Beijing Inst Technol, Dept Comp Sci, Beijing 100811, Peoples R China
[2] Zhejiang Univ, CCAI, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Prototypes; Measurement; Semantic segmentation; Image segmentation; Vectors; Semantics; Transformers; prototype; nonparametric classification; online clustering; REPRESENTATION;
DOI
10.1109/TPAMI.2024.3387116
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning based semantic segmentation solutions have yielded compelling results over the preceding decade. They encompass diverse network architectures (FCN based or attention based), along with various mask decoding schemes (parametric softmax based or pixel-query based). Despite the divergence, they can be grouped within a unified framework by interpreting the softmax weights or query vectors as learnable class prototypes. In light of this prototype view, we reveal inherent limitations within the parametric segmentation regime, and accordingly develop a nonparametric alternative based on non-learnable prototypes. In contrast to previous approaches that entail the learning of a single weight/query vector per class in a fully parametric manner, our approach represents each class as a set of non-learnable prototypes, relying solely upon the mean features of training pixels within that class. The pixel-wise prediction is thus achieved by nonparametric nearest prototype retrieving. This allows our model to directly shape the pixel embedding space by optimizing the arrangement between embedded pixels and anchored prototypes. It is able to accommodate an arbitrary number of classes with a constant number of learnable parameters. Through empirical evaluation with FCN based and Transformer based segmentation models (i.e., HRNet, Swin, SegFormer, Mask2Former) and backbones (i.e., ResNet, HRNet, Swin, MiT), our nonparametric framework shows superior performance on standard segmentation datasets (i.e., ADE20K, Cityscapes, COCO-Stuff), as well as in large-vocabulary semantic segmentation scenarios. We expect that this study will provoke a rethink of the current de facto semantic segmentation model design.
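The nearest-prototype prediction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`build_prototypes`, `predict`), the use of cosine similarity, and the pre-assigned sub-cluster indices are all assumptions made for illustration. The paper's keywords mention online clustering for assigning pixels to prototypes, which is not reproduced here.

```python
import math
from collections import defaultdict


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def build_prototypes(embeddings, labels, clusters):
    """Hypothetical helper: one prototype per (class, sub-cluster) group,
    computed as the mean embedding of that group. Nothing here is learned."""
    sums = {}
    counts = defaultdict(int)
    for emb, cls, k in zip(embeddings, labels, clusters):
        key = (cls, k)
        if key not in sums:
            sums[key] = [0.0] * len(emb)
        sums[key] = [s + x for s, x in zip(sums[key], emb)]
        counts[key] += 1
    return {
        key: l2_normalize([s / counts[key] for s in total])
        for key, total in sums.items()
    }


def predict(embedding, prototypes):
    """Return the class of the most similar prototype.
    Cosine similarity is an assumption of this sketch."""
    q = l2_normalize(embedding)
    best = max(
        prototypes,
        key=lambda key: sum(a * b for a, b in zip(q, prototypes[key])),
    )
    return best[0]


# Toy 2-D "pixel embeddings"; the class names and values are made up.
train_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9], [-1.0, 0.0]]
train_labels = ["road", "road", "sky", "sky", "road"]
train_clusters = [0, 0, 0, 0, 1]  # assumed given; "road" has two sub-clusters

prototypes = build_prototypes(train_embeddings, train_labels, train_clusters)
print(predict([0.8, 0.2], prototypes))
print(predict([0.2, 0.7], prototypes))
print(predict([-0.5, 0.1], prototypes))
```

In this sketch, adding a class only adds entries to the `prototypes` dictionary, which mirrors the abstract's point that the number of learnable parameters does not grow with the number of classes.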
Pages: 6858-6872
Page count: 15