Prototype-Based Semantic Segmentation

Cited by: 11
Authors
Zhou, Tianfei [1]
Wang, Wenguan [2]
Affiliations
[1] Beijing Inst Technol, Dept Comp Sci, Beijing 100811, Peoples R China
[2] Zhejiang Univ, CCAI, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Prototypes; Measurement; Semantic segmentation; Image segmentation; Vectors; Semantics; Transformers; prototype; nonparametric classification; online clustering; REPRESENTATION;
DOI
10.1109/TPAMI.2024.3387116
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning based semantic segmentation solutions have yielded compelling results over the preceding decade. They encompass diverse network architectures (FCN based or attention based), along with various mask decoding schemes (parametric softmax based or pixel-query based). Despite the divergence, they can be grouped within a unified framework by interpreting the softmax weights or query vectors as learnable class prototypes. In light of this prototype view, we reveal inherent limitations within the parametric segmentation regime, and accordingly develop a nonparametric alternative based on non-learnable prototypes. In contrast to previous approaches that entail the learning of a single weight/query vector per class in a fully parametric manner, our approach represents each class as a set of non-learnable prototypes, relying solely upon the mean features of training pixels within that class. The pixel-wise prediction is thus achieved by nonparametric nearest prototype retrieving. This allows our model to directly shape the pixel embedding space by optimizing the arrangement between embedded pixels and anchored prototypes. It is able to accommodate an arbitrary number of classes with a constant number of learnable parameters. Through empirical evaluation with FCN based and Transformer based segmentation models (i.e., HRNet, Swin, SegFormer, Mask2Former) and backbones (i.e., ResNet, HRNet, Swin, MiT), our nonparametric framework shows superior performance on standard segmentation datasets (i.e., ADE20K, Cityscapes, COCO-Stuff), as well as in large-vocabulary semantic segmentation scenarios. We expect that this study will provoke a rethink of the current de facto semantic segmentation model design.
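The nearest-prototype idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that pixel embeddings and their class labels are already available, that each training pixel carries a hypothetical sub-cluster index `sub_ids` (standing in for whatever clustering procedure assigns pixels to a prototype within a class), and that cosine similarity is the retrieval metric. All function names and the toy data are illustrative.

```python
import math
from collections import defaultdict


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def build_prototypes(embeddings, labels, sub_ids):
    """Return {(class, sub_cluster): prototype}; each prototype is the
    normalized mean of the normalized embeddings in that group.
    Nothing here is learned by gradient descent."""
    groups = defaultdict(list)
    for emb, cls, sub in zip(embeddings, labels, sub_ids):
        groups[(cls, sub)].append(l2_normalize(emb))
    return {key: l2_normalize(mean_vector(vecs)) for key, vecs in groups.items()}


def predict(embeddings, prototypes):
    """Assign each embedding the class of its most similar prototype."""
    preds = []
    for emb in embeddings:
        q = l2_normalize(emb)
        best = max(
            prototypes,
            key=lambda k: sum(a * b for a, b in zip(q, prototypes[k])),
        )
        preds.append(best[0])  # keep the class, drop the sub-cluster index
    return preds


# Toy 2-D data (assumed): class 0 has modes pointing right and up,
# class 1 has modes pointing left and down.
train_embeddings = [
    [1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9],
    [-1.0, 0.0], [-0.9, -0.1], [0.0, -1.0], [-0.1, -0.9],
]
train_labels = [0, 0, 0, 0, 1, 1, 1, 1]
train_sub_ids = [0, 0, 1, 1, 0, 0, 1, 1]

prototypes = build_prototypes(train_embeddings, train_labels, train_sub_ids)
queries = [[0.95, 0.0], [0.0, -0.8], [-0.7, 0.1], [0.2, 0.9]]
predictions = predict(queries, prototypes)
print(predictions)
```

In this sketch, adding a class only adds entries to the `prototypes` dictionary, which are computed as means rather than trained, so the count of learnable parameters does not depend on the number of classes.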
Pages: 6858-6872
Number of pages: 15