Prototype-Based Semantic Segmentation

Cited by: 11
Authors
Zhou, Tianfei [1]
Wang, Wenguan [2]
Affiliations
[1] Beijing Inst Technol, Dept Comp Sci, Beijing 100811, Peoples R China
[2] Zhejiang Univ, CCAI, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Prototypes; Measurement; Semantic segmentation; Image segmentation; Vectors; Semantics; Transformers; prototype; nonparametric classification; online clustering; REPRESENTATION;
DOI
10.1109/TPAMI.2024.3387116
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Deep learning based semantic segmentation solutions have yielded compelling results over the preceding decade. They encompass diverse network architectures (FCN based or attention based), along with various mask decoding schemes (parametric softmax based or pixel-query based). Despite the divergence, they can be grouped within a unified framework by interpreting the softmax weights or query vectors as learnable class prototypes. In light of this prototype view, we reveal inherent limitations within the parametric segmentation regime, and accordingly develop a nonparametric alternative based on non-learnable prototypes. In contrast to previous approaches that entail the learning of a single weight/query vector per class in a fully parametric manner, our approach represents each class as a set of non-learnable prototypes, relying solely upon the mean features of training pixels within that class. The pixel-wise prediction is thus achieved by nonparametric nearest prototype retrieving. This allows our model to directly shape the pixel embedding space by optimizing the arrangement between embedded pixels and anchored prototypes. It is able to accommodate an arbitrary number of classes with a constant number of learnable parameters. Through empirical evaluation with FCN based and Transformer based segmentation models (i.e., HRNet, Swin, SegFormer, Mask2Former) and backbones (i.e., ResNet, HRNet, Swin, MiT), our nonparametric framework shows superior performance on standard segmentation datasets (i.e., ADE20K, Cityscapes, COCO-Stuff), as well as in large-vocabulary semantic segmentation scenarios. We expect that this study will provoke a rethink of the current de facto semantic segmentation model design.
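The nearest-prototype prediction described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (l2_normalize, mean_prototypes, predict), the use of cosine similarity, the toy 2-D embeddings, and the choice of one mean prototype per class are all assumptions made for illustration. The abstract describes a set of prototypes per class, so the sketch stores each class's prototypes in a list and the prediction step searches over all of them.

```python
import math
from collections import defaultdict


def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)


def mean_prototypes(embeddings, labels):
    """Build non-learnable prototypes as the normalized mean embedding per class.

    Assumption: one prototype per class. Each class maps to a list so that
    several prototypes per class could be stored instead.
    """
    groups = defaultdict(list)
    for emb, lab in zip(embeddings, labels):
        groups[lab].append(l2_normalize(emb))
    prototypes = {}
    for lab, embs in groups.items():
        dim = len(embs[0])
        mean = [sum(e[d] for e in embs) / len(embs) for d in range(dim)]
        prototypes[lab] = [l2_normalize(mean)]
    return prototypes


def predict(embedding, prototypes):
    """Return the class whose closest prototype has the highest cosine similarity."""
    query = l2_normalize(embedding)
    best_label, best_sim = None, -math.inf
    for lab, plist in prototypes.items():
        for proto in plist:
            sim = sum(a * b for a, b in zip(query, proto))
            if sim > best_sim:
                best_label, best_sim = lab, sim
    return best_label


# Hypothetical toy data: four "pixel" embeddings from two classes.
train_embeddings = [[1.0, 0.1], [0.9, 0.0], [0.0, 1.0], [0.2, 0.8]]
train_labels = ["road", "road", "sky", "sky"]
prototypes = mean_prototypes(train_embeddings, train_labels)
```

In this sketch, adding a class only adds entries to the prototype dictionary and no trainable weights, which mirrors the abstract's point that the number of learnable parameters stays constant as the number of classes grows.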
Pages: 6858-6872
Page count: 15