SAM-RSP: A new few-shot segmentation method based on segment anything model and rough segmentation prompts

Cited by: 0
Authors
Li, Jiaguang [1]
Wei, Ying [1]
Zhang, Wei [1]
Shi, Zhenrui [1]
Affiliation
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
Keywords
Few-shot segmentation; Prompt learning; Prototype learning; Segment anything model (SAM); Semantic segmentation
DOI
10.1016/j.imavis.2024.105214
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Few-shot segmentation (FSS) aims to segment novel classes using only a few labeled images. The backbones used in existing methods are pre-trained on classification tasks on the ImageNet dataset. Although these backbones can effectively perceive the semantic categories of images, they cannot accurately perceive the regional boundaries within an image, which limits model performance. Recently, the Segment Anything Model (SAM) has achieved precise image segmentation from point or box prompts, thanks to its excellent perception of region boundaries within an image. However, it cannot effectively provide semantic information about images. This paper proposes a new few-shot segmentation method that can effectively perceive both semantic categories and regional boundaries. The method first uses the SAM encoder to perceive regions and obtain the query embedding. Then the support and query images are input into a backbone pre-trained on ImageNet to perceive semantics and generate a rough segmentation prompt (RSP). The query embedding is combined with the prompt to generate a pixel-level query prototype, which can better match the query embedding. Finally, the query embedding, prompt, and prototype are combined and input into the designed multi-layer prompt transformer decoder, which is more efficient and lightweight and can provide a more accurate segmentation result. In addition, other methods can be easily combined with this framework to improve their performance. Extensive experiments on PASCAL-5^i and COCO-20^i under 1-shot and 5-shot settings demonstrate the effectiveness of the method, which also achieves new state-of-the-art results. Code is available at https://github.com/Jiaguang-NEU/SAM-RSP.
Pages: 12
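The abstract describes combining a query embedding with a rough segmentation prompt to form a prototype that is then matched against the embedding. As a rough illustration of that general idea only, the sketch below uses masked average pooling followed by cosine similarity, a common building block in prototype learning. It is not the paper's pixel-level prototype or its decoder: it produces a single global prototype, and the function names, the nested-list data layout, and the toy values are all assumptions made for this example.

```python
import math


def masked_average_prototype(embedding, prompt):
    """Weighted mean of feature vectors, weighted by a rough foreground prompt.

    embedding: H x W nested list of C-dimensional feature vectors (hypothetical layout).
    prompt:    H x W nested list of foreground weights in [0, 1].
    """
    channels = len(embedding[0][0])
    total = [0.0] * channels
    weight_sum = 0.0
    for feat_row, prompt_row in zip(embedding, prompt):
        for feat, weight in zip(feat_row, prompt_row):
            weight_sum += weight
            for c in range(channels):
                total[c] += weight * feat[c]
    if weight_sum == 0.0:
        return total  # empty prompt: fall back to an all-zero prototype
    return [value / weight_sum for value in total]


def cosine(a, b):
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_map(embedding, prototype):
    """Per-pixel cosine similarity between the embedding and the prototype."""
    return [[cosine(feat, prototype) for feat in row] for row in embedding]


# Toy 2 x 2 embedding with 2 channels: top row is "foreground-like".
toy_embedding = [
    [[1.0, 0.0], [1.0, 0.0]],
    [[0.0, 1.0], [0.0, 1.0]],
]
# Toy rough prompt marking the top row as foreground.
toy_prompt = [
    [1.0, 1.0],
    [0.0, 0.0],
]

toy_prototype = masked_average_prototype(toy_embedding, toy_prompt)
toy_similarity = similarity_map(toy_embedding, toy_prototype)
```

In this toy case the prototype is the mean of the two top-row features, so the similarity map is high on the top row and zero on the bottom row. A real pipeline would operate on encoder feature tensors rather than nested lists.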