SAM-RSP: A new few-shot segmentation method based on segment anything model and rough segmentation prompts

被引:0
|
作者
Li, Jiaguang [1 ]
Wei, Ying [1 ]
Zhang, Wei [1 ]
Shi, Zhenrui [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Peoples R China
关键词
Few-shot segmentation; Prompt learning; Prototype learning; Segment anything model (SAM); Semantic segmentation;
D O I
10.1016/j.imavis.2024.105214
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot segmentation (FSS) aims to segment novel classes with a few labeled images. The backbones used in existing methods are pre-trained through classification tasks on the ImageNet dataset. Although these backbones can effectively perceive the semantic categories of images, they cannot accurately perceive the regional boundaries within one image, which limits the model performance. Recently, Segment Anything Model (SAM) has achieved precise image segmentation based on point or box prompts, thanks to its excellent perception of region boundaries within one image. However, it cannot effectively provide semantic information of images. This paper proposes a new few-shot segmentation method that can effectively perceive both semantic categories and regional boundaries. This method first utilizes the SAM encoder to perceive regions and obtain the query embedding. Then the support and query images are input into a backbone pre-trained on ImageNet to perceive semantics and generate a rough segmentation prompt (RSP). This query embedding is combined with the prompt to generate a pixel-level query prototype, which can better match the query embedding. Finally, the query embedding, prompt, and prototype are combined and input into the designed multi-layer prompt transformer decoder, which is more efficient and lightweight, and can provide a more accurate segmentation result. In addition, other methods can be easily combined with our framework to improve their performance. Plenty of experiments on PASCAL-5i and COCO-20i under 1-shot and 5-shot settings prove the effectiveness of our method. Our method also achieves new state-of-the-art. Codes are available at https://github.com/Jiaguang-NE U/SAM-RSP.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A few-shot semantic segmentation method based on adaptively mining correlation network
    Huang, Zhifu
    Jiang, Bin
    Liu, Yu
    ROBOTICA, 2023, 41 (06) : 1828 - 1836
  • [22] DETR-SAM: Automated Few-Shot Segmentation with Detection Transformer and Keypoint Matching
    Khanmohamadi, Mohamadreza
    Farahani, Bahar
    2024 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS, COINS 2024, 2024, : 345 - 350
  • [23] Few-shot object segmentation with a new feature aggregation module
    Liu, Kaijun
    Lyu, Shujing
    Shivakumara, Palaiahnakote
    Lu, Yue
    DISPLAYS, 2023, 78
  • [24] ICDAR 2024 Competition on Few-Shot and Many-Shot Layout Segmentation of Ancient Manuscripts (SAM)
    Zottin, Silvia
    De Nardin, Axel
    Foresti, Gian Luca
    Colombi, Emanuela
    Piciarelli, Claudio
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809 : 315 - 331
  • [25] Leaf only SAM: A segment anything pipeline for zero-shot automated leaf segmentation
    Williams, Dominic
    Macfarlane, Fraser
    Britten, Avril
    SMART AGRICULTURAL TECHNOLOGY, 2024, 8
  • [26] Few-Shot Segmentation based on Global-cross Attention
    Wang, Cailing
    Xu, Yinpeng
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4905 - 4910
  • [27] SAM-Path: A Segment Anything Model for Semantic Segmentation in Digital Pathology
    Zhang, Jingwei
    Ma, Ke
    Kapse, Saarthak
    Saltz, Joel
    Vakalopoulou, Maria
    Prasanna, Prateek
    Samaras, Dimitris
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023 WORKSHOPS, 2023, 14393 : 161 - 170
  • [28] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
    Ye, Maoyuan
    Zhang, Jing
    Liu, Juhua
    Liu, Chenyu
    Yin, Baocai
    Liu, Cong
    Du, Bo
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1431 - 1447
  • [29] Medical SAM adapter: Adapting segment anything model for medical image segmentation
    Wu, Junde
    Wang, Ziyue
    Hong, Mingxuan
    Ji, Wei
    Fu, Huazhu
    Xu, Yanwu
    Xu, Min
    Jin, Yueming
    MEDICAL IMAGE ANALYSIS, 2025, 102
  • [30] A meta-learning based method for segmentation of few-shot magnetic resonance images
    Chen X.
    Fu Z.
    Yao Y.
    Shengwu Yixue Gongchengxue Zazhi/Journal of Biomedical Engineering, 2023, 40 (02): : 193 - 201