SAMPolyBuild: Adapting the Segment Anything Model for polygonal building extraction

被引:2
|
作者
Wang, Chenhao [1 ,2 ]
Chen, Jingbo [1 ]
Meng, Yu [1 ]
Deng, Yupeng [1 ]
Li, Kai [1 ,2 ,3 ]
Kong, Yunlong [1 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, 9 Dengzhuang South Rd, Beijing 101408, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, 1 East Yanqi Lake Rd, Beijing 100049, Peoples R China
[3] City Univ Hong Kong, Sch Data Sci, Hong Kong 999077, Peoples R China
关键词
Building extraction; Building vectorization; Foundation model; Instance segmentation; High-resolution remote sensing images; NETWORKS; IMAGERY;
D O I
10.1016/j.isprsjprs.2024.09.018
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Extracting polygonal buildings from high-resolution remote sensing images is a critical task for large-scale mapping, 3D city modeling, and various geographic information system applications. Traditional methods are often restricted in accurately delineating boundaries and exhibit limited generalizability, which can affect their real-world applicability. The Segment Anything Model (SAM), a promptable segmentation model trained on an unprecedentedly large dataset, demonstrates remarkable generalization ability across various scenarios. In this context, we present SAMPolyBuild, an innovative framework that adapts SAM for polygonal building extraction, allowing for both automatic and prompt-based extraction. To fulfill the requirement for object location prompts in SAM, we developed the Auto Bbox Prompter, which is trained to detect building bounding boxes directly from the image encoder features of the SAM. The boundary precision of the SAM mask results was insufficient for vector polygon extraction, especially when challenged by blurry edges and tree occlusions. Therefore, we extended the SAM decoder with additional parameters to enable multitask learning to predict masks and generate Gaussian vertex and boundary maps simultaneously. Furthermore, we developed a mask- guided vertex connection algorithm to generate the final polygon. Extensive evaluation on the WHU-Mix vector dataset and SpaceNet datasets demonstrate that our method achieves a new state-of-the-art in terms of accuracy and generalizability, significantly improving average precision (AP), average recall (AR), intersection over union (IoU), boundary F1, and vertex F1 metrics. Moreover, by combining the automatic and prompt modes of our framework, we found that 91.2% of the building polygons predicted by SAMPolyBuild on out- of-domain data closely match the quality of manually delineated polygons. The source code is available at https://github.com/wchh-2000/SAMPolyBuild.
引用
收藏
页码:707 / 720
页数:14
相关论文
共 50 条
  • [41] Segment Anything Model Is a Good Teacher for Local Feature Learning
    Wu, Jingqian
    Xu, Rongtao
    Wood-Doughty, Zach
    Wang, Changwei
    Xu, Shibiao
    Lam, Edmund Y.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 2097 - 2111
  • [42] Segment Shards: Cross-Prompt Adversarial Attacks against the Segment Anything Model
    Huang, Shize
    Fan, Qianhui
    Zhang, Zhaoxin
    Liu, Xiaowen
    Song, Guanqun
    Qin, Jinzhe
    APPLIED SCIENCES-BASEL, 2024, 14 (08):
  • [43] LeSAM: Adapt Segment Anything Model for Medical Lesion Segmentation
    Gu, Yunbo
    Wu, Qianyu
    Tang, Hui
    Mai, Xiaoli
    Shu, Huazhong
    Li, Baosheng
    Chen, Yang
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) : 6031 - 6041
  • [44] Exploring Semantic Prompts in the Segment Anything Model for Domain Adaptation
    Wang, Ziquan
    Zhang, Yongsheng
    Zhang, Zhenchao
    Jiang, Zhipeng
    Yu, Ying
    Li, Li
    Li, Lei
    REMOTE SENSING, 2024, 16 (05)
  • [45] Segment anything model for medical image analysis: An experimental study
    Mazurowski, Maciej A.
    Dong, Haoyu
    Gu, Hanxue
    Yang, Jichen
    Konz, Nicholas
    Zhang, Yixin
    MEDICAL IMAGE ANALYSIS, 2023, 89
  • [46] The Segment Anything Model (SAM) for accelerating the smart farming revolution
    Carraro, Alberto
    Sozzi, Marco
    Marinello, Francesco
    SMART AGRICULTURAL TECHNOLOGY, 2023, 6
  • [47] EVALUATION OF THE SEGMENT ANYTHING MODEL (SAM) FOR BRAIN TUMOR SEGMENTATION
    Peivandi, Mohammad
    Zhang, Jason
    Lu, Michael
    Li, Chengyin
    Zhu, Dongxiao
    Kou, Zhifeng
    IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI 2024, 2024,
  • [48] Multi-LoRA Fine-Tuned Segment Anything Model for Urban Man-Made Object Extraction
    Lu, Xiaoyan
    Weng, Qihao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [49] PolyWorld: Polygonal Building Extraction with Graph Neural Networks in Satellite Images
    Zorzi, Stefano
    Bazrafkan, Shabab
    Habenschuss, Stefan
    Fraundorfer, Friedrich
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1838 - 1847
  • [50] Superpixel-Guided Segment Anything Model for Liver Tumor Segmentation with Couinaud Segment Prompt
    Lyu, Fei
    Xu, Jingwen
    Zhu, Ye
    Wong, Grace Lai-Hung
    Yuen, Pong C.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 678 - 688