SAMPolyBuild: Adapting the Segment Anything Model for polygonal building extraction

被引:2
|
作者
Wang, Chenhao [1 ,2 ]
Chen, Jingbo [1 ]
Meng, Yu [1 ]
Deng, Yupeng [1 ]
Li, Kai [1 ,2 ,3 ]
Kong, Yunlong [1 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, 9 Dengzhuang South Rd, Beijing 101408, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, 1 East Yanqi Lake Rd, Beijing 100049, Peoples R China
[3] City Univ Hong Kong, Sch Data Sci, Hong Kong 999077, Peoples R China
关键词
Building extraction; Building vectorization; Foundation model; Instance segmentation; High-resolution remote sensing images; NETWORKS; IMAGERY;
D O I
10.1016/j.isprsjprs.2024.09.018
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Extracting polygonal buildings from high-resolution remote sensing images is a critical task for large-scale mapping, 3D city modeling, and various geographic information system applications. Traditional methods are often restricted in accurately delineating boundaries and exhibit limited generalizability, which can affect their real-world applicability. The Segment Anything Model (SAM), a promptable segmentation model trained on an unprecedentedly large dataset, demonstrates remarkable generalization ability across various scenarios. In this context, we present SAMPolyBuild, an innovative framework that adapts SAM for polygonal building extraction, allowing for both automatic and prompt-based extraction. To fulfill the requirement for object location prompts in SAM, we developed the Auto Bbox Prompter, which is trained to detect building bounding boxes directly from the image encoder features of the SAM. The boundary precision of the SAM mask results was insufficient for vector polygon extraction, especially when challenged by blurry edges and tree occlusions. Therefore, we extended the SAM decoder with additional parameters to enable multitask learning to predict masks and generate Gaussian vertex and boundary maps simultaneously. Furthermore, we developed a mask- guided vertex connection algorithm to generate the final polygon. Extensive evaluation on the WHU-Mix vector dataset and SpaceNet datasets demonstrate that our method achieves a new state-of-the-art in terms of accuracy and generalizability, significantly improving average precision (AP), average recall (AR), intersection over union (IoU), boundary F1, and vertex F1 metrics. Moreover, by combining the automatic and prompt modes of our framework, we found that 91.2% of the building polygons predicted by SAMPolyBuild on out- of-domain data closely match the quality of manually delineated polygons. The source code is available at https://github.com/wchh-2000/SAMPolyBuild.
引用
收藏
页码:707 / 720
页数:14
相关论文
共 50 条
  • [21] Segment anything model for medical images?
    Huang, Yuhao
    Yang, Xin
    Liu, Lian
    Zhou, Han
    Chang, Ao
    Zhou, Xinrui
    Chen, Rusi
    Yu, Junxuan
    Chen, Jiongquan
    Chen, Chaoyu
    Liu, Sijing
    Chi, Haozhe
    Hu, Xindi
    Yue, Kejuan
    Li, Lei
    Grau, Vicente
    Fan, Deng-Ping
    Dong, Fajin
    Ni, Dong
    MEDICAL IMAGE ANALYSIS, 2024, 92
  • [22] Polygonal Building Extraction by Frame Field Learning
    Girard, Nicolas
    Smirnov, Dmitriy
    Solomon, Justin
    Tarabalka, Yuliya
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5887 - 5896
  • [23] DED-SAM:Adapting Segment Anything Model 2 for Dual Encoder-Decoder Change Detection
    Qiu, Junlong
    Liu, Wei
    Zhang, Xin
    Li, Erzhu
    Zhang, Lianpeng
    Li, Xing
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 995 - 1006
  • [24] Mammo-SAM: Adapting Foundation Segment Anything Model for Automatic Breast Mass Segmentation in Whole Mammograms
    Xiong, Xinyu
    Wang, Churan
    Li, Wenxue
    Li, Guanbin
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 176 - 185
  • [25] Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images
    Wang, Hualiang
    Lin, Yiqun
    Ding, Xinpeng
    Li, Xiaomeng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 636 - 646
  • [26] EviPrompt: A Training-Free Evidential Prompt Generation Method for Adapting Segment Anything Model in Medical Images
    Xu, Yinsong
    Tang, Jiaqi
    Men, Aidong
    Chen, Qingchao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6204 - 6215
  • [27] Matte anything: Interactive natural image matting with segment anything model
    Yao, Jingfeng
    Wang, Xinggang
    Ye, Lang
    Liu, Wenyu
    IMAGE AND VISION COMPUTING, 2024, 147
  • [28] ASS-CD: Adapting Segment Anything Model and Swin-Transformer for Change Detection in Remote Sensing Images
    Wei, Chenlong
    Wu, Xiaofeng
    Wang, Bin
    REMOTE SENSING, 2025, 17 (03)
  • [29] UV-AdaptFormer: adapting the segment anything model for urban village identification from high-resolution satellite imagery
    Feng, Wenqing
    Guan, Fangli
    Tu, Jihui
    Xu, Wei
    REMOTE SENSING LETTERS, 2025, 16 (06) : 573 - 583
  • [30] Make Segment Anything Model Perfect on Shadow Detection
    Chen, Xiao-Diao
    Wu, Wen
    Yang, Wenya
    Qin, Hongshuai
    Wu, Xiantao
    Mao, Xiaoyang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 13