SAMPolyBuild: Adapting the Segment Anything Model for polygonal building extraction

被引:2
|
作者
Wang, Chenhao [1 ,2 ]
Chen, Jingbo [1 ]
Meng, Yu [1 ]
Deng, Yupeng [1 ]
Li, Kai [1 ,2 ,3 ]
Kong, Yunlong [1 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, 9 Dengzhuang South Rd, Beijing 101408, Peoples R China
[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, 1 East Yanqi Lake Rd, Beijing 100049, Peoples R China
[3] City Univ Hong Kong, Sch Data Sci, Hong Kong 999077, Peoples R China
关键词
Building extraction; Building vectorization; Foundation model; Instance segmentation; High-resolution remote sensing images; NETWORKS; IMAGERY;
D O I
10.1016/j.isprsjprs.2024.09.018
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Extracting polygonal buildings from high-resolution remote sensing images is a critical task for large-scale mapping, 3D city modeling, and various geographic information system applications. Traditional methods are often restricted in accurately delineating boundaries and exhibit limited generalizability, which can affect their real-world applicability. The Segment Anything Model (SAM), a promptable segmentation model trained on an unprecedentedly large dataset, demonstrates remarkable generalization ability across various scenarios. In this context, we present SAMPolyBuild, an innovative framework that adapts SAM for polygonal building extraction, allowing for both automatic and prompt-based extraction. To fulfill the requirement for object location prompts in SAM, we developed the Auto Bbox Prompter, which is trained to detect building bounding boxes directly from the image encoder features of the SAM. The boundary precision of the SAM mask results was insufficient for vector polygon extraction, especially when challenged by blurry edges and tree occlusions. Therefore, we extended the SAM decoder with additional parameters to enable multitask learning to predict masks and generate Gaussian vertex and boundary maps simultaneously. Furthermore, we developed a mask- guided vertex connection algorithm to generate the final polygon. Extensive evaluation on the WHU-Mix vector dataset and SpaceNet datasets demonstrate that our method achieves a new state-of-the-art in terms of accuracy and generalizability, significantly improving average precision (AP), average recall (AR), intersection over union (IoU), boundary F1, and vertex F1 metrics. Moreover, by combining the automatic and prompt modes of our framework, we found that 91.2% of the building polygons predicted by SAMPolyBuild on out- of-domain data closely match the quality of manually delineated polygons. The source code is available at https://github.com/wchh-2000/SAMPolyBuild.
引用
收藏
页码:707 / 720
页数:14
相关论文
共 50 条
  • [1] SAMP: Adapting Segment Anything Model for Pose Estimation
    Zhu, Zhihang
    Yan, Yunfeng
    Chen, Yi
    Jin, Haoyuan
    Nie, Xuesong
    Qi, Donglian
    Chen, Xi
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME 2024, 2024,
  • [2] Adapting Segment Anything Model (SAM) for Retinal OCT
    Fazekas, Botond
    Morano, Jose
    Lachinov, Dmitrii
    Aresta, Guilherme
    Bogunovic, Hrvoje
    OPHTHALMIC MEDICAL IMAGE ANALYSIS, OMIA 2023, 2023, 14096 : 92 - 101
  • [3] Adapting Segment Anything Model for self-supervised monocular depth estimation
    Zhang, Dongdong
    Wang, Chunping
    Fu, Qiang
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (06)
  • [4] Adapting Segment Anything Model for Change Detection in VHR Remote Sensing Images
    Ding, Lei
    Zhu, Kun
    Peng, Daifeng
    Tang, Hao
    Yang, Kuiwu
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11
  • [5] UV-SAM: Adapting Segment Anything Model for Urban Village Identification
    Zhang, Xin
    Liu, Yu
    Lin, Yuming
    Liao, Qingmin
    Li, Yong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 20, 2024, : 22520 - 22528
  • [6] WebSAM-Adapter: Adapting Segment Anything Model for Web Page Segmentation
    Ren, Bowen
    Qian, Zefeng
    Sun, Yuchen
    Gao, Chao
    Zhang, Chongyang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 439 - 454
  • [7] ShadowAdapter: Adapting Segment Anything Model with Auto-Prompt for shadow detection
    Jie, Leiping
    Zhang, Hui
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [8] Medical SAM adapter: Adapting segment anything model for medical image segmentation
    Wu, Junde
    Wang, Ziyue
    Hong, Mingxuan
    Ji, Wei
    Fu, Huazhu
    Xu, Yanwu
    Xu, Min
    Jin, Yueming
    MEDICAL IMAGE ANALYSIS, 2025, 102
  • [9] Adapting the Segment Anything Model for Plant Recognition and Automated Phenotypic Parameter Measurement
    Zhang, Wenqi
    Dang, L. Minh
    Nguyen, Le Quan
    Alam, Nur
    Bui, Ngoc Dung
    Park, Han Yong
    Moon, Hyeonjoon
    HORTICULTURAE, 2024, 10 (04)
  • [10] Adapting the segment anything model for multi-modal retinal anomaly detection and localization
    Li, Jingtao
    Chen, Ting
    Wang, Xinyu
    Zhong, Yanfei
    Xiao, Xuan
    INFORMATION FUSION, 2025, 113