Extracting polygonal buildings from high-resolution remote sensing images is a critical task for large-scale mapping, 3D city modeling, and various geographic information system applications. Traditional methods often struggle to delineate boundaries accurately and exhibit limited generalizability, which restricts their real-world applicability. The Segment Anything Model (SAM), a promptable segmentation model trained on an unprecedentedly large dataset, demonstrates remarkable generalization ability across diverse scenarios. In this context, we present SAMPolyBuild, an innovative framework that adapts SAM for polygonal building extraction, supporting both automatic and prompt-based extraction. To satisfy SAM's requirement for object location prompts, we developed the Auto Bbox Prompter, which is trained to detect building bounding boxes directly from the features of SAM's image encoder. The boundary precision of SAM's mask output is insufficient for vector polygon extraction, especially under blurry edges and tree occlusions. We therefore extended the SAM decoder with additional parameters, enabling multitask learning that predicts masks and generates Gaussian vertex and boundary maps simultaneously. Furthermore, we developed a mask-guided vertex connection algorithm to generate the final polygon. Extensive evaluations on the WHU-Mix vector and SpaceNet datasets demonstrate that our method achieves a new state of the art in both accuracy and generalizability, significantly improving average precision (AP), average recall (AR), intersection over union (IoU), boundary F1, and vertex F1 metrics. Moreover, by combining the automatic and prompt modes of our framework, we found that 91.2% of the building polygons predicted by SAMPolyBuild on out-of-domain data closely match the quality of manually delineated polygons. The source code is available at https://github.com/wchh-2000/SAMPolyBuild.
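To make the Gaussian vertex map supervision concrete, the following is a minimal sketch of how such a target could be rendered from polygon vertices. The function name, kernel width, and per-pixel max aggregation are illustrative assumptions, not the paper's exact implementation:

```python
import numpy as np

def gaussian_vertex_map(vertices, shape, sigma=2.0):
    """Render a heatmap with one Gaussian peak per polygon vertex.

    Illustrative sketch only: SAMPolyBuild's actual kernel size and
    normalization may differ from this simple unnormalized Gaussian.
    """
    h, w = shape
    ys, xs = np.mgrid[0:h, 0:w]
    heatmap = np.zeros(shape, dtype=np.float32)
    for (x, y) in vertices:
        g = np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2.0 * sigma ** 2))
        # Take the per-pixel maximum so nearby vertices keep distinct peaks
        heatmap = np.maximum(heatmap, g.astype(np.float32))
    return heatmap

# Four corners of a square building footprint in a 32x32 patch
hm = gaussian_vertex_map([(8, 8), (24, 8), (24, 24), (8, 24)], (32, 32))
```

A boundary map can be built analogously by rasterizing the polygon edges and blurring them with the same Gaussian kernel.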