SAMPolyBuild: Adapting the Segment Anything Model for polygonal building extraction

被引：2

作者：

Wang, Chenhao ^{[1
,2
]}

Chen, Jingbo ^{[1
]}

Meng, Yu ^{[1
]}

Deng, Yupeng ^{[1
]}

Li, Kai ^{[1
,2
,3
]}

Kong, Yunlong ^{[1
]}

机构：

[1] Chinese Acad Sci, Aerosp Informat Res Inst, 9 Dengzhuang South Rd, Beijing 101408, Peoples R China

[2] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, 1 East Yanqi Lake Rd, Beijing 100049, Peoples R China

[3] City Univ Hong Kong, Sch Data Sci, Hong Kong 999077, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2024年 / 218卷

关键词：

Building extraction; Building vectorization; Foundation model; Instance segmentation; High-resolution remote sensing images; NETWORKS; IMAGERY;

D O I：

10.1016/j.isprsjprs.2024.09.018

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Extracting polygonal buildings from high-resolution remote sensing images is a critical task for large-scale mapping, 3D city modeling, and various geographic information system applications. Traditional methods are often restricted in accurately delineating boundaries and exhibit limited generalizability, which can affect their real-world applicability. The Segment Anything Model (SAM), a promptable segmentation model trained on an unprecedentedly large dataset, demonstrates remarkable generalization ability across various scenarios. In this context, we present SAMPolyBuild, an innovative framework that adapts SAM for polygonal building extraction, allowing for both automatic and prompt-based extraction. To fulfill the requirement for object location prompts in SAM, we developed the Auto Bbox Prompter, which is trained to detect building bounding boxes directly from the image encoder features of the SAM. The boundary precision of the SAM mask results was insufficient for vector polygon extraction, especially when challenged by blurry edges and tree occlusions. Therefore, we extended the SAM decoder with additional parameters to enable multitask learning to predict masks and generate Gaussian vertex and boundary maps simultaneously. Furthermore, we developed a mask- guided vertex connection algorithm to generate the final polygon. Extensive evaluation on the WHU-Mix vector dataset and SpaceNet datasets demonstrate that our method achieves a new state-of-the-art in terms of accuracy and generalizability, significantly improving average precision (AP), average recall (AR), intersection over union (IoU), boundary F1, and vertex F1 metrics. Moreover, by combining the automatic and prompt modes of our framework, we found that 91.2% of the building polygons predicted by SAMPolyBuild on out- of-domain data closely match the quality of manually delineated polygons. The source code is available at https://github.com/wchh-2000/SAMPolyBuild.

引用

页码：707 / 720

页数：14

共 50 条

[21] Segment anything model for medical images?
Huang, Yuhao
Yang, Xin
Liu, Lian
Zhou, Han
Chang, Ao
Zhou, Xinrui
Chen, Rusi
Yu, Junxuan
Chen, Jiongquan
Chen, Chaoyu
Liu, Sijing
Chi, Haozhe
Hu, Xindi
Yue, Kejuan
Li, Lei
Grau, Vicente
Fan, Deng-Ping
Dong, Fajin
Ni, Dong
MEDICAL IMAGE ANALYSIS, 2024, 92
[22] Polygonal Building Extraction by Frame Field Learning
Girard, Nicolas
Smirnov, Dmitriy
Solomon, Justin
Tarabalka, Yuliya
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5887 - 5896
[23] DED-SAM:Adapting Segment Anything Model 2 for Dual Encoder-Decoder Change Detection
Qiu, Junlong
Liu, Wei
Zhang, Xin
Li, Erzhu
Zhang, Lianpeng
Li, Xing
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 995 - 1006
[24] Mammo-SAM: Adapting Foundation Segment Anything Model for Automatic Breast Mass Segmentation in Whole Mammograms
Xiong, Xinyu
Wang, Churan
Li, Wenxue
Li, Guanbin
MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT I, 2024, 14348 : 176 - 185
[25] Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images
Wang, Hualiang
Lin, Yiqun
Ding, Xinpeng
Li, Xiaomeng
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT IX, 2024, 15009 : 636 - 646
[26] EviPrompt: A Training-Free Evidential Prompt Generation Method for Adapting Segment Anything Model in Medical Images
Xu, Yinsong
Tang, Jiaqi
Men, Aidong
Chen, Qingchao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6204 - 6215
[27] Matte anything: Interactive natural image matting with segment anything model
Yao, Jingfeng
Wang, Xinggang
Ye, Lang
Liu, Wenyu
IMAGE AND VISION COMPUTING, 2024, 147
[28] ASS-CD: Adapting Segment Anything Model and Swin-Transformer for Change Detection in Remote Sensing Images
Wei, Chenlong
Wu, Xiaofeng
Wang, Bin
REMOTE SENSING, 2025, 17 (03)
[29] UV-AdaptFormer: adapting the segment anything model for urban village identification from high-resolution satellite imagery
Feng, Wenqing
Guan, Fangli
Tu, Jihui
Xu, Wei
REMOTE SENSING LETTERS, 2025, 16 (06) : 573 - 583
[30] Make Segment Anything Model Perfect on Shadow Detection
Chen, Xiao-Diao
Wu, Wen
Yang, Wenya
Qin, Hongshuai
Wu, Xiantao
Mao, Xiaoyang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61 : 1 - 13

← 1 2 3 4 5 →