Road Extraction by Multiscale Deformable Transformer From Remote Sensing Images

被引:14
|
作者
Hu, Peng-Cheng [1 ]
Chen, Si-Bao [1 ]
Huang, Li-Li [1 ]
Wang, Gui-Zhou [2 ]
Tang, Jin [1 ]
Luo, Bin [1 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, Zenmorn AHU AI Joint Lab, MOE Key Lab ICSP,IMIS Lab Anhui Prov,Anhui Prov Ke, Hefei 230601, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
Multiscale deformable transformer; remote sensing; road extraction; self-attention mechanism;
D O I
10.1109/LGRS.2023.3299985
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Rapid progress has been made in the research of high-resolution remote sensing road extraction tasks in the past years but due to the diversity of road types and the complexity of road context, extracting the perfect road network is still fraught with difficulties and challenges. Many convolutional neural networks (CNNs) based on encoder-decoder structures have demonstrated their effectiveness. Transformer's self-attention mechanism shows more powerful performance than CNNs in modeling global feature dependencies. In this letter, we propose a multiscale deformable transformer network (MDTNet) based on encoder-decoder structure to extract road networks from remote sensing images. The core of MDTNet is our proposed multiscale deformable self-attention (MDSA) mechanism. MDSA can capture more comprehensive features than conventional self-attention. In addition, roads are not present in certain blocks of areas like other objects, but are interwoven throughout the image in such a long, linear fashion that information about certain road segments may be overlooked. To minimize residual errors in road segmentations, our MDSA incorporates a deformable design on feature maps, which effectively enhances the salience of road features relative to their surroundings. Extensive experiments on several public remote sensing road datasets show that our MDTNet achieves higher segmentation [F1 score and intersection over union (IoU)] and connectivity [average path length similarity (APLS)] accuracy, which verifies the effectiveness of our approach.
引用
收藏
页数:5
相关论文
共 50 条
  • [11] A survey of automatic road extraction from remote sensing images
    Wu L.
    Hu Y.-A.
    Zidonghua Xuebao/Acta Automatica Sinica, 2010, 36 (07): : 912 - 922
  • [12] Convolution and Transformer based hybrid neural network for Road Extraction in Remote Sensing Images
    Liu, Shufan
    Wang, Yang
    Wang, Haoqi
    Xiong, Youqiang
    Liu, Yinfeng
    Xie, Chenxi
    2024 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, ICMA 2024, 2024, : 471 - 476
  • [13] Building extraction from remote sensing images via the multiscale information fusion method under the Transformer architecture
    Liu, Yi
    Zhang, Yinjie
    Ao, Yang
    Jiang, Dalong
    Zhang, Zhaorui
    National Remote Sensing Bulletin, 2024, 28 (12) : 3173 - 3183
  • [14] RADANet: Road Augmented Deformable Attention Network for Road Extraction From Complex High-Resolution Remote-Sensing Images
    Dai, Ling
    Zhang, Guangyun
    Zhang, Rongting
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [15] ENHANCE ESSENTIAL FEATURES FOR ROAD EXTRACTION FROM REMOTE SENSING IMAGES
    Zao, Yifan
    Chen, Hao
    Liu, Liqin
    Shi, Zhenwei
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3023 - 3026
  • [16] Road Extraction from Remote Sensing Images Based on Adaptive Morphology
    Fang Yupin
    Wang Xiaopeng
    Li Xinna
    LASER & OPTOELECTRONICS PROGRESS, 2022, 59 (16)
  • [17] Fourier-Deformable Convolution Network for Road Segmentation From Remote Sensing Images
    Liu, Huajun
    Zhou, Xinyu
    Wang, Cailing
    Chen, Suting
    Kong, Hui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [18] Fast extraction of buildings from remote sensing images by fusion of CNN and Transformer
    Zhang Y.
    Guo W.
    Wu C.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (11): : 1700 - 1709
  • [19] PT-RE: Prompt-Based Multimodal Transformer for Road Network Extraction From Remote Sensing Images
    Han, Yuxuan
    Liu, Qingxiao
    Liu, Haiou
    Hu, Xiuzhong
    Wang, Boyang
    IEEE SENSORS JOURNAL, 2024, 24 (21) : 35832 - 35844
  • [20] Road Extraction from Remote Sensing Imagery with Spatial Attention Based on Swin Transformer
    Zhu, Xianhong
    Huang, Xiaohui
    Cao, Weijia
    Yang, Xiaofei
    Zhou, Yunfei
    Wang, Shaokai
    REMOTE SENSING, 2024, 16 (07)