Road Extraction by Multiscale Deformable Transformer From Remote Sensing Images

Cited by: 14
Authors
Hu, Peng-Cheng [1]
Chen, Si-Bao [1]
Huang, Li-Li [1]
Wang, Gui-Zhou [2]
Tang, Jin [1]
Luo, Bin [1]
Affiliations
[1] Anhui Univ, Sch Comp Sci & Technol, Zenmorn AHU AI Joint Lab, MOE Key Lab ICSP, IMIS Lab Anhui Prov, Anhui Prov Ke, Hefei 230601, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multiscale deformable transformer; remote sensing; road extraction; self-attention mechanism
DOI
10.1109/LGRS.2023.3299985
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry]
Discipline classification code
0708; 070902
Abstract
Research on road extraction from high-resolution remote sensing images has progressed rapidly in recent years, but the diversity of road types and the complexity of road context still make extracting a perfect road network difficult. Many convolutional neural networks (CNNs) based on encoder-decoder structures have demonstrated their effectiveness, while the transformer's self-attention mechanism models global feature dependencies more powerfully than CNNs. In this letter, we propose a multiscale deformable transformer network (MDTNet), based on an encoder-decoder structure, to extract road networks from remote sensing images. The core of MDTNet is our proposed multiscale deformable self-attention (MDSA) mechanism, which can capture more comprehensive features than conventional self-attention. In addition, unlike other objects, roads are not confined to particular blocks of an image but are interwoven throughout it in long, linear structures, so information about some road segments may be overlooked. To minimize residual errors in road segmentation, MDSA incorporates a deformable design on feature maps, which effectively enhances the salience of road features relative to their surroundings. Extensive experiments on several public remote sensing road datasets show that MDTNet achieves higher segmentation accuracy [F1 score and intersection over union (IoU)] and connectivity accuracy [average path length similarity (APLS)], which verifies the effectiveness of our approach.
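For readers unfamiliar with the two segmentation metrics named in the abstract, the following is a minimal sketch of how F1 score and IoU are commonly computed for a binary road mask. It is not taken from the letter: the function name `f1_and_iou`, the toy masks, and the convention of returning 1.0 when both masks are empty are assumptions made for illustration. APLS, which compares shortest-path lengths on road graphs, is not covered here.

```python
def f1_and_iou(pred, truth):
    """Compute pixel-wise F1 and IoU for two binary masks.

    pred, truth: 2D lists of 0/1 with the same shape (1 = road pixel).
    Hypothetical helper for illustration; not the authors' evaluation code.
    """
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1          # road predicted, road present
            elif p and not t:
                fp += 1          # road predicted, background present
            elif t and not p:
                fn += 1          # road missed
    # Assumed convention: two empty masks count as a perfect match.
    f1_den = 2 * tp + fp + fn
    iou_den = tp + fp + fn
    f1 = 2 * tp / f1_den if f1_den else 1.0
    iou = tp / iou_den if iou_den else 1.0
    return f1, iou


# Toy 3x3 example: 2 true positives, 1 false positive, 1 false negative.
pred = [[1, 1, 0],
        [0, 1, 0],
        [0, 0, 0]]
truth = [[1, 0, 0],
         [0, 1, 0],
         [0, 1, 0]]

f1, iou = f1_and_iou(pred, truth)
print(f"F1 = {f1:.4f}, IoU = {iou:.4f}")
```

Both metrics are built from the same true-positive, false-positive, and false-negative counts, so F1 is never lower than IoU on the same prediction.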
Pages: 5