DSMENet: A Road Segmentation Network Based on Dual-Branch Dynamic Snake Convolutional Encoding and Multi-modal Information Iterative Enhancement

被引:0
|
作者
Li, Zhiyang [1 ]
Pan, Xuran [1 ]
Yang, Shuhao [1 ]
Yang, Xinqi [1 ]
Xu, Kexing [1 ]
机构
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin Econ & Technol Dev Area TEDA, 9 Dishisan Dajie, Tianjin 300457, Peoples R China
关键词
Remote Sensing Image; Multi-Modal; Road Segmentation; Snake Convolution; Information Enhancement;
D O I
10.1007/978-981-97-5615-5_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Road segmentation from remote sensing images plays an important role in basic map data processing and services. However, roads in remote sensing images are characterized by long and narrow spans, intricate topological structures and being easily obscured, making road segmentation a challenging task in the field of re-mote sensing image object segmentation. To improve the accuracy and connectivity of road segmentation, this paper proposes a method based on Dual-branch dynamic Snake convolutional encoding and Multi-modal information iterative Enhancement (DSMENet). The multi-modal data are first encoded separately by dual-branch dynamic snake convolution encoders to adaptively focus on slender and winding local structures, accurately capturing the features of tube-like roads; next, attention driven feature fusion of multi-modal features are performed at different stages of the encoders, which are then input into the decoder for spatial resolution restoration. Finally, a multi-modal information iterative enhancement module is embedded at the end of the network to fully exploit spatial detail features of original multi-modal data and enhance the features at the end of the de-coder, thereby improving the connectivity of road segmentation. Experimental evaluations on the BJRoad dataset demonstrate that (1) The dynamic snake convolution enables the model to focus on tube-like roads effectively, resulting in a significant reduction in false alarms and an improvement in road segmentation accuracy. (2) The multi-modal information iterative enhancement module can provide supplementary spatial detail information to the road segmentation results, mitigating the effects of shadow occlusions and enhancing the connectivity of road segmentation.
引用
收藏
页码:168 / 179
页数:12
相关论文
共 26 条
  • [1] Multi-modal remote sensing image segmentation based on attention-driven dual-branch encoding framework
    Li, Zhiyang
    Pan, Xuran
    Xu, Kexing
    Yang, Xinqi
    JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (02)
  • [2] DBTrans: A Dual-Branch Vision Transformer for Multi-Modal Brain Tumor Segmentation
    Zeng, Xinyi
    Zeng, Pinxian
    Tang, Cheng
    Wang, Peng
    Yan, Binyu
    Wang, Yan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 502 - 512
  • [3] DBGAN: Dual-Branch Generative Adversarial Network for Multi-Modal MRI Translation
    Lyu, Jun
    Yan, Shouang
    Hossain, M. Shamim
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08) : 1 - 22
  • [4] Dual-branch crack segmentation network with multi-shape kernel based on convolutional neural network and Mamba
    Zhang, Jianming
    Li, Dianwen
    Zeng, Zhigao
    Zhang, Rui
    Wang, Jin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 150
  • [5] Dual-Branch Dynamic Object Segmentation Network Based on Spatio-Temporal Information Fusion
    Huang, Fei
    Wang, Zhiwen
    Zheng, Yu
    Wang, Qi
    Hao, Bingsen
    Xiang, Yangkai
    ELECTRONICS, 2024, 13 (20)
  • [6] Dual-branch multi-modal convergence network for crater detection using Chang'e image
    Lin, Feng
    Hu, Xie
    Lin, Yiling
    Li, Yao
    Liu, Yang
    Li, Dongmei
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 134
  • [7] Dual-branch multi-information aggregation network with transformer and convolution for polyp segmentation
    Zhang, Wenyu
    Lu, Fuxiang
    Su, Hongjing
    Hu, Yawen
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
  • [8] DMFusion: A dual-branch multi-scale feature fusion network for medical multi-modal image fusion
    Ma, Gengchen
    Qiu, Xihe
    Tan, Xiaoyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
  • [9] Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network
    Wang, Jinpeng
    Su, Nan
    Zhao, Chunhui
    Yan, Yiming
    Feng, Shou
    REMOTE SENSING, 2024, 16 (20)
  • [10] Cerebral aneurysm image segmentation based on multi-modal convolutional neural network
    Meng, Chengjie
    Yang, Debiao
    Chen, Dan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208