DSMENet: A Road Segmentation Network Based on Dual-Branch Dynamic Snake Convolutional Encoding and Multi-modal Information Iterative Enhancement

被引:0
|
作者
Li, Zhiyang [1 ]
Pan, Xuran [1 ]
Yang, Shuhao [1 ]
Yang, Xinqi [1 ]
Xu, Kexing [1 ]
机构
[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin Econ & Technol Dev Area TEDA, 9 Dishisan Dajie, Tianjin 300457, Peoples R China
关键词
Remote Sensing Image; Multi-Modal; Road Segmentation; Snake Convolution; Information Enhancement;
D O I
10.1007/978-981-97-5615-5_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Road segmentation from remote sensing images plays an important role in basic map data processing and services. However, roads in remote sensing images are characterized by long and narrow spans, intricate topological structures and being easily obscured, making road segmentation a challenging task in the field of re-mote sensing image object segmentation. To improve the accuracy and connectivity of road segmentation, this paper proposes a method based on Dual-branch dynamic Snake convolutional encoding and Multi-modal information iterative Enhancement (DSMENet). The multi-modal data are first encoded separately by dual-branch dynamic snake convolution encoders to adaptively focus on slender and winding local structures, accurately capturing the features of tube-like roads; next, attention driven feature fusion of multi-modal features are performed at different stages of the encoders, which are then input into the decoder for spatial resolution restoration. Finally, a multi-modal information iterative enhancement module is embedded at the end of the network to fully exploit spatial detail features of original multi-modal data and enhance the features at the end of the de-coder, thereby improving the connectivity of road segmentation. Experimental evaluations on the BJRoad dataset demonstrate that (1) The dynamic snake convolution enables the model to focus on tube-like roads effectively, resulting in a significant reduction in false alarms and an improvement in road segmentation accuracy. (2) The multi-modal information iterative enhancement module can provide supplementary spatial detail information to the road segmentation results, mitigating the effects of shadow occlusions and enhancing the connectivity of road segmentation.
引用
收藏
页码:168 / 179
页数:12
相关论文
共 26 条
  • [11] Multi-modal dataset and fusion network for simultaneous semantic segmentation of on-road dynamic objects
    Cho, Jieun
    Ha, Jinsu
    Song, Hamin
    Jang, Sungmoon
    Jo, Kichun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143
  • [12] Multi-modal fusion of satellite and street-view images for urban village classification based on a dual-branch deep neural network
    Chen, Boan
    Feng, Quanlong
    Niu, Bowen
    Yan, Fengqin
    Gao, Bingbo
    Yang, Jianyu
    Gong, Jianhua
    Liu, Jiantao
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 109
  • [13] A novel intelligent bearing fault diagnosis method based on VMD denoising and dual-branch multi-modal feature fusion
    Li, Youjia
    Zhang, Zhongwei
    Jiao, Zonghao
    Shao, Mingyu
    Dai, Xiangjun
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025,
  • [14] A multi-modal and multi-stage fusion enhancement network for segmentation based on OCT and OCTA images
    Quan, Xiongwen
    Hou, Guangyao
    Yin, Wenya
    Zhang, Han
    INFORMATION FUSION, 2025, 113
  • [15] A double-branch convolutional neural network model for species identification based on multi-modal data
    Sun, Yuxin
    Tian, Ye
    Zhang, Yiyi
    Yu, Mengting
    Su, Xiaoquan
    Wang, Qi
    Guo, Jinjia
    Lu, Yuan
    Ren, Lihui
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 318
  • [16] An epilepsy detection method based on multi-dimensional feature extraction and dual-branch hypergraph convolutional network
    Liu, Jiacen
    Yang, Yong
    Li, Feng
    Luo, Jing
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [17] Brain tumor segmentation based on the dual-path network of multi-modal MRI images
    Fang, Lingling
    Wang, Xin
    PATTERN RECOGNITION, 2022, 124
  • [18] MEDICAL IMAGE SEGMENTATION BASED ON MULTI-MODAL CONVOLUTIONAL NEURAL NETWORK: STUDY ON IMAGE FUSION SCHEMES
    Guo, Zhe
    Li, Xiang
    Huang, Heng
    Guo, Ning
    Li, Quanzheng
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 903 - 907
  • [19] Sound event detection in traffic scenes based on graph convolutional network to obtain multi-modal information
    Jiang, Yanji
    Guo, Dingxu
    Wang, Lan
    Zhang, Haitao
    Dong, Hao
    Qiu, Youli
    Zou, Huiwen
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5653 - 5668
  • [20] Dual-attention transformer-based hybrid network for multi-modal medical image segmentation
    Zhang, Menghui
    Zhang, Yuchen
    Liu, Shuaibing
    Han, Yahui
    Cao, Honggang
    Qiao, Bingbing
    SCIENTIFIC REPORTS, 2024, 14 (01):