DSMENet: A Road Segmentation Network Based on Dual-Branch Dynamic Snake Convolutional Encoding and Multi-modal Information Iterative Enhancement

Cited by: 0

Authors:

Li, Zhiyang [1]

Pan, Xuran [1]

Yang, Shuhao [1]

Yang, Xinqi [1]

Xu, Kexing [1]

Affiliations:

[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin Econ & Technol Dev Area TEDA, 9 Dishisan Dajie, Tianjin 300457, Peoples R China

Source:

ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024 | 2024 / Vol. 14873

Keywords:

Remote Sensing Image; Multi-Modal; Road Segmentation; Snake Convolution; Information Enhancement;

DOI:

10.1007/978-981-97-5615-5_14

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Road segmentation from remote sensing images plays an important role in basic map data processing and services. However, roads in remote sensing images are long and narrow, have intricate topological structures, and are easily obscured, which makes road segmentation a challenging task in remote sensing image object segmentation. To improve the accuracy and connectivity of road segmentation, this paper proposes a method based on Dual-branch dynamic Snake convolutional encoding and Multi-modal information iterative Enhancement (DSMENet). The multi-modal data are first encoded separately by dual-branch dynamic snake convolution encoders, which adaptively focus on slender and winding local structures and accurately capture the features of tube-like roads. Next, attention-driven fusion of the multi-modal features is performed at different stages of the encoders, and the fused features are input into the decoder for spatial resolution restoration. Finally, a multi-modal information iterative enhancement module is embedded at the end of the network to fully exploit the spatial detail features of the original multi-modal data and enhance the features at the end of the decoder, thereby improving the connectivity of road segmentation. Experimental evaluations on the BJRoad dataset demonstrate that (1) the dynamic snake convolution enables the model to focus on tube-like roads effectively, resulting in a significant reduction in false alarms and an improvement in road segmentation accuracy; and (2) the multi-modal information iterative enhancement module provides supplementary spatial detail information to the road segmentation results, mitigating the effects of shadow occlusions and enhancing the connectivity of road segmentation.

Pages: 168-179

Page count: 12

26 records in total

[11] Multi-modal dataset and fusion network for simultaneous semantic segmentation of on-road dynamic objects
Cho, Jieun
Ha, Jinsu
Song, Hamin
Jang, Sungmoon
Jo, Kichun
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143
[12] Multi-modal fusion of satellite and street-view images for urban village classification based on a dual-branch deep neural network
Chen, Boan
Feng, Quanlong
Niu, Bowen
Yan, Fengqin
Gao, Bingbo
Yang, Jianyu
Gong, Jianhua
Liu, Jiantao
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2022, 109
[13] A novel intelligent bearing fault diagnosis method based on VMD denoising and dual-branch multi-modal feature fusion
Li, Youjia
Zhang, Zhongwei
Jiao, Zonghao
Shao, Mingyu
Dai, Xiangjun
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2025,
[14] A multi-modal and multi-stage fusion enhancement network for segmentation based on OCT and OCTA images
Quan, Xiongwen
Hou, Guangyao
Yin, Wenya
Zhang, Han
INFORMATION FUSION, 2025, 113
[15] A double-branch convolutional neural network model for species identification based on multi-modal data
Sun, Yuxin
Tian, Ye
Zhang, Yiyi
Yu, Mengting
Su, Xiaoquan
Wang, Qi
Guo, Jinjia
Lu, Yuan
Ren, Lihui
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 318
[16] An epilepsy detection method based on multi-dimensional feature extraction and dual-branch hypergraph convolutional network
Liu, Jiacen
Yang, Yong
Li, Feng
Luo, Jing
FRONTIERS IN PHYSIOLOGY, 2024, 15
[17] Brain tumor segmentation based on the dual-path network of multi-modal MRI images
Fang, Lingling
Wang, Xin
PATTERN RECOGNITION, 2022, 124
[18] MEDICAL IMAGE SEGMENTATION BASED ON MULTI-MODAL CONVOLUTIONAL NEURAL NETWORK: STUDY ON IMAGE FUSION SCHEMES
Guo, Zhe
Li, Xiang
Huang, Heng
Guo, Ning
Li, Quanzheng
2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 903 - 907
[19] Sound event detection in traffic scenes based on graph convolutional network to obtain multi-modal information
Jiang, Yanji
Guo, Dingxu
Wang, Lan
Zhang, Haitao
Dong, Hao
Qiu, Youli
Zou, Huiwen
COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (04) : 5653 - 5668
[20] Dual-attention transformer-based hybrid network for multi-modal medical image segmentation
Zhang, Menghui
Zhang, Yuchen
Liu, Shuaibing
Han, Yahui
Cao, Honggang
Qiao, Bingbing
SCIENTIFIC REPORTS, 2024, 14 (01):