DSMENet: A Road Segmentation Network Based on Dual-Branch Dynamic Snake Convolutional Encoding and Multi-modal Information Iterative Enhancement

被引：0

作者：

Li, Zhiyang ^{[1
]}

Pan, Xuran ^{[1
]}

Yang, Shuhao ^{[1
]}

Yang, Xinqi ^{[1
]}

Xu, Kexing ^{[1
]}

机构：

[1] Tianjin Univ Sci & Technol, Coll Artificial Intelligence, Tianjin Econ & Technol Dev Area TEDA, 9 Dishisan Dajie, Tianjin 300457, Peoples R China

来源：

ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024 | 2024年 / 14873卷

关键词：

Remote Sensing Image; Multi-Modal; Road Segmentation; Snake Convolution; Information Enhancement;

D O I：

10.1007/978-981-97-5615-5_14

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Road segmentation from remote sensing images plays an important role in basic map data processing and services. However, roads in remote sensing images are characterized by long and narrow spans, intricate topological structures and being easily obscured, making road segmentation a challenging task in the field of re-mote sensing image object segmentation. To improve the accuracy and connectivity of road segmentation, this paper proposes a method based on Dual-branch dynamic Snake convolutional encoding and Multi-modal information iterative Enhancement (DSMENet). The multi-modal data are first encoded separately by dual-branch dynamic snake convolution encoders to adaptively focus on slender and winding local structures, accurately capturing the features of tube-like roads; next, attention driven feature fusion of multi-modal features are performed at different stages of the encoders, which are then input into the decoder for spatial resolution restoration. Finally, a multi-modal information iterative enhancement module is embedded at the end of the network to fully exploit spatial detail features of original multi-modal data and enhance the features at the end of the de-coder, thereby improving the connectivity of road segmentation. Experimental evaluations on the BJRoad dataset demonstrate that (1) The dynamic snake convolution enables the model to focus on tube-like roads effectively, resulting in a significant reduction in false alarms and an improvement in road segmentation accuracy. (2) The multi-modal information iterative enhancement module can provide supplementary spatial detail information to the road segmentation results, mitigating the effects of shadow occlusions and enhancing the connectivity of road segmentation.

引用

页码：168 / 179

页数：12

共 26 条

[1] Multi-modal remote sensing image segmentation based on attention-driven dual-branch encoding framework
Li, Zhiyang
Pan, Xuran
Xu, Kexing
Yang, Xinqi
JOURNAL OF APPLIED REMOTE SENSING, 2024, 18 (02)
[2] DBTrans: A Dual-Branch Vision Transformer for Multi-Modal Brain Tumor Segmentation
Zeng, Xinyi
Zeng, Pinxian
Tang, Cheng
Wang, Peng
Yan, Binyu
Wang, Yan
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT IV, 2023, 14223 : 502 - 512
[3] DBGAN: Dual-Branch Generative Adversarial Network for Multi-Modal MRI Translation
Lyu, Jun
Yan, Shouang
Hossain, M. Shamim
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (08) : 1 - 22
[4] Dual-branch crack segmentation network with multi-shape kernel based on convolutional neural network and Mamba
Zhang, Jianming
Li, Dianwen
Zeng, Zhigao
Zhang, Rui
Wang, Jin
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 150
[5] Dual-Branch Dynamic Object Segmentation Network Based on Spatio-Temporal Information Fusion
Huang, Fei
Wang, Zhiwen
Zheng, Yu
Wang, Qi
Hao, Bingsen
Xiang, Yangkai
ELECTRONICS, 2024, 13 (20)
[6] Dual-branch multi-modal convergence network for crater detection using Chang'e image
Lin, Feng
Hu, Xie
Lin, Yiling
Li, Yao
Liu, Yang
Li, Dongmei
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 134
[7] Dual-branch multi-information aggregation network with transformer and convolution for polyp segmentation
Zhang, Wenyu
Lu, Fuxiang
Su, Hongjing
Hu, Yawen
COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 168
[8] DMFusion: A dual-branch multi-scale feature fusion network for medical multi-modal image fusion
Ma, Gengchen
Qiu, Xihe
Tan, Xiaoyu
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
[9] Multi-Modal Object Detection Method Based on Dual-Branch Asymmetric Attention Backbone and Feature Fusion Pyramid Network
Wang, Jinpeng
Su, Nan
Zhao, Chunhui
Yan, Yiming
Feng, Shou
REMOTE SENSING, 2024, 16 (20)
[10] Cerebral aneurysm image segmentation based on multi-modal convolutional neural network
Meng, Chengjie
Yang, Debiao
Chen, Dan
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208

← 1 2 3 →