Remote Sensing Image Road Segmentation Method Integrating CNN-Transformer and UNet

被引:2
|
作者
Wang, Rui [1 ]
Cai, Mingxiang [1 ]
Xia, Zixuan [2 ]
Zhou, Zhicui [3 ]
机构
[1] China Transport Telecommun & Informat Ctr, Beijing 100011, Peoples R China
[2] Heilongjiang Univ Technol, Harbin 150022, Heilongjiang, Peoples R China
[3] No 1 Middle Sch Weifang, Jixi 150022, Heilongjiang, Peoples R China
关键词
Road segmentation; deep learning; CNN-transformer; attention; UNet; EXTRACTION;
D O I
10.1109/ACCESS.2023.3344797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Real-time and accurate road information is crucial for updating electronic navigation maps. To address the problem of low precision and poor robustness in current semantic segmentation methods for road extraction from remote sensing imagery, we proposed a UNet road semantic segmentation model based on attention mechanism improvement. First, we introduce a CNN-Transformer hybrid structure to the encoder to enhance the feature extraction capabilities of global and local details. Second, the traditional upsampling module in the decoder is replaced with a dual upsampling module to improve feature extraction capabilities and segmentation accuracy. Furthermore, the hard-swish activation function is used instead of ReLU activation function to smooth the curve, which helps to improve the generalization and non-linear feature extraction abilities and avoid gradient vanishing. Finally, a comprehensive loss function combining cross entropy and dice is used to strengthen the segmentation result constraints and further improve segmentation accuracy. Experimental validation is performed on the Ottawa Road Dataset and the Massachusetts Road Dataset. Experimental results show that compared with U-Net, PSPNet, DeepLab V3 and TransUNet networks, this algorithm is the best in terms of MIoU, MPA and F1 score. Among them, on the Ottawa road data set, the MPA of this algorithm reached 95.48%. On the Massachusetts road data set, MPA is 92.56%. This method shows good performance in road extraction.
引用
收藏
页码:144446 / 144455
页数:10
相关论文
共 50 条
  • [31] CNN-transformer dual branch collaborative model for semantic segmentation of high-resolution remote sensing images
    Zhu, Xiaotong
    Peng, Taile
    Guo, Jia
    Wang, Hao
    Cao, Taotao
    PHOTOGRAMMETRIC RECORD, 2025, 40 (189):
  • [32] TransUMobileNet: Integrating multi-channel attention fusion with hybrid CNN-Transformer architecture for medical image segmentation
    Cai, Sijing
    Jiang, Yukun
    Xiao, Yuwei
    Zeng, Jian
    Zhou, Guangming
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 107
  • [33] EGCM-UNet: Edge Guided Hybrid CNN-Mamba UNet for farmland remote sensing image semantic segmentation
    Zheng, Jianhua
    Fu, Yusha
    Chen, Xiaohan
    Zhao, Ruolin
    Lu, Junde
    Zhao, Huanghui
    Chen, Qian
    GEOCARTO INTERNATIONAL, 2025, 40 (01)
  • [34] An Efficient Hybrid CNN-Transformer Approach for Remote Sensing Super-Resolution
    Zhang, Wenjian
    Tan, Zheng
    Lv, Qunbo
    Li, Jiaao
    Zhu, Baoyu
    Liu, Yangyang
    REMOTE SENSING, 2024, 16 (05)
  • [35] HTC-Net: A hybrid CNN-transformer framework for medical image segmentation
    Tang, Hui
    Chen, Yuanbin
    Wang, Tao
    Zhou, Yuanbo
    Zhao, Longxuan
    Gao, Qinquan
    Du, Min
    Tan, Tao
    Zhang, Xinlin
    Tong, Tong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [36] HSACT: A hierarchical semantic-aware CNN-Transformer for remote sensing image spectral super-resolution
    Zhou, Chengle
    He, Zhi
    Zou, Liwei
    Li, Yunfei
    Plaza, Antonio
    NEUROCOMPUTING, 2025, 636
  • [37] Alternate encoder and dual decoder CNN-Transformer networks for medical image segmentation
    Zhang, Lin
    Guo, Xinyu
    Sun, Hongkun
    Wang, Weigang
    Yao, Liwei
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [38] Multi-Scale Orthogonal Model CNN-Transformer for Medical Image Segmentation
    Zhou, Wuyi
    Zeng, Xianhua
    Zhou, Mingkun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (10)
  • [39] Efficient Transformer for Remote Sensing Image Segmentation
    Xu, Zhiyong
    Zhang, Weicun
    Zhang, Tianxiang
    Yang, Zhifang
    Li, Jiangyun
    REMOTE SENSING, 2021, 13 (18)
  • [40] Single-Image Superresolution for RGB Remote Sensing Imagery via Multiscale CNN-Transformer Feature Fusion
    Yao, Xudong
    Zhang, Haopeng
    Wen, Sizhe
    Shi, Zhenwei
    Jiang, Zhiguo
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 1302 - 1316