Remote Sensing Image Road Segmentation Method Integrating CNN-Transformer and UNet

Cited by: 2
Authors
Wang, Rui [1]
Cai, Mingxiang [1]
Xia, Zixuan [2]
Zhou, Zhicui [3]
Affiliations
[1] China Transport Telecommun & Informat Ctr, Beijing 100011, Peoples R China
[2] Heilongjiang Univ Technol, Harbin 150022, Heilongjiang, Peoples R China
[3] No 1 Middle Sch Weifang, Jixi 150022, Heilongjiang, Peoples R China
Keywords
Road segmentation; deep learning; CNN-transformer; attention; UNet; EXTRACTION;
DOI
10.1109/ACCESS.2023.3344797
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Real-time, accurate road information is crucial for updating electronic navigation maps. To address the low precision and poor robustness of current semantic segmentation methods for road extraction from remote sensing imagery, we propose an attention-enhanced UNet model for road semantic segmentation. First, we introduce a CNN-Transformer hybrid structure into the encoder to strengthen the extraction of both global features and local details. Second, we replace the traditional upsampling module in the decoder with a dual upsampling module to improve feature extraction and segmentation accuracy. Third, we use the hard-swish activation function in place of ReLU; its smoother curve helps improve generalization and non-linear feature extraction and helps avoid vanishing gradients. Finally, we use a combined loss function that sums cross-entropy and Dice terms to constrain the segmentation result more tightly and further improve accuracy. We validate the method on the Ottawa Road Dataset and the Massachusetts Road Dataset. Compared with U-Net, PSPNet, DeepLab V3, and TransUNet, the proposed method achieves the best MIoU, MPA, and F1 score. Its MPA reaches 95.48% on the Ottawa Road Dataset and 92.56% on the Massachusetts Road Dataset, indicating good performance in road extraction.
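The abstract names two components that are easy to illustrate numerically: the hard-swish activation and a loss that combines cross-entropy with Dice. The following is a minimal NumPy sketch, not the authors' implementation. The function names, the binary (road vs. background) formulation, the `dice_weight` parameter, and the equal default weighting are all assumptions made for illustration; the abstract does not state how the two loss terms are weighted.

```python
import numpy as np


def hard_swish(x):
    """Standard hard-swish: x * ReLU6(x + 3) / 6."""
    return x * np.clip(x + 3.0, 0.0, 6.0) / 6.0


def ce_dice_loss(probs, target, dice_weight=0.5, eps=1e-7):
    """Weighted sum of binary cross-entropy and Dice loss.

    probs  : predicted road probabilities in [0, 1]
    target : ground-truth mask with values 0 or 1, same shape as probs
    dice_weight is a hypothetical parameter; the paper's weighting is not
    given in the abstract.
    """
    probs = np.clip(probs, eps, 1.0 - eps)  # avoid log(0)
    ce = -np.mean(target * np.log(probs) + (1.0 - target) * np.log(1.0 - probs))
    intersection = np.sum(probs * target)
    dice = (2.0 * intersection + eps) / (np.sum(probs) + np.sum(target) + eps)
    return (1.0 - dice_weight) * ce + dice_weight * (1.0 - dice)


if __name__ == "__main__":
    mask = np.array([[0.0, 1.0], [1.0, 0.0]])
    good = np.array([[0.1, 0.9], [0.8, 0.2]])
    bad = np.array([[0.9, 0.1], [0.2, 0.8]])
    print(hard_swish(np.array([-4.0, 0.0, 1.0, 4.0])))
    print(ce_dice_loss(good, mask), ce_dice_loss(bad, mask))
```

The Dice term measures overlap over the whole mask, so it is less sensitive than cross-entropy alone to the class imbalance typical of road imagery, where road pixels are a small fraction of each tile.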
Pages: 144446-144455
Page count: 10