CTCFNet: CNN-Transformer Complementary and Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation

Authors
Lu, Chen [1]
Zhang, Xian [1]
Du, Kaile [1]
Xu, Han [1]
Liu, Guangcan [1]
Affiliation
[1] Southeast University, School of Automation, Nanjing 210096, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Image segmentation; Feature extraction; Remote sensing; Transformers; Decoding; Semantic segmentation; Bidirectional control; Complementary information; convolutional neural network (CNN) transformer; feature fusion; semantic segmentation; CLUSTERING ALGORITHMS; TEXTURE; CLASSIFICATION; EXTRACTION;
DOI
10.1109/TGRS.2024.3458446
Chinese Library Classification
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Semantic segmentation of high-resolution remote sensing images is challenging because of scale variability, diverse object types, and obstruction by surface elements. These factors often cause existing methods to produce missed and false detections as well as coarse segmentation boundaries. To tackle these challenges, this article proposes a CNN-transformer complementary and fusion network, termed CTCFNet. It aims to improve segmentation accuracy and robustness by extracting and integrating complementary global and local information from high-resolution remote sensing images. CTCFNet operates in two primary stages: feature extraction and feature fusion. In the feature extraction stage, a feature extractor employs convolutional neural network (CNN) and pyramid vision transformer (PVT) blocks to extract both local and global features. A boundary loss is also proposed to improve segmentation performance on object textures and boundaries. In the feature fusion stage, a feature aggregation module (FAM) is first designed to fuse local and global features at the same scale, helping the feature extractor obtain more comprehensive representations. On this basis, a bidirectional decoder (BiDecoder) reconstructs multiscale features in both top-down and bottom-up directions, resulting in more precise segmentation outputs. Experiments on several high-resolution remote sensing image datasets demonstrate that the proposed method outperforms state-of-the-art methods in segmentation accuracy and generalization. The code is available at https://github.com/ChenLu0000/CTCFNet.
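The two fusion ideas named in the abstract (blending local and global features at the same scale, then passing information both top-down and bottom-up across scales) can be illustrated with a toy sketch. This is not the authors' FAM or BiDecoder: every function name below is hypothetical, 1-D lists stand in for feature maps, and plain addition and averaging replace the learned layers a real network would use. For the actual implementation, refer to the repository linked above.

```python
# Illustrative sketch only: NOT the CTCFNet implementation.
# All names are hypothetical; 1-D lists stand in for 2-D feature maps.

def fuse_same_scale(local_feat, global_feat, alpha=0.5):
    """Blend a local (CNN-like) and a global (transformer-like) feature of equal length."""
    assert len(local_feat) == len(global_feat)
    return [alpha * l + (1.0 - alpha) * g for l, g in zip(local_feat, global_feat)]

def upsample2(feat):
    """Nearest-neighbour upsampling by a factor of 2."""
    return [v for v in feat for _ in range(2)]

def downsample2(feat):
    """Average pooling with window 2 and stride 2."""
    return [(feat[i] + feat[i + 1]) / 2.0 for i in range(0, len(feat), 2)]

def bidirectional_decode(pyramid):
    """Fuse a feature pyramid ordered fine -> coarse (each level half the previous length)."""
    # Top-down pass: push coarse context into the finer levels.
    top_down = list(pyramid)
    for i in range(len(top_down) - 2, -1, -1):
        top_down[i] = [a + b for a, b in zip(top_down[i], upsample2(top_down[i + 1]))]
    # Bottom-up pass: push fine detail back into the coarser levels.
    bottom_up = list(top_down)
    for i in range(1, len(bottom_up)):
        bottom_up[i] = [a + b for a, b in zip(bottom_up[i], downsample2(bottom_up[i - 1]))]
    return bottom_up

if __name__ == "__main__":
    fused = fuse_same_scale([1.0, 3.0], [3.0, 1.0])
    print(fused)
    print(bidirectional_decode([[1, 1, 1, 1], [2, 2], [4]]))
```

In this toy version each pyramid level ends up carrying information from every other level, which is the intuition behind decoding in both directions; the weighting `alpha` is an assumed fixed constant here, whereas a trained module would learn how to combine the two feature types.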
Pages: 17