CTCFNet: CNN-Transformer Complementary and Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation

被引:0
|
作者
Lu, Chen [1 ]
Zhang, Xian [1 ]
Du, Kaile [1 ]
Xu, Han [1 ]
Liu, Guangcan [1 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Remote sensing; Transformers; Decoding; Semantic segmentation; Bidirectional control; Complementary information; convolutional neural network (CNN) transformer; feature fusion; semantic segmentation; CLUSTERING ALGORITHMS; TEXTURE; CLASSIFICATION; EXTRACTION;
D O I
10.1109/TGRS.2024.3458446
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Semantic segmentation of high-resolution remote sensing images poses challenges such as scale variability, diverse objects, and obstruction by surface elements. These factors often lead existing methods to suffer from issues like missed and false detections, as well as coarse segmentation boundaries. To tackle these challenges, this article proposes a CNN-transformer complementary and fusion network, termed as CTCFNet. It aims to enhance segmentation accuracy and robustness by extracting and integrating the complementary global and local information from high-resolution remote sensing images. The CTCFNet operates through two primary stages: feature extraction and fusion. In the feature extraction stage, a feature extractor employs convolutional neural network (CNN) and pyramid vision transformer (PVT) blocks to extract both local and global features. A boundary loss is also proposed to improve the segmentation performance for object textures and boundaries. In the feature fusion stage, a feature aggregation module (FAM) is first designed to effectively fuse local and global features at the same scale, facilitating the feature extractor to obtain more comprehensive representations. On this basis, a bi-directional decoder (BiDecoder) reconstructs multiscale features through both top-down and bottom-up directions, resulting in more precise segmentation outputs. Experiments on several high-resolution remote sensing image datasets demonstrate that the proposed method outperforms the state-of-the-art methods in terms of segmentation accuracy and generalization. The code is available at https://github.com/ChenLu0000/CTCFNet.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] High-resolution remote sensing image semantic segmentation based on a deep feature aggregation network
    Wang, Zhen
    Guo, Jianxin
    Huang, Wenzhun
    Zhang, Shanwen
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2021, 32 (09)
  • [22] TCNet: Multiscale Fusion of Transformer and CNN for Semantic Segmentation of Remote Sensing Images
    Xiang, Xuyang
    Gong, Wenping
    Li, Shuailong
    Chen, Jun
    Ren, Tianhe
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3123 - 3136
  • [23] Progressive CNN-transformer semantic compensation network for polyp segmentation
    Li, Daxiang
    Li, Denghui
    Liu, Ying
    Tang, Yao
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2024, 32 (16): : 2523 - 2536
  • [24] Multilevel Feature Fusion and Attention Network for High-Resolution Remote Sensing Image Semantic Labeling
    Zhang, Yijie
    Cheng, Jian
    Bai, Haiwei
    Wang, Qi
    Liang, Xingyu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [25] Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery
    Zhang, Cheng
    Jiang, Wanshou
    Zhang, Yuan
    Wang, Wei
    Zhao, Qing
    Wang, Chenjie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [26] A lightweight distillation CNN-transformer architecture for remote sensing image super-resolution
    Wang, Yu
    Shao, Zhenfeng
    Lu, Tao
    Liu, Lifeng
    Huang, Xiao
    Wang, Jiaming
    Jiang, Kui
    Zeng, Kangli
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (01) : 3560 - 3579
  • [27] CTHNet: A CNN-Transformer Hybrid Network for Landslide Identification in Loess Plateau Regions Using High-Resolution Remote Sensing Images
    Li, Juan
    Zhang, Jin
    Fu, Yongyong
    SENSORS, 2025, 25 (01)
  • [28] CCTNet: CNN and Cross-Shaped Transformer Hybrid Network for Remote Sensing Image Semantic Segmentation
    Wu, Honglin
    Zeng, Zhaobin
    Huang, Peng
    Yu, Xinyu
    Zhang, Min
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 19986 - 19997
  • [29] SSNet: A Novel Transformer and CNN Hybrid Network for Remote Sensing Semantic Segmentation
    Yao, Min
    Zhang, Yaozu
    Liu, Guofeng
    Pang, Dongdong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 3023 - 3037
  • [30] Hybrid CNN and Transformer Network for Semantic Segmentation of UAV Remote Sensing Images
    Zhou X.
    Zhou L.
    Gong S.
    Zhang H.
    Zhong S.
    Xia Y.
    Huang Y.
    IEEE Journal on Miniaturization for Air and Space Systems, 2024, 5 (01): : 33 - 41