CTCFNet: CNN-Transformer Complementary and Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation

被引:0
|
作者
Lu, Chen [1 ]
Zhang, Xian [1 ]
Du, Kaile [1 ]
Xu, Han [1 ]
Liu, Guangcan [1 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Image segmentation; Feature extraction; Remote sensing; Transformers; Decoding; Semantic segmentation; Bidirectional control; Complementary information; convolutional neural network (CNN) transformer; feature fusion; semantic segmentation; CLUSTERING ALGORITHMS; TEXTURE; CLASSIFICATION; EXTRACTION;
D O I
10.1109/TGRS.2024.3458446
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Semantic segmentation of high-resolution remote sensing images poses challenges such as scale variability, diverse objects, and obstruction by surface elements. These factors often lead existing methods to suffer from issues like missed and false detections, as well as coarse segmentation boundaries. To tackle these challenges, this article proposes a CNN-transformer complementary and fusion network, termed as CTCFNet. It aims to enhance segmentation accuracy and robustness by extracting and integrating the complementary global and local information from high-resolution remote sensing images. The CTCFNet operates through two primary stages: feature extraction and fusion. In the feature extraction stage, a feature extractor employs convolutional neural network (CNN) and pyramid vision transformer (PVT) blocks to extract both local and global features. A boundary loss is also proposed to improve the segmentation performance for object textures and boundaries. In the feature fusion stage, a feature aggregation module (FAM) is first designed to effectively fuse local and global features at the same scale, facilitating the feature extractor to obtain more comprehensive representations. On this basis, a bi-directional decoder (BiDecoder) reconstructs multiscale features through both top-down and bottom-up directions, resulting in more precise segmentation outputs. Experiments on several high-resolution remote sensing image datasets demonstrate that the proposed method outperforms the state-of-the-art methods in terms of segmentation accuracy and generalization. The code is available at https://github.com/ChenLu0000/CTCFNet.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Hybrid Attention Fusion Embedded in Transformer for Remote Sensing Image Semantic Segmentation
    Chen, Yan
    Dong, Quan
    Wang, Xiaofeng
    Zhang, Qianchuan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4421 - 4435
  • [42] Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images
    Wu, Xinjia
    Zhang, Jing
    Li, Wensheng
    Li, Jiafeng
    Zhuo, Li
    Zhang, Jie
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (04) : 1280 - 1307
  • [43] Remote sensing image change detection based on CNN-Transformer structure
    Pan, Mengyang
    Yang, Hang
    Fan, Xianghui
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (10) : 1361 - 1379
  • [44] A Full-Scale Connected CNN-Transformer Network for Remote Sensing Image Change Detection
    Chen, Min
    Zhang, Qiangjiang
    Ge, Xuming
    Xu, Bo
    Hu, Han
    Zhu, Qing
    Zhang, Xin
    REMOTE SENSING, 2023, 15 (22)
  • [45] Fuzzy neighbourhood neural network for high-resolution remote sensing image segmentation
    Qu, Tingting
    Xu, Jindong
    Chong, Qianpeng
    Liu, Zhaowei
    Yan, Weiqing
    Wang, Xuan
    Song, Yongchao
    Ni, Mengying
    EUROPEAN JOURNAL OF REMOTE SENSING, 2023, 56 (01)
  • [46] AFNet: Adaptive Fusion Network for Remote Sensing Image Semantic Segmentation
    Liu, Rui
    Mi, Li
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (09): : 7871 - 7886
  • [47] Irregular adaptive refinement network for semantic segmentation of high-resolution remote sensing images
    Deng, Lulu
    Zhang, Changlun
    He, Qiang
    Wang, Hengyou
    Huo, Lianzhi
    Mu, Haibing
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (5-6): : 11235 - 11246
  • [48] A Semantic Segmentation Approach Based on DeepLab Network in High-Resolution Remote Sensing Images
    Hu, Hangtao
    Cai, Shuo
    Wang, Wei
    Zhang, Peng
    Li, Zhiyong
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 292 - 304
  • [49] HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images
    Xu, Zhiyong
    Zhang, Weicun
    Zhang, Tianxiang
    Li, Jiangyun
    REMOTE SENSING, 2021, 13 (01) : 1 - 23
  • [50] Spatially adaptive interaction network for semantic segmentation of high-resolution remote sensing images
    Weidong Song
    Huan He
    Jiguang Dai
    Guohui Jia
    Scientific Reports, 15 (1)