CTCFNet: CNN-Transformer Complementary and Fusion Network for High-Resolution Remote Sensing Image Semantic Segmentation

被引：0

作者：

Lu, Chen ^{[1
]}

Zhang, Xian ^{[1
]}

Du, Kaile ^{[1
]}

Xu, Han ^{[1
]}

Liu, Guangcan ^{[1
]}

机构：

[1] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Image segmentation; Feature extraction; Remote sensing; Transformers; Decoding; Semantic segmentation; Bidirectional control; Complementary information; convolutional neural network (CNN) transformer; feature fusion; semantic segmentation; CLUSTERING ALGORITHMS; TEXTURE; CLASSIFICATION; EXTRACTION;

D O I：

10.1109/TGRS.2024.3458446

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Semantic segmentation of high-resolution remote sensing images poses challenges such as scale variability, diverse objects, and obstruction by surface elements. These factors often lead existing methods to suffer from issues like missed and false detections, as well as coarse segmentation boundaries. To tackle these challenges, this article proposes a CNN-transformer complementary and fusion network, termed as CTCFNet. It aims to enhance segmentation accuracy and robustness by extracting and integrating the complementary global and local information from high-resolution remote sensing images. The CTCFNet operates through two primary stages: feature extraction and fusion. In the feature extraction stage, a feature extractor employs convolutional neural network (CNN) and pyramid vision transformer (PVT) blocks to extract both local and global features. A boundary loss is also proposed to improve the segmentation performance for object textures and boundaries. In the feature fusion stage, a feature aggregation module (FAM) is first designed to effectively fuse local and global features at the same scale, facilitating the feature extractor to obtain more comprehensive representations. On this basis, a bi-directional decoder (BiDecoder) reconstructs multiscale features through both top-down and bottom-up directions, resulting in more precise segmentation outputs. Experiments on several high-resolution remote sensing image datasets demonstrate that the proposed method outperforms the state-of-the-art methods in terms of segmentation accuracy and generalization. The code is available at https://github.com/ChenLu0000/CTCFNet.

引用

页数：17

共 50 条

[41] Hybrid Attention Fusion Embedded in Transformer for Remote Sensing Image Semantic Segmentation
Chen, Yan
Dong, Quan
Wang, Xiaofeng
Zhang, Qianchuan
Kang, Menglei
Jiang, Wenxiang
Wang, Mengyuan
Xu, Lixiang
Zhang, Chen
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 4421 - 4435
[42] Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images
Wu, Xinjia
Zhang, Jing
Li, Wensheng
Li, Jiafeng
Zhuo, Li
Zhang, Jie
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44 (04) : 1280 - 1307
[43] Remote sensing image change detection based on CNN-Transformer structure
Pan, Mengyang
Yang, Hang
Fan, Xianghui
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (10) : 1361 - 1379
[44] A Full-Scale Connected CNN-Transformer Network for Remote Sensing Image Change Detection
Chen, Min
Zhang, Qiangjiang
Ge, Xuming
Xu, Bo
Hu, Han
Zhu, Qing
Zhang, Xin
REMOTE SENSING, 2023, 15 (22)
[45] Fuzzy neighbourhood neural network for high-resolution remote sensing image segmentation
Qu, Tingting
Xu, Jindong
Chong, Qianpeng
Liu, Zhaowei
Yan, Weiqing
Wang, Xuan
Song, Yongchao
Ni, Mengying
EUROPEAN JOURNAL OF REMOTE SENSING, 2023, 56 (01)
[46] AFNet: Adaptive Fusion Network for Remote Sensing Image Semantic Segmentation
Liu, Rui
Mi, Li
Chen, Zhenzhong
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (09): : 7871 - 7886
[47] Irregular adaptive refinement network for semantic segmentation of high-resolution remote sensing images
Deng, Lulu
Zhang, Changlun
He, Qiang
Wang, Hengyou
Huo, Lianzhi
Mu, Haibing
Journal of Intelligent and Fuzzy Systems, 2024, 46 (5-6): : 11235 - 11246
[48] A Semantic Segmentation Approach Based on DeepLab Network in High-Resolution Remote Sensing Images
Hu, Hangtao
Cai, Shuo
Wang, Wei
Zhang, Peng
Li, Zhiyong
IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 292 - 304
[49] HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images
Xu, Zhiyong
Zhang, Weicun
Zhang, Tianxiang
Li, Jiangyun
REMOTE SENSING, 2021, 13 (01) : 1 - 23
[50] Spatially adaptive interaction network for semantic segmentation of high-resolution remote sensing images
Weidong Song
Huan He
Jiguang Dai
Guohui Jia
Scientific Reports, 15 (1)

← 1 2 3 4 5 →