Semi-Supervised Adversarial Semantic Segmentation Network Using Transformer and Multiscale Convolution for High-Resolution Remote Sensing Imagery

被引:18
|
作者
Zheng, Yalan [1 ,2 ,3 ,4 ]
Yang, Mengyuan [1 ,2 ,3 ,4 ]
Wang, Min [1 ,2 ,3 ,4 ]
Qian, Xiaojun [5 ]
Yang, Rui [1 ,2 ,3 ,4 ]
Zhang, Xin [6 ]
Dong, Wen [6 ]
机构
[1] Nanjing Normal Univ, Minist Educ, Key Lab Virtual Geog Environm, Nanjing 210023, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] State Key Lab Cultivat Base Geog Environm Evolut, Nanjing 210023, Peoples R China
[5] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210097, Peoples R China
[6] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
semantic segmentation; semi-supervised learning; transformer; adversarial learning; remote sensing; building extraction; EXTRACTION;
D O I
10.3390/rs14081786
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic segmentation is a crucial approach for remote sensing interpretation. High-precision semantic segmentation results are obtained at the cost of manually collecting massive pixelwise annotations. Remote sensing imagery contains complex and variable ground objects and obtaining abundant manual annotations is expensive and arduous. The semi-supervised learning (SSL) strategy can enhance the generalization capability of a model with a small number of labeled samples. In this study, a novel semi-supervised adversarial semantic segmentation network is developed for remote sensing information extraction. A multiscale input convolution module (MICM) is designed to extract sufficient local features, while a Transformer module (TM) is applied for long-range dependency modeling. These modules are integrated to construct a segmentation network with a double-branch encoder. Additionally, a double-branch discriminator network with different convolution kernel sizes is proposed. The segmentation network and discriminator network are jointly trained under the semi-supervised adversarial learning (SSAL) framework to improve its segmentation accuracy in cases with small amounts of labeled data. Taking building extraction as a case study, experiments on three datasets with different resolutions are conducted to validate the proposed network. Semi-supervised semantic segmentation models, in which DeepLabv2, the pyramid scene parsing network (PSPNet), UNet and TransUNet are taken as backbone networks, are utilized for performance comparisons. The results suggest that the approach effectively improves the accuracy of semantic segmentation. The F1 and mean intersection over union (mIoU) accuracy measures are improved by 0.82-11.83% and 0.74-7.5%, respectively, over those of other methods.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Adversarial Semi-Supervised Semantic Segmentation with Attention Mechanism
    Yun, Fei
    Yin, Yanjun
    Zhang, Wenxuan
    Zhi, Min
    Computer Engineering and Applications, 2023, 59 (08) : 254 - 262
  • [32] Semi Supervised Semantic Segmentation Using Generative Adversarial Network
    Souly, Nasim
    Spampinato, Concetto
    Shah, Mubarak
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 5689 - 5697
  • [33] Encoding Contextual Information by Interlacing Transformer and Convolution for Remote Sensing Imagery Semantic Segmentation
    Li, Xin
    Xu, Feng
    Xia, Runliang
    Li, Tao
    Chen, Ziqi
    Wang, Xinyuan
    Xu, Zhennan
    Lyu, Xin
    REMOTE SENSING, 2022, 14 (16)
  • [34] BAS4Net: Boundary-Aware Semi-Supervised Semantic Segmentation Network for Very High Resolution Remote Sensing Images
    Sun, Xian
    Shi, Aijun
    Huang, Hai
    Mayer, Helmut
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 5398 - 5413
  • [35] ClassHyPer: ClassMix-Based Hybrid Perturbations for Deep Semi-Supervised Semantic Segmentation of Remote Sensing Imagery
    He, Yongjun
    Wang, Jinfei
    Liao, Chunhua
    Shan, Bo
    Zhou, Xin
    REMOTE SENSING, 2022, 14 (04)
  • [36] A Bias Correction Semi-Supervised Semantic Segmentation Framework for Remote Sensing Images
    Zhang, Li
    Tan, Zhenshan
    Zheng, Yuzhi
    Zhang, Guo
    Zhang, Wen
    Li, Zhijiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [37] Semisupervised Multiscale Generative Adversarial Network for Semantic Segmentation of Remote Sensing Image
    Wang, Jiaqi
    Liu, Bing
    Zhou, Yong
    Zhao, Jiaqi
    Xia, Shixiong
    Yang, Yuancan
    Zhang, Man
    Ming, Liu Ming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [38] Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery
    Zhang, Cheng
    Jiang, Wanshou
    Zhang, Yuan
    Wang, Wei
    Zhao, Qing
    Wang, Chenjie
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [39] AANet: Adaptive Attention Networks for Semantic Segmentation of High-Resolution Remote Sensing Imagery
    Chen, Yan
    Zhang, Qianchuan
    Wang, Xiaofeng
    Dong, Quan
    Kang, Menglei
    Jiang, Wenxiang
    Wang, Mengyuan
    Xu, Lixiang
    Zhang, Chen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14640 - 14655
  • [40] A Semi-Supervised Pyramid Cross-Temporal Attention Transformer for Change Detection in High-Resolution Remote Sensing Images
    Lv, Pengyuan
    Li, Mengchen
    Zhong, Yanfei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21