Semi-Supervised Adversarial Semantic Segmentation Network Using Transformer and Multiscale Convolution for High-Resolution Remote Sensing Imagery

被引:18
|
作者
Zheng, Yalan [1 ,2 ,3 ,4 ]
Yang, Mengyuan [1 ,2 ,3 ,4 ]
Wang, Min [1 ,2 ,3 ,4 ]
Qian, Xiaojun [5 ]
Yang, Rui [1 ,2 ,3 ,4 ]
Zhang, Xin [6 ]
Dong, Wen [6 ]
机构
[1] Nanjing Normal Univ, Minist Educ, Key Lab Virtual Geog Environm, Nanjing 210023, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Jiangsu Ctr Collaborat Innovat Geog Informat Reso, Nanjing 210023, Peoples R China
[4] State Key Lab Cultivat Base Geog Environm Evolut, Nanjing 210023, Peoples R China
[5] Nanjing Normal Univ, Sch Artificial Intelligence, Nanjing 210097, Peoples R China
[6] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
semantic segmentation; semi-supervised learning; transformer; adversarial learning; remote sensing; building extraction; EXTRACTION;
D O I
10.3390/rs14081786
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic segmentation is a crucial approach for remote sensing interpretation. High-precision semantic segmentation results are obtained at the cost of manually collecting massive pixelwise annotations. Remote sensing imagery contains complex and variable ground objects and obtaining abundant manual annotations is expensive and arduous. The semi-supervised learning (SSL) strategy can enhance the generalization capability of a model with a small number of labeled samples. In this study, a novel semi-supervised adversarial semantic segmentation network is developed for remote sensing information extraction. A multiscale input convolution module (MICM) is designed to extract sufficient local features, while a Transformer module (TM) is applied for long-range dependency modeling. These modules are integrated to construct a segmentation network with a double-branch encoder. Additionally, a double-branch discriminator network with different convolution kernel sizes is proposed. The segmentation network and discriminator network are jointly trained under the semi-supervised adversarial learning (SSAL) framework to improve its segmentation accuracy in cases with small amounts of labeled data. Taking building extraction as a case study, experiments on three datasets with different resolutions are conducted to validate the proposed network. Semi-supervised semantic segmentation models, in which DeepLabv2, the pyramid scene parsing network (PSPNet), UNet and TransUNet are taken as backbone networks, are utilized for performance comparisons. The results suggest that the approach effectively improves the accuracy of semantic segmentation. The F1 and mean intersection over union (mIoU) accuracy measures are improved by 0.82-11.83% and 0.74-7.5%, respectively, over those of other methods.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] High-resolution remote sensing image semantic segmentation based on semi-supervised full convolution network method
    Geng Y.
    Tao C.
    Shen J.
    Zou Z.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2020, 49 (04): : 499 - 508
  • [2] Multiscale Progressive Segmentation Network for High-Resolution Remote Sensing Imagery
    Hang, Renlong
    Yang, Ping
    Zhou, Feng
    Liu, Qingshan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] CHANGE DETECTION OF HIGH-RESOLUTION REMOTE SENSING IMAGE BASED ON SEMI-SUPERVISED SEGMENTATION AND ADVERSARIAL LEARNING
    Yang, Shengnan
    Hou, Shilong
    Zhang, Yifan
    Wang, Hongyu
    Ma, Xiaorui
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1055 - 1058
  • [4] SMAF-Net: Sharing Multiscale Adversarial Feature for High-Resolution Remote Sensing Imagery Semantic Segmentation
    Chen, Jie
    Zhu, Jingru
    Sun, Geng
    Li, Jianhui
    Deng, Min
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (11) : 1921 - 1925
  • [5] Semi-Supervised Building Detection from High-Resolution Remote Sensing Imagery
    Zheng, Daoyuan
    Kang, Jianing
    Wu, Kaishun
    Feng, Yuting
    Guo, Han
    Zheng, Xiaoyun
    Li, Shengwen
    Fang, Fang
    SUSTAINABILITY, 2023, 15 (15)
  • [6] Multiscale Cascaded Network for the Semantic Segmentation of High-Resolution Remote Sensing Images
    Zhang, Xiaolu
    Wang, Zhaoshun
    Wei, Anlei
    CANADIAN JOURNAL OF REMOTE SENSING, 2023, 49 (01)
  • [7] SEMI-SUPERVISED SPARSE RELEARNING REPRESENTATION CLASSIFICATION FOR HIGH-RESOLUTION REMOTE SENSING IMAGERY
    Li, Jiayi
    Huang, Xin
    Zhang, Liangpei
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 2618 - 2621
  • [8] Semantic Segmentation of High-Resolution Remote Sensing Images Using Multiscale Skip Connection Network
    Ma, Bifang
    Chang, Chih-Yung
    IEEE SENSORS JOURNAL, 2022, 22 (04) : 3745 - 3755
  • [9] Semi-supervised semantic segmentation using an improved generative adversarial network
    Xu, Di
    Wang, Zhili
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9709 - 9719
  • [10] Semi-supervised semantic segmentation based on Generative Adversarial Networks for remote sensing images
    Liu Yu-Xi
    Zhang Bo
    Wang Bin
    JOURNAL OF INFRARED AND MILLIMETER WAVES, 2020, 39 (04) : 473 - 482