Structured Adversarial Self-Supervised Learning for Robust Object Detection in Remote Sensing Images

被引:0
|
作者
The Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong [1 ]
不详 [2 ]
210049, China
不详 [3 ]
710071, China
机构
来源
关键词
Job analysis - Object detection - Object recognition - Supervised learning;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection plays a crucial role in scene understanding and has extensive practical applications. In the field of remote sensing object detection, both detection accuracy and robustness are of significant concern. Existing methods heavily rely on sophisticated adversarial training strategies that tend to improve robustness at the expense of accuracy. However, detection robustness is not always indicative of improved accuracy. Therefore, in this article, we research how to enhance robustness, while still preserving high accuracy, or even improve both simultaneously, with simple vanilla adversarial training or even in the absence thereof. In pursuit of a solution, we first conduct an exploratory investigation by shifting our attention from adversarial training, referred to as adversarial fine-tuning, to adversarial pretraining. Specifically, we propose a novel pretraining paradigm, namely, structured adversarial self-supervised (SASS) pretraining, to strengthen both clean accuracy and adversarial robustness for object detection in remote sensing images. At a high level, SASS pretraining aims to unify adversarial learning and self-supervised learning into pretraining and encode structured knowledge into pretrained representations for powerful transferability to downstream detection. Moreover, to fully explore the inherent robustness of vision Transformers and facilitate their pretraining efficiency, by leveraging the recent masked image modeling (MIM) as the pretext task, we further instantiate SASS pretraining into a concise end-to-end framework, named structured adversarial MIM (SA-MIM). SA-MIM consists of two pivotal components: structured adversarial attack and structured MIM (S-MIM). The former establishes structured adversaries for the context of adversarial pretraining, while the latter introduces a structured local-sampling global-masking strategy to adapt to hierarchical encoder architectures. Comprehensive experiments on three different datasets have demonstrated the significant superiority of the proposed pretraining paradigm over previous counterparts for remote sensing object detection. More importantly, regardless of with or without adversarial fine-tuning, it enables simultaneous improvements in detection accuracy and robustness as expected, promisingly alleviating the dependence on complicated adversarial fine-tuning. © 2024 IEEE.
引用
收藏
页码:1 / 20
相关论文
共 50 条
  • [31] TASK-RELATED SELF-SUPERVISED LEARNING FOR REMOTE SENSING IMAGE CHANGE DETECTION
    Cai, Zhinan
    Jiang, Zhiyu
    Yuan, Yuan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1535 - 1539
  • [32] Contrastive Self-Supervised Learning With Smoothed Representation for Remote Sensing
    Jung, Heechul
    Oh, Yoonju
    Jeong, Seongho
    Lee, Chaehyeon
    Jeon, Taegyun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [33] COLOR-AWARE SELF-SUPERVISED LEARNING FOR SCENE CLASSIFICATION AND SEGMENTATION OF REMOTE SENSING IMAGES
    Xu, Guozheng
    Jiang, Xue
    Liu, Xingzhao
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5049 - 5052
  • [34] Spatial and Semantic Consistency Contrastive Learning for Self-Supervised Semantic Segmentation of Remote Sensing Images
    Dong, Zhe
    Liu, Tianzhu
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [35] Spatial and Semantic Consistency Contrastive Learning for Self-Supervised Semantic Segmentation of Remote Sensing Images
    Dong, Zhe
    Liu, Tianzhu
    Gu, Yanfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [36] Global and Local Contrastive Self-Supervised Learning for Semantic Segmentation of HR Remote Sensing Images
    Li, Haifeng
    Li, Yi
    Zhang, Guo
    Liu, Ruoyun
    Huang, Haozhe
    Zhu, Qing
    Tao, Chao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [37] SACNet: A Novel Self-Supervised Learning Method for Shadow Detection from High-Resolution Remote Sensing Images
    Chen, Dehai
    Kang, Jian
    Wang, Lanying
    Yu, Yongtao
    Zhou, Weixun
    Guan, Haiyan
    Karim, Mannan
    JOURNAL OF GEOVISUALIZATION AND SPATIAL ANALYSIS, 2025, 9 (01)
  • [38] Classification of Polyps in Endoscopic Images Using Self-Supervised Structured Learning
    Huang, Qi-Xian
    Lin, Guo-Shiang
    Sun, Hung-Min
    IEEE ACCESS, 2023, 11 : 50025 - 50037
  • [39] ADVERSARIAL EXAMPLE GENERATION METHOD FOR OBJECT DETECTION IN REMOTE SENSING IMAGES
    Jiang, Wanghan
    Zhou, Yue
    Jiang, Xue
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 5810 - 5813
  • [40] Object-Centric Masked Image Modeling-Based Self-Supervised Pretraining for Remote Sensing Object Detection
    Zhang, Tong
    Zhuang, Yin
    Chen, He
    Chen, Liang
    Wang, Guanqun
    Gao, Peng
    Dong, Hao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 5013 - 5025