Structured Adversarial Self-Supervised Learning for Robust Object Detection in Remote Sensing Images

被引:0
|
作者
The Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong [1 ]
不详 [2 ]
210049, China
不详 [3 ]
710071, China
机构
来源
关键词
Job analysis - Object detection - Object recognition - Supervised learning;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Object detection plays a crucial role in scene understanding and has extensive practical applications. In the field of remote sensing object detection, both detection accuracy and robustness are of significant concern. Existing methods heavily rely on sophisticated adversarial training strategies that tend to improve robustness at the expense of accuracy. However, detection robustness is not always indicative of improved accuracy. Therefore, in this article, we research how to enhance robustness, while still preserving high accuracy, or even improve both simultaneously, with simple vanilla adversarial training or even in the absence thereof. In pursuit of a solution, we first conduct an exploratory investigation by shifting our attention from adversarial training, referred to as adversarial fine-tuning, to adversarial pretraining. Specifically, we propose a novel pretraining paradigm, namely, structured adversarial self-supervised (SASS) pretraining, to strengthen both clean accuracy and adversarial robustness for object detection in remote sensing images. At a high level, SASS pretraining aims to unify adversarial learning and self-supervised learning into pretraining and encode structured knowledge into pretrained representations for powerful transferability to downstream detection. Moreover, to fully explore the inherent robustness of vision Transformers and facilitate their pretraining efficiency, by leveraging the recent masked image modeling (MIM) as the pretext task, we further instantiate SASS pretraining into a concise end-to-end framework, named structured adversarial MIM (SA-MIM). SA-MIM consists of two pivotal components: structured adversarial attack and structured MIM (S-MIM). The former establishes structured adversaries for the context of adversarial pretraining, while the latter introduces a structured local-sampling global-masking strategy to adapt to hierarchical encoder architectures. Comprehensive experiments on three different datasets have demonstrated the significant superiority of the proposed pretraining paradigm over previous counterparts for remote sensing object detection. More importantly, regardless of with or without adversarial fine-tuning, it enables simultaneous improvements in detection accuracy and robustness as expected, promisingly alleviating the dependence on complicated adversarial fine-tuning. © 2024 IEEE.
引用
收藏
页码:1 / 20
相关论文
共 50 条
  • [41] Self-supervised learning for robust object retrieval without human annotations
    Van den Herrewegen, Jarne
    Tourwe, Tom
    Wyffels, Francis
    COMPUTERS & GRAPHICS-UK, 2023, 115 : 13 - 24
  • [42] SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images
    Qian, Xiaoliang
    Lin, Chenyang
    Chen, Zhiwu
    Wang, Wei
    REMOTE SENSING, 2024, 16 (09)
  • [43] Pixel-Level Self-Supervised Learning for Semi-Supervised Building Extraction From Remote Sensing Images
    Yu, Anzhu
    Liu, Bing
    Cao, Xuefeng
    Qiu, Chunping
    Guo, Wenyue
    Quan, Yujun
    IEEE Geoscience and Remote Sensing Letters, 2022, 19
  • [44] Pixel-Level Self-Supervised Learning for Semi-Supervised Building Extraction From Remote Sensing Images
    Yu, Anzhu
    Liu, Bing
    Cao, Xuefeng
    Qiu, Chunping
    Guo, Wenyue
    Quan, Yujun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [45] A Self-supervised Adversarial Learning Approach for Network Intrusion Detection System
    Deng, Lirui
    Zhao, Youjian
    Bao, Heng
    CYBER SECURITY, CNCERT 2022, 2022, 1699 : 73 - 85
  • [46] Self-supervised multimodal change detection based on difference contrast learning for remote sensing imagery
    Hou, Xuan
    Bai, Yunpeng
    Xie, Yefan
    Zhang, Yunfeng
    Fu, Lei
    Li, Ying
    Shang, Changjing
    Shen, Qiang
    PATTERN RECOGNITION, 2025, 159
  • [47] RAPID WILDFIRE HOTSPOT DETECTION USING SELF-SUPERVISED LEARNING ON TEMPORAL REMOTE SENSING DATA
    Barco, Luca
    Urbanelli, Angelica
    Rossi, Claudio
    IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2061 - 2065
  • [48] Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
    Dong, Huihui
    Ma, Wenping
    Wu, Yue
    Zhang, Jun
    Jiao, Licheng
    REMOTE SENSING, 2020, 12 (11)
  • [49] Semi- and Self-Supervised Metric Learning for Remote Sensing Applications
    Hernandez-Sequeira, Itza
    Fernandez-Beltran, Ruben
    Pla, Filiberto
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [50] Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks
    Akiva, Peri
    Purri, Matthew
    Leotta, Matthew
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8193 - 8205