Structured Adversarial Self-Supervised Learning for Robust Object Detection in Remote Sensing Images

Authors
The Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong [1]
Unknown [2]
210049, China
Unknown [3]
710071, China
Affiliations
Source
Keywords
Job analysis - Object detection - Object recognition - Supervised learning;
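DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract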
Object detection plays a crucial role in scene understanding and has extensive practical applications. In remote sensing object detection, both detection accuracy and robustness are of significant concern. Existing methods rely heavily on sophisticated adversarial training strategies that tend to improve robustness at the expense of accuracy. However, greater detection robustness does not necessarily imply higher accuracy. Therefore, in this article, we investigate how to enhance robustness while preserving high accuracy, or even improve both simultaneously, using only simple vanilla adversarial training or none at all. In pursuit of a solution, we first conduct an exploratory investigation by shifting our attention from adversarial training, referred to here as adversarial fine-tuning, to adversarial pretraining. Specifically, we propose a novel pretraining paradigm, namely, structured adversarial self-supervised (SASS) pretraining, to strengthen both clean accuracy and adversarial robustness for object detection in remote sensing images. At a high level, SASS pretraining aims to unify adversarial learning and self-supervised learning in the pretraining stage and to encode structured knowledge into the pretrained representations so that they transfer well to downstream detection. Moreover, to fully explore the inherent robustness of vision Transformers and improve their pretraining efficiency, we adopt the recent masked image modeling (MIM) as the pretext task and instantiate SASS pretraining as a concise end-to-end framework, named structured adversarial MIM (SA-MIM). SA-MIM consists of two pivotal components: a structured adversarial attack and structured MIM (S-MIM). The former establishes structured adversaries for adversarial pretraining, while the latter introduces a structured local-sampling global-masking strategy to adapt to hierarchical encoder architectures. Comprehensive experiments on three different datasets demonstrate the significant superiority of the proposed pretraining paradigm over previous counterparts for remote sensing object detection. More importantly, with or without adversarial fine-tuning, it yields simultaneous improvements in detection accuracy and robustness, which alleviates the dependence on complicated adversarial fine-tuning. © 2024 IEEE.
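For readers unfamiliar with the "vanilla adversarial training" mentioned in the abstract, the following is a minimal sketch of the generic idea, not the structured attack or the SA-MIM framework of the paper. It assumes a toy logistic-regression model and a single signed-gradient (FGSM-style) step; the names `loss_and_input_grad` and `fgsm`, the weights, and the sample are all hypothetical and chosen only for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def loss_and_input_grad(w, b, x, y):
    """Binary cross-entropy of a toy logistic model and its gradient w.r.t. the input x."""
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)
    loss = -(y * math.log(p) + (1 - y) * math.log(1 - p))
    grad_x = [(p - y) * wi for wi in w]  # dL/dx for logistic regression
    return loss, grad_x


def fgsm(w, b, x, y, eps):
    """One signed-gradient step that raises the loss inside an L-infinity ball of radius eps."""
    _, grad_x = loss_and_input_grad(w, b, x, y)
    sign = lambda v: (v > 0) - (v < 0)
    return [xi + eps * sign(gi) for xi, gi in zip(x, grad_x)]


# Hypothetical toy model and sample (illustrative values only).
w, b = [2.0, -1.0, 0.5], 0.1
x, y = [0.3, 0.8, -0.2], 1

x_adv = fgsm(w, b, x, y, eps=0.1)
clean_loss, _ = loss_and_input_grad(w, b, x, y)
adv_loss, _ = loss_and_input_grad(w, b, x_adv, y)
print(f"clean loss: {clean_loss:.4f}, adversarial loss: {adv_loss:.4f}")
```

In adversarial training, perturbed inputs such as `x_adv` are fed back into the training loss so that the model learns to resist them; the abstract's point is that doing this only at fine-tuning time tends to trade accuracy for robustness.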