Structured Adversarial Self-Supervised Learning for Robust Object Detection in Remote Sensing Images

被引：0

作者：

The Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong ^{[1
]}

不详 ^{[2
]}

210049, China

不详 ^{[3
]}

710071, China

机构：

来源：

IEEE Trans Geosci Remote Sens | 2024年 / 1-20期

关键词：

Job analysis - Object detection - Object recognition - Supervised learning;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Object detection plays a crucial role in scene understanding and has extensive practical applications. In the field of remote sensing object detection, both detection accuracy and robustness are of significant concern. Existing methods heavily rely on sophisticated adversarial training strategies that tend to improve robustness at the expense of accuracy. However, detection robustness is not always indicative of improved accuracy. Therefore, in this article, we research how to enhance robustness, while still preserving high accuracy, or even improve both simultaneously, with simple vanilla adversarial training or even in the absence thereof. In pursuit of a solution, we first conduct an exploratory investigation by shifting our attention from adversarial training, referred to as adversarial fine-tuning, to adversarial pretraining. Specifically, we propose a novel pretraining paradigm, namely, structured adversarial self-supervised (SASS) pretraining, to strengthen both clean accuracy and adversarial robustness for object detection in remote sensing images. At a high level, SASS pretraining aims to unify adversarial learning and self-supervised learning into pretraining and encode structured knowledge into pretrained representations for powerful transferability to downstream detection. Moreover, to fully explore the inherent robustness of vision Transformers and facilitate their pretraining efficiency, by leveraging the recent masked image modeling (MIM) as the pretext task, we further instantiate SASS pretraining into a concise end-to-end framework, named structured adversarial MIM (SA-MIM). SA-MIM consists of two pivotal components: structured adversarial attack and structured MIM (S-MIM). The former establishes structured adversaries for the context of adversarial pretraining, while the latter introduces a structured local-sampling global-masking strategy to adapt to hierarchical encoder architectures. Comprehensive experiments on three different datasets have demonstrated the significant superiority of the proposed pretraining paradigm over previous counterparts for remote sensing object detection. More importantly, regardless of with or without adversarial fine-tuning, it enables simultaneous improvements in detection accuracy and robustness as expected, promisingly alleviating the dependence on complicated adversarial fine-tuning. © 2024 IEEE.

引用

页码：1 / 20

共 50 条

[41] Self-supervised learning for robust object retrieval without human annotations
Van den Herrewegen, Jarne
Tourwe, Tom
Wyffels, Francis
COMPUTERS & GRAPHICS-UK, 2023, 115 : 13 - 24
[42] SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images
Qian, Xiaoliang
Lin, Chenyang
Chen, Zhiwu
Wang, Wei
REMOTE SENSING, 2024, 16 (09)
[43] Pixel-Level Self-Supervised Learning for Semi-Supervised Building Extraction From Remote Sensing Images
Yu, Anzhu
Liu, Bing
Cao, Xuefeng
Qiu, Chunping
Guo, Wenyue
Quan, Yujun
IEEE Geoscience and Remote Sensing Letters, 2022, 19
[44] Pixel-Level Self-Supervised Learning for Semi-Supervised Building Extraction From Remote Sensing Images
Yu, Anzhu
Liu, Bing
Cao, Xuefeng
Qiu, Chunping
Guo, Wenyue
Quan, Yujun
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[45] A Self-supervised Adversarial Learning Approach for Network Intrusion Detection System
Deng, Lirui
Zhao, Youjian
Bao, Heng
CYBER SECURITY, CNCERT 2022, 2022, 1699 : 73 - 85
[46] Self-supervised multimodal change detection based on difference contrast learning for remote sensing imagery
Hou, Xuan
Bai, Yunpeng
Xie, Yefan
Zhang, Yunfeng
Fu, Lei
Li, Ying
Shang, Changjing
Shen, Qiang
PATTERN RECOGNITION, 2025, 159
[47] RAPID WILDFIRE HOTSPOT DETECTION USING SELF-SUPERVISED LEARNING ON TEMPORAL REMOTE SENSING DATA
Barco, Luca
Urbanelli, Angelica
Rossi, Claudio
IGARSS 2024-2024 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, IGARSS 2024, 2024, : 2061 - 2065
[48] Self-Supervised Representation Learning for Remote Sensing Image Change Detection Based on Temporal Prediction
Dong, Huihui
Ma, Wenping
Wu, Yue
Zhang, Jun
Jiao, Licheng
REMOTE SENSING, 2020, 12 (11)
[49] Semi- and Self-Supervised Metric Learning for Remote Sensing Applications
Hernandez-Sequeira, Itza
Fernandez-Beltran, Ruben
Pla, Filiberto
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
[50] Self-Supervised Material and Texture Representation Learning for Remote Sensing Tasks
Akiva, Peri
Purri, Matthew
Leotta, Matthew
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8193 - 8205

← 1 2 3 4 5 →