Reinforcement Learning-based Adversarial Attacks on Object Detectors using Reward Shaping

Cited by: 1

Authors:

Shi, Zhenbo [1]

Yang, Wei [2]

Xu, Zhenbo [3]

Yu, Zhidong [1]

Huang, Liusheng [1]

Affiliations:

[1] Univ Sci & Technol China, Hefei, Peoples R China

[2] Univ Sci & Technol China, Hefei Natl Lab, Hefei, Peoples R China

[3] Beihang Univ, Hangzhou Innovat Inst, Hangzhou, Peoples R China

Source:

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023

Funding:

National Natural Science Foundation of China; China Postdoctoral Science Foundation;

Keywords:

Object Detection; Reinforcement Learning; Adversarial Attack;

DOI:

10.1145/3581783.3612304

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the field of object detector attacks, previous methods primarily rely on fixed gradient optimization or patch-based cover techniques, often leading to suboptimal attack performance and excessive distortions. To address these limitations, we propose a novel attack method, Interactive Reinforcement-based Sparse Attack (IRSA), which employs Reinforcement Learning (RL) to discover the vulnerabilities of object detectors and systematically generate erroneous results. Specifically, we formulate the process of seeking optimal margins for adversarial examples as a Markov Decision Process (MDP). We tackle the RL convergence difficulty through innovative reward functions and a composite optimization method for effective and efficient policy training. Moreover, the perturbations generated by IRSA are more subtle and difficult to detect while requiring less computational effort. Our method also demonstrates strong generalization capabilities against various object detectors. In summary, IRSA is a refined, efficient, and scalable interactive, iterative, end-to-end algorithm.

Pages: 8424-8432

Page count: 9

50 items in total

[31] Hindsight Reward Shaping in Deep Reinforcement Learning
de Villiers, Byron
Sabatta, Deon
2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 653 - 659
[32] Reinforcement learning-based feature learning for object tracking
Liu, F
Su, JB
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 748 - 751
[33] Aircraft detection in satellite imagery using deep learning-based object detectors
Azam, Basim
Khan, Muhammad Jaleed
Bhatti, Farrukh Aziz
Maud, Abdur Rahman M.
Hussain, Syed Fawad
Hashmi, Ali Javed
Khurshid, Khurram
MICROPROCESSORS AND MICROSYSTEMS, 2022, 94
[34] Sparse reward for reinforcement learning-based continuous integration testing
Yang, Yang
Li, Zheng
Shang, Ying
Li, Qianyu
JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2023, 35 (06)
[35] Deep Reinforcement Learning-based Image Captioning with Embedding Reward
Ren, Zhou
Wang, Xiaoyu
Zhang, Ning
Lv, Xutao
Li, Li-Jia
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1151 - 1159
[36] Reinforcement Learning-Based Cooperative Adversarial Algorithm for UAV Cluster
Li, Yan
Gao, Yanlong
Dai, Xunhua
Nian, Xiaohong
Wang, Haibo
Xiong, HongYun
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1129 - 1138
[37] Adversarial robustness of deep reinforcement learning-based intrusion detection
Merzouk, Mohamed Amine
Neal, Christopher
Delas, Josephine
Yaich, Reda
Boulahia-Cuppens, Nora
Cuppens, Frederic
INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (06) : 3625 - 3651
[38] Adversarial patch attacks against aerial imagery object detectors
Tang, Guijian
Jiang, Tingsong
Zhou, Weien
Li, Chao
Yao, Wen
Zhao, Yong
NEUROCOMPUTING, 2023, 537 : 128 - 140
[39] A new Potential-Based Reward Shaping for Reinforcement Learning Agent
Badnava, Babak
Esmaeili, Mona
Mozayani, Nasser
Zarkesh-Ha, Payman
2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 630 - 635
[40] Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning
Okudo, Takato
Yamada, Seiji
IEEE ACCESS, 2021, 9 : 97557 - 97568
