Reinforcement Learning-based Adversarial Attacks on Object Detectors using Reward Shaping

被引:1
|
作者
Shi, Zhenbo [1 ]
Yang, Wei [2 ]
Xu, Zhenbo [3 ]
Yu, Zhidong [1 ]
Huang, Liusheng [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Univ Sci & Technol China, Hefei Natl Lab, Hefei, Peoples R China
[3] Beihang Univ, Hangzhou Innovat Inst, Hangzhou, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Object Detection; Reinforcement Learning; Adversarial Attack;
D O I
10.1145/3581783.3612304
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of object detector attacks, previous methods primarily rely on fixed gradient optimization or patch-based cover techniques, often leading to suboptimal attack performance and excessive distortions. To address these limitations, we propose a novel attack method, Interactive Reinforcement-based Sparse Attack (IRSA), which employs Reinforcement Learning (RL) to discover the vulnerabilities of object detectors and systematically generate erroneous results. Specifically, we formulate the process of seeking optimal margins for adversarial examples as a Markov Decision Process (MDP). We tackle the RL convergence difficulty through innovative reward functions and a composite optimization method for effective and efficient policy training. Moreover, the perturbations generated by IRSA are more subtle and difficult to detect while requiring less computational effort. Our method also demonstrates strong generalization capabilities against various object detectors. In summary, IRSA is a refined, efficient, and scalable interactive, iterative, end-to-end algorithm.
引用
收藏
页码:8424 / 8432
页数:9
相关论文
共 50 条
  • [31] Hindsight Reward Shaping in Deep Reinforcement Learning
    de Villiers, Byron
    Sabatta, Deon
    2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 653 - 659
  • [32] Reinforcement learning-based feature learning tor object tracking
    Liu, F
    Su, JB
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 748 - 751
  • [33] Aircraft detection in satellite imagery using deep learning-based object detectors
    Azam, Basim
    Khan, Muhammad Jaleed
    Bhatti, Farrukh Aziz
    Maud, Abdur Rahman M.
    Hussain, Syed Fawad
    Hashmi, Ali Javed
    Khurshid, Khurram
    MICROPROCESSORS AND MICROSYSTEMS, 2022, 94
  • [34] Sparse reward for reinforcement learning-based continuous integration testing
    Yang, Yang
    Li, Zheng
    Shang, Ying
    Li, Qianyu
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2023, 35 (06)
  • [35] Deep Reinforcement Learning-based Image Captioning with Embedding Reward
    Ren, Zhou
    Wang, Xiaoyu
    Zhang, Ning
    Lv, Xutao
    Li, Li-Jia
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1151 - 1159
  • [36] Reinforcement Learning-Based Cooperative Adversarial Algorithm for UAV Cluster
    Li, Yan
    Gao, Yanlong
    Dai, Xunhua
    Nian, Xiaohong
    Wang, Haibo
    Xiong, HongYun
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 1129 - 1138
  • [37] Adversarial robustness of deep reinforcement learning-based intrusion detection
    Merzouk, Mohamed Amine
    Neal, Christopher
    Delas, Josephine
    Yaich, Reda
    Boulahia-Cuppens, Nora
    Cuppens, Frederic
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2024, 23 (06) : 3625 - 3651
  • [38] Adversarial patch attacks against aerial imagery object detectors
    Tang, Guijian
    Jiang, Tingsong
    Zhou, Weien
    Li, Chao
    Yao, Wen
    Zhao, Yong
    NEUROCOMPUTING, 2023, 537 : 128 - 140
  • [39] A new Potential-Based Reward Shaping for Reinforcement Learning Agent
    Badnava, Babak
    Esmaeili, Mona
    Mozayani, Nasser
    Zarkesh-Ha, Payman
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 630 - 635
  • [40] Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning
    Okudo, Takato
    Yamada, Seiji
    IEEE ACCESS, 2021, 9 : 97557 - 97568