Using Reinforcement Learning to Escape Automatic Filter-based Adversarial Example Defense

被引:0
|
作者
Li, Yantao [1 ]
Dan, Kaijian [1 ]
Lei, Xinyu [2 ]
Qin, Huafeng [3 ]
Deng, Shaojiang [1 ]
Zhou, Gang [4 ]
机构
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[2] Michigan Technol Univ, Dept Comp Sci, Houghton, MI USA
[3] Chongqing Technol & Business Univ, Sch Comp Sci & Informat Engn, Chongqing, Peoples R China
[4] William & Mary, Comp Sci Dept, Williamsburg, VA USA
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Adversarial examples; image classification; reinforcement learning; filter;
D O I
10.1145/3688847
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural networks can be easily fooled by the adversarial example, which is a specially crafted example with subtle and intentional perturbations. A plethora of papers have proposed to use filters to effectively defend against adversarial example attacks. However, we demonstrate that the automatic filter-based defenses may not be reliable. In this article, we present URL2AED, Using a Reinforcement Learning scheme TO escape the automatic filter-based Adversarial Example Defenses. Specifically, URL2AED uses a specially crafted policy gradient reinforcement learning (RL) algorithm to generate adversarial examples (AEs) that can escape automatic filter-based AE defenses. In particular, we properly design reward functions in policy-gradient RL for targeted attacks and non-targeted attacks, respectively. Furthermore, we customize training algorithms to reduce the possible action space in policy-gradient RL to accelerate URL2AED training while still ensuring that URL2AED generates successful AEs. To demonstrate the performance of the proposed URL2AED, we conduct extensive experiments on three public datasets in terms of different perturbation degrees of parameter, different filter parameters, transferability, and time consumption. The experimental results show that URL2AED achieves high attack success rates for automatic filter-based defenses and good cross-model transferability.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Escaping Filter-based Adversarial Example Defense: A Reinforcement Learning Approach
    Dan, Kaijian
    Lei, Xinyu
    Li, Yantao
    Qin, Huafeng
    Deng, Shaojiang
    Zhou, Gang
    2022 IEEE 19TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SMART SYSTEMS (MASS 2022), 2022, : 272 - 280
  • [2] Text Adversarial Examples Generation and Defense Based on Reinforcement Learning
    Li, Yue
    Xu, Pengjian
    Ruan, Qing
    Xu, Wusheng
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2021, 28 (04): : 1306 - 1314
  • [3] XSS adversarial example attacks based on deep reinforcement learning
    Chen, Li
    Tang, Cong
    He, Junjiang
    Zhao, Hui
    Lan, Xiaolong
    Li, Tao
    COMPUTERS & SECURITY, 2022, 120
  • [4] Adversarial Example Defense Based on the Supervision
    Yao, Ziyu
    Gao, Jiaquan
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] A Particle Filter-Based Reinforcement Learning Approach for Reliable Wireless Indoor Positioning
    Villacres, Jose Luis Carrera
    Zhao, Zhongliang
    Braun, Torsten
    Li, Zan
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) : 2457 - 2473
  • [6] Instance-based defense against adversarial attacks in Deep Reinforcement Learning
    Garcia, Javier
    Sagredo, Ismael
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [7] Deep Reinforcement Learning-Based Adversarial Defense in Vehicular Communication Systems
    Sedar, Roshan
    Kalalas, Charalampos
    Vazquez-Gallego, Francisco
    Alonso-Zarate, Jesus
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024, : 5250 - 5256
  • [8] Adversarial example defense based on image reconstruction
    Zhang, Yu
    Xu, Huan
    Pei, Chengfei
    Yang, Gaoming
    PEERJ COMPUTER SCIENCE, 2021, 7
  • [9] Kalman Filter-based Cycle-Consistent Adversarial Learning for Time Series Anomaly Detection
    Liu, Shu-Tang
    Fan, Ji-Jie
    Wang, Rui-Dong
    Han, Han
    Zhang, De-Yang
    Journal of Network Intelligence, 2024, 9 (02): : 790 - 803
  • [10] Orthogonal Filter-Based Networks for Learning
    Sienko, Wieslaw
    Citko, Wieslaw
    ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 873 - +