Environment Adversarial Reinforcement Learning

被引:0
|
作者
Cooper, John R. [1 ]
机构
[1] NASA, Langley Res Ctr, Autonomous Integrated Syst Res Branch, Hampton, VA 23681 USA
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a training method for increasing performance of reinforcement learning agents. The method is named Environment Adversarial Reinforcement Learning. The method requires the reinforcement learning environment to be parameterizable. Over the course of training, environment parameters are updated in a direction of increasing difficulty for the agent. The direction for these updates is found using a performance prediction network trained on data from tests of the agent under varying environment parameters. The method was tested on a CartPole environment. A 28-58% improvement in mean return (sum of rewards in an episode) was found when comparing performance to a baseline reinforcement learning algorithm on both easy and hard versions of the task.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Robust Safe Reinforcement Learning under Adversarial Disturbances
    Li, Zeyang
    Hu, Chuxiong
    Li, Shengbo Eben
    Cheng, Jia
    Wang, Yunan
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 334 - 341
  • [42] Attacking Deep Reinforcement Learning With Decoupled Adversarial Policy
    Mo, Kanghua
    Tang, Weixuan
    Li, Jin
    Yuan, Xu
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (01) : 758 - 768
  • [43] Robust Market Making via Adversarial Reinforcement Learning
    Spooner, Thomas
    Savani, Rahul
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4590 - 4596
  • [44] Domain Adversarial Reinforcement Learning for Partial Domain Adaptation
    Chen, Jin
    Wu, Xinxiao
    Duan, Lixin
    Gao, Shenghua
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (02) : 539 - 553
  • [45] AESMOTE: Adversarial Reinforcement Learning With SMOTE for Anomaly Detection
    Ma, Xiangyu
    Shi, Wei
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 943 - 956
  • [46] Infrared Adversarial Patch Generation Based on Reinforcement Learning
    Zhou, Shuangju
    Li, Yang
    Tan, Wenyi
    Zhao, Chenxing
    Zhou, Xin
    Pan, Quan
    MATHEMATICS, 2024, 12 (21)
  • [47] Adversarial Proximal Policy Optimisation for Robust Reinforcement Learning
    Ince, Bilkan
    Shin, Hyo-Sang
    Tsourdos, Antonios
    AIAA SCITECH 2024 FORUM, 2024,
  • [48] An adversarial reinforcement learning based system for cyber security
    Xia, Song
    Qiu, Meikang
    Jiang, Hao
    4TH IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2019) / 3RD INTERNATIONAL SYMPOSIUM ON REINFORCEMENT LEARNING (ISRL 2019), 2019, : 227 - 230
  • [49] Understanding adversarial attacks on observations in deep reinforcement learning
    You QIAOBEN
    Chengyang YING
    Xinning ZHOU
    Hang SU
    Jun ZHU
    Bo ZHANG
    ScienceChina(InformationSciences), 2024, 67 (05) : 69 - 83
  • [50] Challenges and Countermeasures for Adversarial Attacks on Deep Reinforcement Learning
    Ilahi I.
    Usama M.
    Qadir J.
    Janjua M.U.
    Al-Fuqaha A.
    Hoang D.T.
    Niyato D.
    IEEE Transactions on Artificial Intelligence, 2022, 3 (02): : 90 - 109