reinforcement learning, autonomous agents, neural networks

被引:0
|
作者
Parker-Holder, Jack [1 ]
Rajan, Raghu [2 ]
Song, Xingyou [3 ]
Biedenkapp, Andre [2 ]
Miao, Yingjie [3 ]
Eimer, Theresa [4 ]
Zhang, Baohe [2 ]
Nguyen, Vu [5 ]
Calandra, Roberto [6 ]
Faust, Aleksandra [2 ,7 ]
Hutter, Frank [3 ]
Lindauer, Marius [4 ]
机构
[1] Univ Oxford, Oxford, England
[2] Univ Freiburg, Freiburg, Germany
[3] Google Res, Brain Team, Mountain View, CA USA
[4] Leibniz Univ Hannover, Hannover, Germany
[5] Amazon, Sydney, NSW, Australia
[6] Meta AI, New York, NY USA
[7] Bosch Ctr Artificial Intelligence, Freiburg, Germany
关键词
reinforcement learning; autonomous agents; neural networks; BAYESIAN OPTIMIZATION; LEVEL; NAVIGATION; EVOLUTION; ATARI; GAME; GO;
D O I
10.1613/jair.1.13596
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path towards generally capable agents. However, the success of RL agents is often highly sensitive to design choices in the training process, which may require tedious and error-prone manual tuning. This makes it challenging to use RL for new problems and also limits its full potential. In many other areas of machine learning, AutoML has shown that it is possible to automate such design choices, and AutoML has also yielded promising initial results when applied to RL. However, Automated Reinforcement Learning (AutoRL) involves not only standard applications of AutoML but also includes additional challenges unique to RL, that naturally produce a different set of methods. As such, AutoRL has been emerging as an important area of research in RL, providing promise in a variety of applications from RNA design to playing games, such as Go. Given the diversity of methods and environments considered in RL, much of the research has been conducted in distinct subfields, ranging from meta-learning to evolution. In this survey, we seek to unify the field of AutoRL, provide a common taxonomy, discuss each area in detail and pose open problems of interest to researchers going forward.
引用
收藏
页码:517 / 568
页数:52
相关论文
共 50 条
  • [1] Neural inverse reinforcement learning in autonomous navigation
    Xia, Chen
    El Kamel, Abdelkader
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 84 : 1 - 14
  • [2] Cyber Resilience Using Autonomous Agents and Reinforcement Learning
    Cam, Hasan
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS II, 2020, 11413
  • [3] ON THE DEVELOPMENT OF AUTONOMOUS AGENTS USING DEEP REINFORCEMENT LEARNING
    Barbu, Clara
    Mocanu, Stefan Alexandru
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2021, 83 (03): : 97 - 116
  • [4] On the development of autonomous agents using deep reinforcement learning
    Barbu, Clara
    Mocanu, Ștefan Alexandru
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2021, 83 (03): : 97 - 116
  • [5] Adversarial Manipulation of Reinforcement Learning Policies in Autonomous Agents
    Huang, Yonghong
    Wang, Shih-han
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [6] Reinforcement Learning with Neural Networks: A Survey
    Modi, Bhumika
    Jethva, H. B.
    PROCEEDINGS OF FIRST INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR INTELLIGENT SYSTEMS: VOL 1, 2016, 50 : 467 - 475
  • [7] Global reinforcement learning in neural networks
    Ma, Xiaolong
    Likharev, Konstantin K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (02): : 573 - 577
  • [8] Optimising reinforcement learning for neural networks
    Hurwitz, E
    Marwala, T
    GAME-ON 2005: 6th International Conference on Intelligent Games and Simulation, 2005, : 13 - 18
  • [9] Optimizing the Neural Architecture of Reinforcement Learning Agents
    Mazyavkina, N.
    Moustafa, S.
    Trofimov, I
    Burnaev, E.
    INTELLIGENT COMPUTING, VOL 2, 2021, 284 : 591 - 606
  • [10] Discovering neural policies to drive behaviour by integrating deep reinforcement learning agents with biological neural networks
    Li, Chenguang
    Kreiman, Gabriel
    Ramanathan, Sharad
    NATURE MACHINE INTELLIGENCE, 2024, 6 (06) : 726 - 738