Neuroevolutionary reinforcement learning for generalized control of simulated helicopters

被引:22
|
作者
Koppejan, Rogier [1 ]
Whiteson, Shimon [1 ]
机构
[1] Univ Amsterdam, Informat Inst, Sci Pk 904, NL-1098 XH Amsterdam, Netherlands
关键词
Neural networks; Neuroevolution; Reinforcement learning; Helicopter control;
D O I
10.1007/s12065-011-0066-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents an extended case study in the application of neuroevolution to generalized simulated helicopter hovering, an important challenge problem for reinforcement learning. While neuroevolution is well suited to coping with the domain's complex transition dynamics and high-dimensional state and action spaces, the need to explore efficiently and learn on-line poses unusual challenges. We propose and evaluate several methods for three increasingly challenging variations of the task, including the method that won first place in the 2008 Reinforcement Learning Competition. The results demonstrate that (1) neuroevolution can be effective for complex on-line reinforcement learning tasks such as generalized helicopter hovering, (2) neuroevolution excels at finding effective helicopter hovering policies but not at learning helicopter models, (3) due to the difficulty of learning reliable models, model-based approaches to helicopter hovering are feasible only when domain expertise is available to aid the design of a suitable model representation and (4) recent advances in efficient resampling can enable neuroevolution to tackle more aggressively generalized reinforcement learning tasks.
引用
收藏
页码:219 / 241
页数:23
相关论文
共 50 条
  • [41] Occlusion Avoidance in a Simulated Environment Using Reinforcement Learning
    Szemenyei, Marton
    Szanto, Matyas
    APPLIED SCIENCES-BASEL, 2023, 13 (05):
  • [42] A reinforcement learning method based on adaptive simulated annealing
    Atiya, AF
    Parlos, AG
    Ingber, L
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 121 - 124
  • [43] A Simple Platform for Reinforcement Learning of Simulated Flight Behaviors
    Levy, Simon D.
    BIOMIMETIC AND BIOHYBRID SYSTEMS, LIVING MACHINES 2020, 2020, 12413 : 230 - 233
  • [44] Using Deep Reinforcement Learning for Navigation in Simulated Hallways
    Leao, Goncalo
    Almeida, Filipe
    Trigo, Emanuel
    Ferreira, Henrique
    Sousa, Armando
    Reis, Luis Paulo
    2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC, 2023, : 207 - 213
  • [45] Generalized learning automata for multi-agent reinforcement learning
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    AI COMMUNICATIONS, 2010, 23 (04) : 311 - 324
  • [46] Neuroevolutionary representations for learning heterogeneous treatment effects
    Burkhart, Michael C.
    Ruiz, Gabriel
    JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 71
  • [47] Premium control with reinforcement learning
    Palmborg, Lina
    Lindskog, Filip
    ASTIN BULLETIN-THE JOURNAL OF THE INTERNATIONAL ACTUARIAL ASSOCIATION, 2023, : 233 - 257
  • [48] Interpretable Control by Reinforcement Learning
    Hein, Daniel
    Limmer, Steffen
    Runkler, Thomas A.
    IFAC PAPERSONLINE, 2020, 53 (02): : 8082 - 8089
  • [49] Reinforcement learning for structural control
    Adam, Bernard
    Smith, Ian F. C.
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2008, 22 (02) : 133 - 139
  • [50] Reinforcement learning for robot control
    Smart, WD
    Kaelbling, LP
    MOBILE ROBOTS XVI, 2002, 4573 : 92 - 103