Neuroevolutionary reinforcement learning for generalized control of simulated helicopters

被引:22
|
作者
Koppejan, Rogier [1 ]
Whiteson, Shimon [1 ]
机构
[1] Univ Amsterdam, Informat Inst, Sci Pk 904, NL-1098 XH Amsterdam, Netherlands
关键词
Neural networks; Neuroevolution; Reinforcement learning; Helicopter control;
D O I
10.1007/s12065-011-0066-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents an extended case study in the application of neuroevolution to generalized simulated helicopter hovering, an important challenge problem for reinforcement learning. While neuroevolution is well suited to coping with the domain's complex transition dynamics and high-dimensional state and action spaces, the need to explore efficiently and learn on-line poses unusual challenges. We propose and evaluate several methods for three increasingly challenging variations of the task, including the method that won first place in the 2008 Reinforcement Learning Competition. The results demonstrate that (1) neuroevolution can be effective for complex on-line reinforcement learning tasks such as generalized helicopter hovering, (2) neuroevolution excels at finding effective helicopter hovering policies but not at learning helicopter models, (3) due to the difficulty of learning reliable models, model-based approaches to helicopter hovering are feasible only when domain expertise is available to aid the design of a suitable model representation and (4) recent advances in efficient resampling can enable neuroevolution to tackle more aggressively generalized reinforcement learning tasks.
引用
收藏
页码:219 / 241
页数:23
相关论文
共 50 条
  • [1] Model-free LQ Control for Unmanned Helicopters using Reinforcement Learning
    Lee, Dong Jin
    Bang, Hyochoong
    2011 11TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2011, : 117 - 120
  • [2] System Identification Based on Generalized Orthonormal Basis Function for Unmanned Helicopters: A Reinforcement Learning Approach
    Liu, Zun
    Li, Jianqiang
    Wang, Cheng
    Yu, Richard
    Chen, Jie
    He, Ying
    Sun, Changyin
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1135 - 1145
  • [3] Generalized active control of vibrations in helicopters
    Bittanti, S
    Cuzzola, FA
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2002, 25 (02) : 340 - 351
  • [4] Novelty Search for Neuroevolutionary Reinforcement Learning of Deceptive Systems: An Application to Control of Colloidal Self-assembly
    O'Leary, Jared
    Khare, Mira M.
    Mesbah, Ali
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 2776 - 2781
  • [5] Neuroevolutionary diversity policy search for multi-objective reinforcement learning
    Zhou, Dan
    Du, Jiqing
    Arai, Sachiyo
    INFORMATION SCIENCES, 2024, 657
  • [6] A Generalized Path Integral Control Approach to Reinforcement Learning
    Theodorou, Evangelos A.
    Buchli, Jonas
    Schaal, Stefan
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 3137 - 3181
  • [7] A generalized path integral control approach to reinforcement learning
    Theodorou, Evangelos A.
    Buchli, Jonas
    Schaal, Stefan
    Journal of Machine Learning Research, 2010, 11 : 3137 - 3181
  • [8] Generalized reinforcement learning fuzzy control with vague states
    Zarandi, Mohammad Hossein Fazel
    Jouzdani, Javid
    Turksen, Ismail Burhan
    ANALYSIS AND DESIGN OF INTELLIGENT SYSTEMS USING SOFT COMPUTING TECHNIQUES, 2007, 41 : 811 - +
  • [9] Plume:Lightweight and Generalized Congestion Control with Deep Reinforcement Learning
    Dehui Wei
    Jiao Zhang
    Xuan Zhang
    Chengyuan Huang
    ChinaCommunications, 2022, 19 (12) : 101 - 117
  • [10] Generalized reinforcement learning for building control using Behavioral Cloning
    Lee, Zachary E.
    Zhang, K. Max
    APPLIED ENERGY, 2021, 304