Neuroevolutionary reinforcement learning for generalized control of simulated helicopters

被引:22
|
作者
Koppejan, Rogier [1 ]
Whiteson, Shimon [1 ]
机构
[1] Univ Amsterdam, Informat Inst, Sci Pk 904, NL-1098 XH Amsterdam, Netherlands
关键词
Neural networks; Neuroevolution; Reinforcement learning; Helicopter control;
D O I
10.1007/s12065-011-0066-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents an extended case study in the application of neuroevolution to generalized simulated helicopter hovering, an important challenge problem for reinforcement learning. While neuroevolution is well suited to coping with the domain's complex transition dynamics and high-dimensional state and action spaces, the need to explore efficiently and learn on-line poses unusual challenges. We propose and evaluate several methods for three increasingly challenging variations of the task, including the method that won first place in the 2008 Reinforcement Learning Competition. The results demonstrate that (1) neuroevolution can be effective for complex on-line reinforcement learning tasks such as generalized helicopter hovering, (2) neuroevolution excels at finding effective helicopter hovering policies but not at learning helicopter models, (3) due to the difficulty of learning reliable models, model-based approaches to helicopter hovering are feasible only when domain expertise is available to aid the design of a suitable model representation and (4) recent advances in efficient resampling can enable neuroevolution to tackle more aggressively generalized reinforcement learning tasks.
引用
收藏
页码:219 / 241
页数:23
相关论文
共 50 条
  • [31] FLIGHT MODEL FOR UNMANNED SIMULATED HELICOPTERS
    KATZ, A
    BUTLER, BE
    JOURNAL OF AIRCRAFT, 1992, 29 (04): : 521 - 526
  • [32] Neuroevolutionary Compiler Control for Code Optimization
    Heckel, Kade
    PROCEEDINGS OF THE 2023 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2023 COMPANION, 2023, : 2362 - 2365
  • [33] Efficient Opponent Exploitation in No-Limit Texas Hold'em Poker: A Neuroevolutionary Method Combined with Reinforcement Learning
    Xu, Jiahui
    Chen, Jing
    Chen, Shaofei
    ELECTRONICS, 2021, 10 (17)
  • [34] Model-based Constrained Reinforcement Learning using Generalized Control Barrier Function
    Ma, Haitong
    Chen, Jianyu
    Eben, Shengbo
    Lin, Ziyu
    Guan, Yang
    Ren, Yangang
    Zheng, Sifa
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4552 - 4559
  • [35] Deep reinforcement learning method based on DDPG with simulated annealing for satellite attitude control system
    Su, Ruipeng
    Wu, Fengge
    Zhao, Junsuo
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 390 - 395
  • [36] Reinforcement Learning Versus Proportional-Integral-Derivative Control of Hypnosis in a Simulated Intraoperative Patient
    Moore, Brett L.
    Quasny, Todd M.
    Doufas, Anthony G.
    ANESTHESIA AND ANALGESIA, 2011, 112 (02): : 350 - 359
  • [37] Generalized reinforcement learning model of dopamine activity
    Kaveri, Sivaramakrishnan
    Nakahara, Hiroyuki
    NEUROSCIENCE RESEARCH, 2008, 61 : S195 - S195
  • [38] Generalized attention-weighted reinforcement learning
    Bramlage, Lennart
    Cortese, Aurelio
    NEURAL NETWORKS, 2022, 145 : 10 - 21
  • [39] Fast reinforcement learning with generalized policy updates
    Barreto, Andre
    Hou, Shaobo
    Borsa, Diana
    Silver, David
    Precup, Doina
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) : 30079 - 30087
  • [40] An Evaluation Methodology for Interactive Reinforcement Learning with Simulated Users
    Bignold, Adam
    Cruz, Francisco
    Dazeley, Richard
    Vamplew, Peter
    Foale, Cameron
    BIOMIMETICS, 2021, 6 (01) : 1 - 15