Not Only Rewards but Also Constraints: Applications on Legged Robot Locomotion

被引:1
|
作者
Kim, Yunho [1 ]
Oh, Hyunsik [1 ]
Lee, Jeonghyun [1 ]
Choi, Jinhyeok [1 ]
Ji, Gwanghyeon [1 ]
Jung, Moonkyu [1 ]
Youm, Donghoon [1 ]
Hwangbo, Jemin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Robot & Artificial Intelligence Lab, Daejeon 34141, South Korea
关键词
Robots; Legged locomotion; Reinforcement learning; Optimization; Neural networks; Quadrupedal robots; Training; Constrained reinforcement learning (RL); legged locomotion; RL; REINFORCEMENT; POLICY;
D O I
10.1109/TRO.2024.3400935
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Several earlier studies have shown impressive control performance in complex robotic systems by designing the controller using a neural network and training it with model-free reinforcement learning. However, these outstanding controllers with natural motion style and high task performance are developed through extensive reward engineering, which is a highly laborious and time-consuming process of designing numerous reward terms and determining suitable reward coefficients. In this article, we propose a novel reinforcement learning framework for training neural network controllers for complex robotic systems consisting of both rewards and constraints. To let the engineers appropriately reflect their intent to constraints and handle them with minimal computation overhead, two constraint types and an efficient policy optimization algorithm are suggested. The learning framework is applied to train locomotion controllers for several legged robots with different morphology and physical attributes to traverse challenging terrains. Extensive simulation and real-world experiments demonstrate that performant controllers can be trained with significantly less reward engineering, by tuning only a single reward coefficient. Furthermore, a more straightforward and intuitive engineering process can be utilized, thanks to the interpretability and generalizability of constraints.
引用
收藏
页码:2984 / 3003
页数:20
相关论文
共 50 条
  • [31] Next generation legged robot locomotion: A review on control techniques
    Kotha, Swapnil Saha
    Akter, Nipa
    Abhi, Sarafat Hussain
    Das, Sajal Kumar
    Islam, Md. Robiul
    Ali, Md. Firoj
    Ahamed, Md. Hafiz
    Islam, Md. Manirul
    Sarker, Subrata Kumar
    Badal, Md. Faisal Rahman
    Das, Prangon
    Tasneem, Zinat
    Hasan, Md. Mehedi
    HELIYON, 2024, 10 (18)
  • [32] Three Dimensional Locomotion Control of Single-Legged Robot
    Kang, Tae Hun
    Moon, Jean Il
    2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013), 2013, : 1117 - 1122
  • [33] A biomimetic fruit fly robot for studying the neuromechanics of legged locomotion
    Goldsmith, Clarus A.
    Haustein, Moritz
    Bueschges, Ansgar
    Szczecinski, Nicholas S.
    BIOINSPIRATION & BIOMIMETICS, 2024, 19 (06)
  • [34] Learning Feasibility Constraints for Multi-contact Locomotion of Legged Robots
    Carpentier, Justin
    Budhiraja, Rohan
    Mansard, Nicolas
    ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
  • [35] Walking locomotion of a cable-driven soft-legged robot
    Tang, Bin
    Chen, Tao
    Zhang, Ping
    Li, Shuaiqi
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2023, 237 (09) : 2163 - 2170
  • [36] Legged locomotion of a bio-inspired lightweight robot on granular media
    Qian, F.
    Zhang, T.
    Li, C.
    Shen, J.
    Hoover, A. M.
    Birkmeyer, P.
    Pullin, A.
    Fearing, R. S.
    Goldman, D., I
    Masarati, P.
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2012, 52 : E143 - E143
  • [37] A Motion Planning Approach for Nonprehensile Manipulation and Locomotion Tasks of a Legged Robot
    Zhang, Guoteng
    Ma, Shugen
    Shen, Yayi
    Li, Yibin
    IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (03) : 855 - 874
  • [38] Motion Planning for Agile Legged Locomotion using Failure Margin Constraints
    Green, Kevin
    Warila, John
    Hatton, Ross L.
    Hurst, Jonathan
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10350 - 10355
  • [39] Self-organized adaptive legged locomotion in a compliant quadruped robot
    Buchli, Jonas
    Ijspeert, Auke Jan
    AUTONOMOUS ROBOTS, 2008, 25 (04) : 331 - 347
  • [40] Control of Thruster-Assisted, Bipedal Legged Locomotion of the Harpy Robot
    Dangol, Pravin
    Sihite, Eric
    Ramezani, Alireza
    FRONTIERS IN ROBOTICS AND AI, 2021, 8