Not Only Rewards but Also Constraints: Applications on Legged Robot Locomotion

被引：1

作者：

Kim, Yunho ^{[1
]}

Oh, Hyunsik ^{[1
]}

Lee, Jeonghyun ^{[1
]}

Choi, Jinhyeok ^{[1
]}

Ji, Gwanghyeon ^{[1
]}

Jung, Moonkyu ^{[1
]}

Youm, Donghoon ^{[1
]}

Hwangbo, Jemin ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Robot & Artificial Intelligence Lab, Daejeon 34141, South Korea

来源：

IEEE TRANSACTIONS ON ROBOTICS | 2024年 / 40卷

关键词：

Robots; Legged locomotion; Reinforcement learning; Optimization; Neural networks; Quadrupedal robots; Training; Constrained reinforcement learning (RL); legged locomotion; RL; REINFORCEMENT; POLICY;

D O I：

10.1109/TRO.2024.3400935

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

Several earlier studies have shown impressive control performance in complex robotic systems by designing the controller using a neural network and training it with model-free reinforcement learning. However, these outstanding controllers with natural motion style and high task performance are developed through extensive reward engineering, which is a highly laborious and time-consuming process of designing numerous reward terms and determining suitable reward coefficients. In this article, we propose a novel reinforcement learning framework for training neural network controllers for complex robotic systems consisting of both rewards and constraints. To let the engineers appropriately reflect their intent to constraints and handle them with minimal computation overhead, two constraint types and an efficient policy optimization algorithm are suggested. The learning framework is applied to train locomotion controllers for several legged robots with different morphology and physical attributes to traverse challenging terrains. Extensive simulation and real-world experiments demonstrate that performant controllers can be trained with significantly less reward engineering, by tuning only a single reward coefficient. Furthermore, a more straightforward and intuitive engineering process can be utilized, thanks to the interpretability and generalizability of constraints.

引用

页码：2984 / 3003

页数：20

共 50 条

[31] Next generation legged robot locomotion: A review on control techniques
Kotha, Swapnil Saha
Akter, Nipa
Abhi, Sarafat Hussain
Das, Sajal Kumar
Islam, Md. Robiul
Ali, Md. Firoj
Ahamed, Md. Hafiz
Islam, Md. Manirul
Sarker, Subrata Kumar
Badal, Md. Faisal Rahman
Das, Prangon
Tasneem, Zinat
Hasan, Md. Mehedi
HELIYON, 2024, 10 (18)
[32] Three Dimensional Locomotion Control of Single-Legged Robot
Kang, Tae Hun
Moon, Jean Il
2013 13TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2013), 2013, : 1117 - 1122
[33] A biomimetic fruit fly robot for studying the neuromechanics of legged locomotion
Goldsmith, Clarus A.
Haustein, Moritz
Bueschges, Ansgar
Szczecinski, Nicholas S.
BIOINSPIRATION & BIOMIMETICS, 2024, 19 (06)
[34] Learning Feasibility Constraints for Multi-contact Locomotion of Legged Robots
Carpentier, Justin
Budhiraja, Rohan
Mansard, Nicolas
ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
[35] Walking locomotion of a cable-driven soft-legged robot
Tang, Bin
Chen, Tao
Zhang, Ping
Li, Shuaiqi
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2023, 237 (09) : 2163 - 2170
[36] Legged locomotion of a bio-inspired lightweight robot on granular media
Qian, F.
Zhang, T.
Li, C.
Shen, J.
Hoover, A. M.
Birkmeyer, P.
Pullin, A.
Fearing, R. S.
Goldman, D., I
Masarati, P.
INTEGRATIVE AND COMPARATIVE BIOLOGY, 2012, 52 : E143 - E143
[37] A Motion Planning Approach for Nonprehensile Manipulation and Locomotion Tasks of a Legged Robot
Zhang, Guoteng
Ma, Shugen
Shen, Yayi
Li, Yibin
IEEE TRANSACTIONS ON ROBOTICS, 2020, 36 (03) : 855 - 874
[38] Motion Planning for Agile Legged Locomotion using Failure Margin Constraints
Green, Kevin
Warila, John
Hatton, Ross L.
Hurst, Jonathan
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10350 - 10355
[39] Self-organized adaptive legged locomotion in a compliant quadruped robot
Buchli, Jonas
Ijspeert, Auke Jan
AUTONOMOUS ROBOTS, 2008, 25 (04) : 331 - 347
[40] Control of Thruster-Assisted, Bipedal Legged Locomotion of the Harpy Robot
Dangol, Pravin
Sihite, Eric
Ramezani, Alireza
FRONTIERS IN ROBOTICS AND AI, 2021, 8

← 1 2 3 4 5 →