Blind Hexapod Locomotion in Complex Terrain with Gait Adaptation Using Deep Reinforcement Learning and Classification

被引：18

作者：

Azayev, Teymur ^{[1
]}

Zimmerman, Karel ^{[2
]}

机构：

[1] CVUT FEL, E227,Karlovo 13, Prague 2, Czech Republic

[2] CVUT FEL, E226,Karlovo 13, Prague 2, Czech Republic

来源：

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2020年 / 99卷 / 3-4期

关键词：

Deep; reinforcement; learning; locomotion; hexapod; neural; ROBOT;

D O I：

10.1007/s10846-020-01162-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a scalable two-level architecture for Hexapod locomotion through complex terrain without the use of exteroceptive sensors. Our approach assumes that the target complex terrain can be modeled by N discrete terrain distributions which capture individual difficulties of the target terrain. Expert policies (physical locomotion controllers) modeled by Artificial Neural Networks are trained independently in these individual scenarios using Deep Reinforcement Learning. These policies are then autonomously multiplexed during inference using a Recurrent Neural Network terrain classifier conditioned on the state history, giving an adaptive gait appropriate for the current terrain. We perform several tests to assess policy robustness by changing various parameters, such as contact, friction and actuator properties. We also show experiments of goal-based positional control of such a system and a way of selecting several gait criteria during deployment, giving us a complete solution for blind Hexapod locomotion in a practical setting. The Hexapod platform and all our experiments are modeled in the MuJoCo [1] physics simulator. Demonstrations are available in the supplementary video.

引用

页码：659 / 671

页数：13

共 50 条

[31] Classification with Costly Features Using Deep Reinforcement Learning
Janisch, Jaromir
Pevny, Tomas
Lisy, Viliam
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3959 - 3966
[32] The Effects of Using a Greedy Factor in Hexapod Gait Learning
Parker, Gary B.
Tarimo, William T.
2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 1509 - 1514
[33] Adaptive Gait Generation for Hexapod Robots Based on Reinforcement Learning and Hierarchical Framework
Qiu, Zhiying
Wei, Wu
Liu, Xiongding
ACTUATORS, 2023, 12 (02)
[34] Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation
Wei, Wandi
Wang, Zhicheng
Xie, Anhuan
Wu, Jun
Xiong, Rong
Zhu, Qiuguo
2023 IEEE-RAS 22ND INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, HUMANOIDS, 2023,
[35] Hierarchical Gait Generation for Modular Robots Using Deep Reinforcement Learning
Wang, Jiayu
Hu, Chuxiong
Zhu, Yu
2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
[36] Automated Gait Generation for Simulated Bodies using Deep Reinforcement Learning
Ananthakrishnan, Abhishek
Kanakiya, Vatsal
Ved, Dipen
Sharma, Grishma
PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 90 - 95
[37] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Wei, Lang
Li, Yunxiang
Ai, Yunfei
Wu, Yuze
Xu, Hao
Wang, Wei
INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
[38] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Lang Wei
Yunxiang Li
Yunfei Ai
Yuze Wu
Hao Xu
Wei Wang
Guoming Hu
International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613
[39] Robot Navigation of Environments with Unknown Rough Terrain Using Deep Reinforcement Learning
Zhang, Kaicheng
Niroui, Farzad
Ficocelli, Maurizio
Nejat, Goldie
2018 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2018,
[40] Stabilization of vertical motion of a vehicle on bumpy terrain using deep reinforcement learning
Salvi, Ameya
Coleman, John
Buzhardt, Jake
Krovi, Venkat
Tallapragada, Phanindra
IFAC PAPERSONLINE, 2022, 55 (37): : 276 - 281

← 1 2 3 4 5 →