Blind Hexapod Locomotion in Complex Terrain with Gait Adaptation Using Deep Reinforcement Learning and Classification

被引:18
|
作者
Azayev, Teymur [1 ]
Zimmerman, Karel [2 ]
机构
[1] CVUT FEL, E227,Karlovo 13, Prague 2, Czech Republic
[2] CVUT FEL, E226,Karlovo 13, Prague 2, Czech Republic
关键词
Deep; reinforcement; learning; locomotion; hexapod; neural; ROBOT;
D O I
10.1007/s10846-020-01162-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a scalable two-level architecture for Hexapod locomotion through complex terrain without the use of exteroceptive sensors. Our approach assumes that the target complex terrain can be modeled by N discrete terrain distributions which capture individual difficulties of the target terrain. Expert policies (physical locomotion controllers) modeled by Artificial Neural Networks are trained independently in these individual scenarios using Deep Reinforcement Learning. These policies are then autonomously multiplexed during inference using a Recurrent Neural Network terrain classifier conditioned on the state history, giving an adaptive gait appropriate for the current terrain. We perform several tests to assess policy robustness by changing various parameters, such as contact, friction and actuator properties. We also show experiments of goal-based positional control of such a system and a way of selecting several gait criteria during deployment, giving us a complete solution for blind Hexapod locomotion in a practical setting. The Hexapod platform and all our experiments are modeled in the MuJoCo [1] physics simulator. Demonstrations are available in the supplementary video.
引用
收藏
页码:659 / 671
页数:13
相关论文
共 50 条
  • [31] Classification with Costly Features Using Deep Reinforcement Learning
    Janisch, Jaromir
    Pevny, Tomas
    Lisy, Viliam
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3959 - 3966
  • [32] The Effects of Using a Greedy Factor in Hexapod Gait Learning
    Parker, Gary B.
    Tarimo, William T.
    2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 1509 - 1514
  • [33] Adaptive Gait Generation for Hexapod Robots Based on Reinforcement Learning and Hierarchical Framework
    Qiu, Zhiying
    Wei, Wu
    Liu, Xiongding
    ACTUATORS, 2023, 12 (02)
  • [34] Learning Gait-conditioned Bipedal Locomotion with Motor Adaptation
    Wei, Wandi
    Wang, Zhicheng
    Xie, Anhuan
    Wu, Jun
    Xiong, Rong
    Zhu, Qiuguo
    2023 IEEE-RAS 22ND INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, HUMANOIDS, 2023,
  • [35] Hierarchical Gait Generation for Modular Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    2021 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2021,
  • [36] Automated Gait Generation for Simulated Bodies using Deep Reinforcement Learning
    Ananthakrishnan, Abhishek
    Kanakiya, Vatsal
    Ved, Dipen
    Sharma, Grishma
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 90 - 95
  • [37] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Wei, Lang
    Li, Yunxiang
    Ai, Yunfei
    Wu, Yuze
    Xu, Hao
    Wang, Wei
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
  • [38] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Lang Wei
    Yunxiang Li
    Yunfei Ai
    Yuze Wu
    Hao Xu
    Wei Wang
    Guoming Hu
    International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613
  • [39] Robot Navigation of Environments with Unknown Rough Terrain Using Deep Reinforcement Learning
    Zhang, Kaicheng
    Niroui, Farzad
    Ficocelli, Maurizio
    Nejat, Goldie
    2018 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2018,
  • [40] Stabilization of vertical motion of a vehicle on bumpy terrain using deep reinforcement learning
    Salvi, Ameya
    Coleman, John
    Buzhardt, Jake
    Krovi, Venkat
    Tallapragada, Phanindra
    IFAC PAPERSONLINE, 2022, 55 (37): : 276 - 281