Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

Cited by: 34
Authors
Xiang, Guofei [1]
Su, Jianbo [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China
Keywords
Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;
DOI
10.1109/TCYB.2019.2949596
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.
Pages: 1056-1069
Page count: 14
Related papers
50 items in total
  • [41] Task-oriented Resource Allocation for Mobile Edge Computing with Multi-Agent Reinforcement Learning
    Zou, Yue
    Shen, Fei
    Yan, Feng
    Tang, Liang
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [42] Learning by doing - an approach to robotic skill acquisition
    Nguyen, MC
    Graefe, V
    SICE 2001: PROCEEDINGS OF THE 40TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 2001, : 226 - 229
  • [43] Assembly skill acquisition via reinforcement learning
    Lau, HYK
    Lee, ISK
    ASSEMBLY AUTOMATION, 2001, 21 (02) : 136 - 142
  • [44] A task-oriented access control model for WfMS
    Liao, X
    Zhang, L
    Chan, SCF
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, 2005, 3439 : 168 - 177
  • [45] Personality-aware Natural Language Generation for Task-oriented Dialogue using Reinforcement Learning
    Guo, Ao
    Ohashi, Atsumoto
    Chiba, Yuya
    Tsunomori, Yuiko
    Hirai, Ryu
    Higashinaka, Ryuichiro
    2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 1823 - 1828
  • [46] Adaptive Skill Acquisition in Hierarchical Reinforcement Learning
    Holas, Juraj
    Farkas, Igor
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 383 - 394
  • [47] Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning
    Zhao, Yangyang
    Yin, Kai
    Wang, Zhenyu
    Dastani, Mehdi
    Wang, Shihan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1380 - 1391
  • [48] Task-oriented human-robot interaction control of a robotic glove utilizing forearm electromyography
    Wang, Xianhe
    Zhang, Haotian
    Teng, Long
    Tang, Chak Yin
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (16): : 11351 - 11370
  • [49] RFAC Based Task-Oriented Active Sharing Control for a Class of Robotic Rehabilitation Training Systems
    Meng, Fancheng
    Yang, Shuo
    Li, Yafeng
    Wang, Jinlei
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5471 - 5475
  • [50] On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
    Lim, Sangsoo
    Lee, Sangseon
    Piao, Yinhua
    Choi, MinGyu
    Bang, Dongmin
    Gu, Jeonghyeon
    Kim, Sun
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 4288 - 4304