Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

Cited by: 34
Authors
Xiang, Guofei [1]
Su, Jianbo [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China
Keywords
Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;
DOI
10.1109/TCYB.2019.2949596
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.
Pages: 1056-1069
Page count: 14
Related papers
50 in total
  • [31] Multi-Receiver Task-Oriented Communications via Multi-Task Deep Learning
    Sagduyu, Yalin E.
    Erpek, Tugba
    Yener, Aylin
    Ulukus, Sennur
    2023 IEEE FUTURE NETWORKS WORLD FORUM, FNWF, 2024,
  • [32] Task-Oriented Adaptive Position/Force Control for Robotic Systems Under Hybrid Constraints
    Ding, Shuai
    Peng, Jinzhu
    Xin, Jianbin
    Zhang, Hui
    Wang, Yaonan
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 12612 - 12622
  • [33] Transfer Learning based Task-oriented Dialogue Policy for Multiple Domains using Hierarchical Reinforcement Learning
    Saha, Tulika
    Saha, Sriparna
    Bhattacharyya, Pushpak
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [34] Task-oriented developmental learning for humanoid robots
    Tan, KC
    Chen, YJ
    Tan, KK
    Lee, TH
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2005, 52 (03) : 906 - 914
  • [35] Continual Learning in Task-Oriented Dialogue Systems
    Madotto, Andrea
    Lin, Zhaojiang
    Zhou, Zhenpeng
    Moon, Seungwhan
    Crook, Paul
    Liu, Bing
    Yu, Zhou
    Cho, Eunjoon
    Fung, Pascale
    Wang, Zhiguang
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7452 - 7467
  • [36] Painless and accurate medical image analysis using deep reinforcement learning with task-oriented homogenized automatic pre-processing
    Yuan, Di
    Liu, Yunxin
    Xu, Zhenghua
    Zhan, Yuefu
    Chen, Junyang
    Lukasiewicz, Thomas
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [37] Learning Folksonomies from Task-Oriented Dialogues
    Puppi Wanderley, Gregory Moro
    Paraiso, Emerson Cabrera
    30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 360 - 367
  • [38] A novel task-oriented framework for dual-arm robotic assembly task
    Zhengwei WANG
    Yahui GAN
    Xianzhong DAI
    Frontiers of Mechanical Engineering, 2021, (03) : 528 - 545
  • [39] A novel task-oriented framework for dual-arm robotic assembly task
    Zhengwei Wang
    Yahui Gan
    Xianzhong Dai
    Frontiers of Mechanical Engineering, 2021, 16 : 528 - 545
  • [40] A novel task-oriented framework for dual-arm robotic assembly task
    Wang, Zhengwei
    Gan, Yahui
    Dai, Xianzhong
    FRONTIERS OF MECHANICAL ENGINEERING, 2021, 16 (03) : 528 - 545