Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

被引:34
|
作者
Xiang, Guofei [1 ]
Su, Jianbo [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China
关键词
Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;
D O I
10.1109/TCYB.2019.2949596
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.
引用
收藏
页码:1056 / 1069
页数:14
相关论文
共 50 条
  • [21] A Task-oriented Service Personalization Scheme for Smart Environments Using Reinforcement Learning
    Tegelund, Bjorn
    Son, Heesuk
    Lee, Dongman
    2016 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATION WORKSHOPS (PERCOM WORKSHOPS), 2016,
  • [22] Using Reinforcement Learning for Dialogue Act Classification in Task-oriented Conversation Systems
    Xia, Qingyang
    2018 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2018), 2018, : 187 - 196
  • [23] An empirical assessment of deep learning approaches to task-oriented dialog management
    Mateju, Lukas
    Griol, David
    Callejas, Zoraida
    Molina, Jose Manuel
    Sanchis, Araceli
    NEUROCOMPUTING, 2021, 439 : 327 - 339
  • [24] Learning to Model Task-Oriented Attention
    Zou, Xiaochun
    Zhao, Xinbo
    Wang, Jian
    Yang, Yongjia
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016 : 1 - 12
  • [25] Deep Reinforcement Learning of Robotic Precision Insertion Skill Accelerated by Demonstrations
    Wu, Xiapeng
    Zhang, Dapeng
    Qin, Fangbo
    Xu, De
    2019 IEEE 15TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2019, : 1651 - 1656
  • [26] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
    Wai-Chung Kwan
    Hong-Ru Wang
    Hui-Min Wang
    Kam-Fai Wong
    Machine Intelligence Research, 2023, 20 : 318 - 334
  • [27] Robot skill acquisition in assembly process using deep reinforcement learning
    Li, Fengming
    Jiang, Qi
    Zhang, Sisi
    Wei, Meng
    Song, Rui
    NEUROCOMPUTING, 2019, 345 : 92 - 102
  • [28] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
    Kwan, Wai-Chung
    Wang, Hong-Ru
    Wang, Hui-Min
    Wong, Kam-Fai
    MACHINE INTELLIGENCE RESEARCH, 2023, 20 (03) : 318 - 334
  • [29] SociBuilder: A Novel Task-oriented Swarm Robotic System
    Leng, Yuquan
    Zhang, Yang
    Zhang, Wei
    He, Xu
    Bian, Dekun
    Zhou, Weijia
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 48 - 53
  • [30] Exploring Machine Learning and Deep Learning Frameworks for Task-Oriented Dialogue Act Classification
    Saha, Tulika
    Srivastava, Saurabh
    Firdaus, Mauajama
    Saha, Sriparna
    Ekbal, Asif
    Bhattacharyya, Pushpak
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,