Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

被引：34

作者：

Xiang, Guofei ^{[1
]}

Su, Jianbo ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2021年 / 51卷 / 02期

关键词：

Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;

D O I：

10.1109/TCYB.2019.2949596

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.

引用

页码：1056 / 1069

页数：14

共 50 条

[21] A Task-oriented Service Personalization Scheme for Smart Environments Using Reinforcement Learning
Tegelund, Bjorn
Son, Heesuk
Lee, Dongman
2016 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATION WORKSHOPS (PERCOM WORKSHOPS), 2016,
[22] Using Reinforcement Learning for Dialogue Act Classification in Task-oriented Conversation Systems
Xia, Qingyang
2018 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2018), 2018, : 187 - 196
[23] An empirical assessment of deep learning approaches to task-oriented dialog management
Mateju, Lukas
Griol, David
Callejas, Zoraida
Molina, Jose Manuel
Sanchis, Araceli
NEUROCOMPUTING, 2021, 439 : 327 - 339
[24] Learning to Model Task-Oriented Attention
Zou, Xiaochun
Zhao, Xinbo
Wang, Jian
Yang, Yongjia
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016 : 1 - 12
[25] Deep Reinforcement Learning of Robotic Precision Insertion Skill Accelerated by Demonstrations
Wu, Xiapeng
Zhang, Dapeng
Qin, Fangbo
Xu, De
2019 IEEE 15TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2019, : 1651 - 1656
[26] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
Wai-Chung Kwan
Hong-Ru Wang
Hui-Min Wang
Kam-Fai Wong
Machine Intelligence Research, 2023, 20 : 318 - 334
[27] Robot skill acquisition in assembly process using deep reinforcement learning
Li, Fengming
Jiang, Qi
Zhang, Sisi
Wei, Meng
Song, Rui
NEUROCOMPUTING, 2019, 345 : 92 - 102
[28] A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
Kwan, Wai-Chung
Wang, Hong-Ru
Wang, Hui-Min
Wong, Kam-Fai
MACHINE INTELLIGENCE RESEARCH, 2023, 20 (03) : 318 - 334
[29] SociBuilder: A Novel Task-oriented Swarm Robotic System
Leng, Yuquan
Zhang, Yang
Zhang, Wei
He, Xu
Bian, Dekun
Zhou, Weijia
2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2015, : 48 - 53
[30] Exploring Machine Learning and Deep Learning Frameworks for Task-Oriented Dialogue Act Classification
Saha, Tulika
Srivastava, Saurabh
Firdaus, Mauajama
Saha, Sriparna
Ekbal, Asif
Bhattacharyya, Pushpak
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,

← 1 2 3 4 5 →