Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

被引：34

作者：

Xiang, Guofei ^{[1
]}

Su, Jianbo ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2021年 / 51卷 / 02期

关键词：

Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;

D O I：

10.1109/TCYB.2019.2949596

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.

引用

页码：1056 / 1069

页数：14

共 50 条

[41] Task-oriented Resource Allocation for Mobile Edge Computing with Multi-Agent Reinforcement Learning
Zou, Yue
Shen, Fei
Yan, Feng
Tang, Liang
2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
[42] Learning by doing - an approach to robotic skill acquisition
Nguyen, MC
Graefe, V
SICE 2001: PROCEEDINGS OF THE 40TH SICE ANNUAL CONFERENCE, INTERNATIONAL SESSION PAPERS, 2001, : 226 - 229
[43] Assembly skill acquisition via reinforcement learning
Lau, HYK
Lee, ISK
ASSEMBLY AUTOMATION, 2001, 21 (02) : 136 - 142
[44] A task-oriented access control model for WfMS
Liao, X
Zhang, L
Chan, SCF
INFORMATION SECURITY PRACTICE AND EXPERIENCE, 2005, 3439 : 168 - 177
[45] Personality-aware Natural Language Generation for Task-oriented Dialogue using Reinforcement Learning
Guo, Ao
Ohashi, Atsumoto
Chiba, Yuya
Tsunomori, Yuiko
Hirai, Ryu
Higashinaka, Ryuichiro
2023 32ND IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION, RO-MAN, 2023, : 1823 - 1828
[46] Adaptive Skill Acquisition in Hierarchical Reinforcement Learning
Holas, Juraj
Farkas, Igor
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 383 - 394
[47] Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning
Zhao, Yangyang
Yin, Kai
Wang, Zhenyu
Dastani, Mehdi
Wang, Shihan
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1380 - 1391
[48] Task-oriented human-robot interaction control of a robotic glove utilizing forearm electromyography
Wang, Xianhe
Zhang, Haotian
Teng, Long
Tang, Chak Yin
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (16): : 11351 - 11370
[49] RFAC Based Task-Oriented Active Sharing Control for a Class of Robotic Rehabilitation Training Systems
Meng, Fancheng
Yang, Shuo
Li, Yafeng
Wang, Jinlei
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 5471 - 5475
[50] On modeling and utilizing chemical compound information with deep learning technologies: A task-oriented approach
Lim, Sangsoo
Lee, Sangseon
Piao, Yinhua
Choi, MinGyu
Bang, Dongmin
Gu, Jeonghyeon
Kim, Sun
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 4288 - 4304

← 1 2 3 4 5 →