Task-Oriented Deep Reinforcement Learning for Robotic Skill Acquisition and Control

被引：34

作者：

Xiang, Guofei ^{[1
]}

Su, Jianbo ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Dept Automat, Key Lab Syst Control & Informat Proc, Minist Educ, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2021年 / 51卷 / 02期

关键词：

Continuous control; deep neural networks (DNNs); exploration; imitation learning (IL); reinforcement learning (RL); robotics; skill acquisition; SEARCH;

D O I：

10.1109/TCYB.2019.2949596

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and control tasks. However, these methods and their extensions require extensive environmental interactions during training, which greatly prevents them from being applied to real-world robots. To alleviate this problem, we present an efficient model-free off-policy actor-critic algorithm for robotic skill acquisition and continuous control, by fusing the task reward with a task-oriented guiding reward, which is formulated by leveraging few and imperfect expert demonstrations. In this framework, the agent can explore the environment more intentionally, thus sampling efficiency can be achieved; moreover, the agent can also exploit the experience more effectively, thereby substantially improved performance can be realized simultaneously. The empirical results on robotic locomotion tasks show that the proposed scheme can lower sample complexity by 2-10 times in contrast with the state-of-the-art baseline deep RL (DRL) algorithms, while achieving performance better than that of the expert. Furthermore, the proposed algorithm achieves significant improvement in both sampling efficiency and asymptotic performance on tasks with sparse and delayed reward, wherein those baseline DRL algorithms struggle to make progress. This takes a substantial step forward to implement these methods to acquire skills autonomously for real robots.

引用

页码：1056 / 1069

页数：14

共 50 条

[31] Multi-Receiver Task-Oriented Communications via Multi-Task Deep Learning
Sagduyu, Yalin E.
Erpek, Tugba
Yener, Aylin
Ulukus, Sennur
2023 IEEE FUTURE NETWORKS WORLD FORUM, FNWF, 2024,
[32] Task-Oriented Adaptive Position/Force Control for Robotic Systems Under Hybrid Constraints
Ding, Shuai
Peng, Jinzhu
Xin, Jianbin
Zhang, Hui
Wang, Yaonan
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (10) : 12612 - 12622
[33] Transfer Learning based Task-oriented Dialogue Policy for Multiple Domains using Hierarchical Reinforcement Learning
Saha, Tulika
Saha, Sriparna
Bhattacharyya, Pushpak
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[34] Task-oriented developmental learning for humanoid robots
Tan, KC
Chen, YJ
Tan, KK
Lee, TH
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2005, 52 (03) : 906 - 914
[35] Continual Learning in Task-Oriented Dialogue Systems
Madotto, Andrea
Lin, Zhaojiang
Zhou, Zhenpeng
Moon, Seungwhan
Crook, Paul
Liu, Bing
Yu, Zhou
Cho, Eunjoon
Fung, Pascale
Wang, Zhiguang
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7452 - 7467
[36] Painless and accurate medical image analysis using deep reinforcement learning with task-oriented homogenized automatic pre-processing
Yuan, Di
Liu, Yunxin
Xu, Zhenghua
Zhan, Yuefu
Chen, Junyang
Lukasiewicz, Thomas
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
[37] Learning Folksonomies from Task-Oriented Dialogues
Puppi Wanderley, Gregory Moro
Paraiso, Emerson Cabrera
30TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, VOLS I AND II, 2015, : 360 - 367
[38] A novel task-oriented framework for dual-arm robotic assembly task
Zhengwei WANG
Yahui GAN
Xianzhong DAI
Frontiers of Mechanical Engineering, 2021, (03) : 528 - 545
[39] A novel task-oriented framework for dual-arm robotic assembly task
Zhengwei Wang
Yahui Gan
Xianzhong Dai
Frontiers of Mechanical Engineering, 2021, 16 : 528 - 545
[40] A novel task-oriented framework for dual-arm robotic assembly task
Wang, Zhengwei
Gan, Yahui
Dai, Xianzhong
FRONTIERS OF MECHANICAL ENGINEERING, 2021, 16 (03) : 528 - 545

← 1 2 3 4 5 →