PROBING TRANSFER IN DEEP REINFORCEMENT LEARNING WITHOUT TASK ENGINEERING

被引：0

作者：

Rusu, Andrei A. ^{[1
]}

Flennerhag, Sebastian ^{[1
]}

Rao, Dushyant ^{[1
]}

Pascanu, Razvan ^{[1
]}

Hadsell, Raia ^{[1
]}

机构：

[1] DeepMind, London, England

来源：

CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199 | 2022年 / 199卷

关键词：

GO; ENVIRONMENT; LEVEL; GAME;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We evaluate the use of original game curricula supported by the Atari 2600 console as a heterogeneous transfer benchmark for deep reinforcement learning agents. Game designers created curricula using combinations of several discrete modifications to the basic versions of games such as Space Invaders, Breakout and Freeway, making them progressively more challenging for human players. By formally organising these modifications into several factors of variation, we are able to show that Analyses of Variance (ANOVA) are a potent tool for studying the effects of human-relevant domain changes on the learning and transfer performance of a deep reinforcement learning agent. Since no manual task engineering is needed on our part, leveraging the original multi-factorial design avoids the pitfalls of unintentionally biasing the experimental setup. We find that game design factors have a large and statistically significant impact on an agent's ability to learn, and so do their combinatorial interactions. Furthermore, we show that zero-shot transfer from the basic games to their respective variations is possible, but the variance in performance is also largely explained by interactions between factors. As such, we argue that Atari game curricula offer a challenging benchmark for transfer learning in RL, that can help the community better understand the generalisation capabilities of RL agents along dimensions which meaningfully impact human generalisation performance. As a start, we report that value-function finetuning of regularly trained agents achieves positive transfer in a majority of cases, but significant headroom for algorithmic innovation remains. We conclude with the observation that selective transfer from multiple variants could further improve performance.

引用

页数：24

共 50 条

[1] Addressing Reward Engineering for Deep Reinforcement Learning on Multi-stage Task
Chen, Bin
Su, Jianhua
NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 309 - 317
[2] Transfer Learning in Deep Reinforcement Learning
Islam, Tariqul
Abid, Dm. Mehedi Hasan
Rahman, Tanvir
Zaman, Zahura
Mia, Kausar
Hossain, Ramim
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 145 - 153
[3] Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
Xu, Zhiyuan
Wu, Kun
Che, Zhengping
Tang, Jian
Ye, Jieping
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[4] Symbolic Task Inference in Deep Reinforcement Learning
Hasanbeig, Hosein
Jeppu, Natasha Yogananda
Abate, Alessandro
Melham, Tom
Kroening, Daniel
Journal of Artificial Intelligence Research, 2024, 80 : 1099 - 1137
[5] Transfer Learning in Deep Reinforcement Learning: A Survey
Zhu, Zhuangdi
Lin, Kaixiang
Jain, Anil K.
Zhou, Jiayu
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13344 - 13362
[6] Symbolic Task Inference in Deep Reinforcement Learning
Hasanbeig, Hosein
Jeppu, Natasha Yogananda
Abate, Alessandro
Melham, Tom
Kroening, Daniel
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 1099 - 1137
[7] Deep Reinforcement Learning Task for Portfolio Construction
Belyakov, Boris
Sizykh, Dmitry
21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 1077 - 1082
[8] Task Allocation of Multiple Unmanned Aerial Vehicles Based on Deep Transfer Reinforcement Learning
Yin, Yongfeng
Guo, Yang
Su, Qingran
Wang, Zhetao
DRONES, 2022, 6 (08)
[9] Task similarity measures for transfer in reinforcement learning task libraries
Carroll, JL
Seppi, K
Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 803 - 808
[10] A Multi-Task-Learning-Based Transfer Deep Reinforcement Learning Design for Autonomic Optical Networks
Chen, Xiaoliang
Proietti, Roberto
Liu, Che-Yu
Yoo, S. J. Ben
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2021, 39 (09) : 2878 - 2889

← 1 2 3 4 5 →