Wide and Deep Reinforcement Learning for Grid-based Action Games

被引：3

作者：

Montoya, Juan M. ^{[1
]}

Borgelt, Christian ^{[1
]}

机构：

[1] Univ Konstanz, Chair Bioinformat & Informat Min, Constance, Germany

来源：

PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019年

关键词：

Wide and Deep Reinforcement Learning; Wide Deep Q-Networks; Value Function Approximation; Reinforcement Learning Agents;

D O I：

10.5220/0007313200500059

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For the last decade Deep Reinforcement Learning has undergone exponential development; however, less has been done to integrate linear methods into it. Our Wide and Deep Reinforcement Learning framework provides a tool that combines linear and non-linear methods into one. For practical implementations, our framework can help integrate expert knowledge while improving the performance of existing Deep Reinforcement Learning algorithms. Our research aims to generate a simple practical framework to extend such algorithms. To test this framework we develop an extension of the popular Deep Q-Networks algorithm, which we name Wide Deep Q-Networks. We analyze its performance compared to Deep Q-Networks and Linear Agents, as well as human players. We apply our new algorithm to Berkley's Pac-Man environment. Our algorithm considerably outperforms Deep Q-Networks' both in terms of learning speed and ultimate performance showing its potential for boosting existing algorithms.

引用

页码：50 / 59

页数：10

共 50 条

[31] Deep Reinforcement Learning for Conversational Robots Playing Games
Cuayahuitl, Heriberto
2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 771 - 776
[32] Deep Reinforcement Learning for Navigation in AAA Video Games
Alonso, Eloi
Peter, Maxim
Goumard, David
Romoff, Joshua
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2133 - 2139
[33] Deep Reinforcement Learning with Transformers for Text Adventure Games
Xu, Yunqiu
Chen, Ling
Fang, Meng
Wang, Yang
Zhang, Chengqi
2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 65 - 72
[34] Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition
Tang, Yansong
Tian, Yi
Lu, Jiwen
Li, Peiyang
Zhou, Jie
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5323 - 5332
[35] Extracting Action Sequences from Texts Based on Deep Reinforcement Learning
Feng, Wenfeng
Zhuo, Hankz Hankui
Kambhampati, Subbarao
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4064 - 4070
[36] Deep Reinforcement Learning Based Coalition Formation for Energy Trading in Smart Grid
Sadeghi, Mohammad
Erol-Kantarci, Melike
2021 IEEE 4TH 5G WORLD FORUM (5GWF 2021), 2021, : 200 - 205
[37] Energy Trading in Smart Grid: A Deep Reinforcement Learning-based Approach
Zhang, Feiye
Yang, Qingyu
PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3677 - 3682
[38] Deep Reinforcement Learning-Based Smart Grid Resource Allocation System
Lang, Qiong
Zhu, La Ba Dun
Ren, Mi Ma Ci
Zhang, Rui
Wu, Yinghen
He, Wenting
Li, Mingjia
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 703 - 707
[39] Deep Reinforcement Learning Based MPPT Control for Grid Connected PV System
Vora, Kunal
Liu, Shichao
Dhulipati, Himavarsha
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024, 2024,
[40] Research on grid-based personalized collaborative learning system
Zhao Chengling
Yan, Cao
Tan Xiaodong
Qi, Luo
Ying, Yu
Proceedings of 2006 International Conference on Artificial Intelligence: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 638 - 642

← 1 2 3 4 5 →