Wide and Deep Reinforcement Learning for Grid-based Action Games

被引:3
|
作者
Montoya, Juan M. [1 ]
Borgelt, Christian [1 ]
机构
[1] Univ Konstanz, Chair Bioinformat & Informat Min, Constance, Germany
来源
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2 | 2019年
关键词
Wide and Deep Reinforcement Learning; Wide Deep Q-Networks; Value Function Approximation; Reinforcement Learning Agents;
D O I
10.5220/0007313200500059
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For the last decade Deep Reinforcement Learning has undergone exponential development; however, less has been done to integrate linear methods into it. Our Wide and Deep Reinforcement Learning framework provides a tool that combines linear and non-linear methods into one. For practical implementations, our framework can help integrate expert knowledge while improving the performance of existing Deep Reinforcement Learning algorithms. Our research aims to generate a simple practical framework to extend such algorithms. To test this framework we develop an extension of the popular Deep Q-Networks algorithm, which we name Wide Deep Q-Networks. We analyze its performance compared to Deep Q-Networks and Linear Agents, as well as human players. We apply our new algorithm to Berkley's Pac-Man environment. Our algorithm considerably outperforms Deep Q-Networks' both in terms of learning speed and ultimate performance showing its potential for boosting existing algorithms.
引用
收藏
页码:50 / 59
页数:10
相关论文
共 50 条
  • [31] Deep Reinforcement Learning for Conversational Robots Playing Games
    Cuayahuitl, Heriberto
    2017 IEEE-RAS 17TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTICS (HUMANOIDS), 2017, : 771 - 776
  • [32] Deep Reinforcement Learning for Navigation in AAA Video Games
    Alonso, Eloi
    Peter, Maxim
    Goumard, David
    Romoff, Joshua
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2133 - 2139
  • [33] Deep Reinforcement Learning with Transformers for Text Adventure Games
    Xu, Yunqiu
    Chen, Ling
    Fang, Meng
    Wang, Yang
    Zhang, Chengqi
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 65 - 72
  • [34] Deep Progressive Reinforcement Learning for Skeleton-based Action Recognition
    Tang, Yansong
    Tian, Yi
    Lu, Jiwen
    Li, Peiyang
    Zhou, Jie
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5323 - 5332
  • [35] Extracting Action Sequences from Texts Based on Deep Reinforcement Learning
    Feng, Wenfeng
    Zhuo, Hankz Hankui
    Kambhampati, Subbarao
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 4064 - 4070
  • [36] Deep Reinforcement Learning Based Coalition Formation for Energy Trading in Smart Grid
    Sadeghi, Mohammad
    Erol-Kantarci, Melike
    2021 IEEE 4TH 5G WORLD FORUM (5GWF 2021), 2021, : 200 - 205
  • [37] Energy Trading in Smart Grid: A Deep Reinforcement Learning-based Approach
    Zhang, Feiye
    Yang, Qingyu
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 3677 - 3682
  • [38] Deep Reinforcement Learning-Based Smart Grid Resource Allocation System
    Lang, Qiong
    Zhu, La Ba Dun
    Ren, Mi Ma Ci
    Zhang, Rui
    Wu, Yinghen
    He, Wenting
    Li, Mingjia
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 703 - 707
  • [39] Deep Reinforcement Learning Based MPPT Control for Grid Connected PV System
    Vora, Kunal
    Liu, Shichao
    Dhulipati, Himavarsha
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON INDUSTRIAL CYBER-PHYSICAL SYSTEMS, ICPS 2024, 2024,
  • [40] Research on grid-based personalized collaborative learning system
    Zhao Chengling
    Yan, Cao
    Tan Xiaodong
    Qi, Luo
    Ying, Yu
    Proceedings of 2006 International Conference on Artificial Intelligence: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 638 - 642