The Arcade Learning Environment: An Evaluation Platform for General Agents

被引:1220
作者
Bellemare, Marc G. [1 ]
Naddaf, Yavar [2 ]
Veness, Joel [1 ]
Bowling, Michael [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
[2] Empir Results Inc, Vancouver, BC, Canada
关键词
Reinforcement learning;
D O I
10.1613/jair.3912
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article we introduce the Arcade Learning Environment (ALE): both a challenge problem and a platform and methodology for evaluating the development of general, domain-independent AI technology. ALE provides an interface to hundreds of Atari 2600 game environments, each one different, interesting, and designed to be a challenge for human players. ALE presents significant research challenges for reinforcement learning, model learning, model-based planning, imitation learning, transfer learning, and intrinsic motivation. Most importantly, it provides a rigorous testbed for evaluating and comparing approaches to these problems. We illustrate the promise of ALE by developing and benchmarking domain-independent agents designed using well-established AI techniques for both reinforcement learning and planning. In doing so, we also propose an evaluation methodology made possible by ALE, reporting empirical results on over 55 different games. All of the software, including the benchmark agents, is publicly available.
引用
收藏
页码:253 / 279
页数:27
相关论文
共 32 条
[21]   Map learning with uninterpreted sensors and effectors [J].
Pierce, D ;
Kuipers, BJ .
ARTIFICIAL INTELLIGENCE, 1997, 92 (1-2) :169-227
[22]   Rationality and intelligence [J].
Russell, SJ .
ARTIFICIAL INTELLIGENCE, 1997, 94 (1-2) :57-77
[23]  
Schaul T., 2011, ABS11091314 CORR
[24]   GENERALIZED POLYNOMIAL APPROXIMATIONS IN MARKOVIAN DECISION-PROCESSES [J].
SCHWEITZER, PJ ;
SEIDMANN, A .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1985, 110 (02) :568-582
[25]  
Stober Jeremy, 2008, P 7 IEEE INT C DEV L
[26]  
Sutton R. S., 2011, P 10 INT C AUT AG MU
[27]  
Sutton R.S., 2017, Introduction to reinforcement learning
[28]   LIFELONG ROBOT LEARNING [J].
THRUN, S ;
MITCHELL, TM .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 15 (1-2) :25-46
[29]  
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698
[30]   The Reinforcement Learning Competitions [J].
Whiteson, Shimon ;
Tanner, Brian ;
White, Adam .
AI MAGAZINE, 2010, 31 (02) :81-94