NetHack is Hard to Hack

Cited by: 0
Authors
Piterbarg, Ulyana [1]
Pinto, Lerrel [1]
Fergus, Rob [1]
Affiliation
[1] NYU, New York, NY 10012 USA
Funding
U.S. National Science Foundation;
Keywords
MANIPULATION;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents outperformed neural approaches by over four times in median game score. In this paper, we delve into the reasons behind this performance gap and present an extensive study on neural policy learning for NetHack. To conduct this study, we analyze the winning symbolic agent, extending its codebase to track internal strategy selection in order to generate one of the largest available demonstration datasets. Utilizing this dataset, we examine (i) the advantages of an action hierarchy; (ii) enhancements in neural architecture; and (iii) the integration of reinforcement learning with imitation learning. Our investigations produce a state-of-the-art neural agent that surpasses previous fully neural policies by 127% in offline settings and 25% in online settings on median game score. However, we also demonstrate that mere scaling is insufficient to bridge the performance gap with the best symbolic models or even the top human players.
Pages: 27