NetHack is Hard to Hack

被引:0
|
作者
Piterbarg, Ulyana [1 ]
Pinto, Lerrel [1 ]
Fergus, Rob [1 ]
机构
[1] NYU, New York, NY 10012 USA
基金
美国国家科学基金会;
关键词
MANIPULATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents outperformed neural approaches by over four times in median game score. In this paper, we delve into the reasons behind this performance gap and present an extensive study on neural policy learning for NetHack. To conduct this study, we analyze the winning symbolic agent, extending its codebase to track internal strategy selection in order to generate one of the largest available demonstration datasets. Utilizing this dataset, we examine (i) the advantages of an action hierarchy; (ii) enhancements in neural architecture; and (iii) the integration of reinforcement learning with imitation learning. Our investigations produce a state-of-the-art neural agent that surpasses previous fully neural policies by 127% in offline settings and 25% in online settings on median game score. However, we also demonstrate that mere scaling is insufficient to bridge the performance gap with the best symbolic models or even the top human players.
引用
收藏
页数:27
相关论文
共 50 条
  • [31] HACK 'CHARLOTTE'
    WILSON, E
    NEW YORK THEATRE CRITICS REVIEWS, 1980, 41 (05): : 323 - 323
  • [32] THE HACK HOME
    Shah, Agam
    MECHANICAL ENGINEERING, 2018, 140 (09) : 14 - 15
  • [33] Life Hack
    Kachur, Lewis
    BURLINGTON MAGAZINE, 2020, 162 (1402): : 69 - 71
  • [34] Hack for hire
    Mirian, Ariana
    Queue, 2019, 17 (04):
  • [35] Dungeons and Data: A Large-Scale NetHack Dataset
    Hambro, Eric
    Raileanu, Roberta
    Rothermel, Danielle
    Mella, Vegard
    Rocktaschel, Tim
    Kuttler, Heinrich
    Murray, Naila
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [36] HACK THE SUBURBS!
    不详
    LANDSCAPE ARCHITECTURE MAGAZINE, 2017, 107 (01) : 38 - 38
  • [37] Baghdad hack
    Borrell, Brendan
    SCIENTIST, 2008, 22 (12): : 17 - 18
  • [38] Hack attack
    Krause, J
    ABA JOURNAL, 2002, 88 : 50 - +
  • [39] Hack attacks
    Pope, C
    PROFESSIONAL ENGINEERING, 2002, 15 (21) : 24 - 25
  • [40] Traffic hack
    Chattaway, Alan
    NEW SCIENTIST, 2010, 206 (2761) : 29 - 29