NetHack is Hard to Hack

Cited by: 0

Authors:

Piterbarg, Ulyana [1]

Pinto, Lerrel [1]

Fergus, Rob [1]

Affiliations:

[1] NYU, New York, NY 10012 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023

Funding:

U.S. National Science Foundation;

Keywords:

MANIPULATION;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural policy learning methods have achieved remarkable results in various control problems, ranging from Atari games to simulated locomotion. However, these methods struggle in long-horizon tasks, especially in open-ended environments with multi-modal observations, such as the popular dungeon-crawler game, NetHack. Intriguingly, the NeurIPS 2021 NetHack Challenge revealed that symbolic agents outperformed neural approaches by over four times in median game score. In this paper, we delve into the reasons behind this performance gap and present an extensive study on neural policy learning for NetHack. To conduct this study, we analyze the winning symbolic agent, extending its codebase to track internal strategy selection in order to generate one of the largest available demonstration datasets. Utilizing this dataset, we examine (i) the advantages of an action hierarchy; (ii) enhancements in neural architecture; and (iii) the integration of reinforcement learning with imitation learning. Our investigations produce a state-of-the-art neural agent that surpasses previous fully neural policies by 127% in offline settings and 25% in online settings on median game score. However, we also demonstrate that mere scaling is insufficient to bridge the performance gap with the best symbolic models or even the top human players.


Pages: 27

