ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

被引：1

作者：

Pendyala, Abhijeet ^{[1
]}

Dettmer, Justin ^{[1
]}

Glasmachers, Tobias ^{[1
]}

Atamna, Asma ^{[1
]}

机构：

[1] Ruhr Univ Bochum, Bochum, Germany

来源：

MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT I | 2024年 / 14505卷

关键词：

Deep reinforcement learning; Real-world benchmark; Resource allocation;

D O I：

10.1007/978-3-031-53969-5_7

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present ContainerGym, a benchmark for reinforcement learning inspired by a real-world industrial resource allocation task. The proposed benchmark encodes a range of challenges commonly encountered in real-world sequential decision making problems, such as uncertainty. It can be configured to instantiate problems of varying degrees of difficulty, e.g., in terms of variable dimensionality. Our benchmark differs from other reinforcement learning benchmarks, including the ones aiming to encode real-world difficulties, in that it is directly derived from a real-world industrial problem, which underwent minimal simplification and streamlining. It is sufficiently versatile to evaluate reinforcement learning algorithms on any real-world problem that fits our resource allocation framework. We provide results of standard baseline methods. Going beyond the usual training reward curves, our results and the statistical tools used to interpret them allow to highlight interesting limitations of well-known deep reinforcement learning algorithms, namely PPO, TRPO and DQN.

引用

页码：78 / 92

页数：15

共 50 条

[31] Towards learning-based planning: The nuPlan benchmark for real-world autonomous driving
Karnchanachari, Napat
Geromichalos, Dimitris
Tan, Kok Seang
Li, Nanxiang
Eriksen, Christopher
Yaghoubi, Shakiba
Mehdipour, Noushin
Bernasconi, Gianmarco
Fong, Whye Kit
Guo, Yiluan
Caesar, Holger
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 629 - 636
[32] Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
Qureshi, Ahmed Hussain
Nakamura, Yutaka
Yoshikawa, Yuichiro
Ishiguro, Hiroshi
NEURAL NETWORKS, 2018, 107 : 23 - 33
[33] Non-blocking Asynchronous Training for Reinforcement Learning in Real-World Environments
Bohm, Peter
Pounds, Pauline
Chapman, Archie C.
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10927 - 10934
[34] Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
Rajapakshe, Thejan
Rana, Rajib
Khalifa, Sara
Schuller, Bjoern W.
IEEE ACCESS, 2024, 12 : 193101 - 193114
[35] STASIS: Reinforcement Learning Simulators for Human-Centric Real-World Environments
Efstathiadis, Georgios
Emedom-Nnamdi, Patrick
Kolbeinsson, Arinbjorn
Onnela, Jukka-Pekka
Lu, Junwei
TRUSTWORTHY MACHINE LEARNING FOR HEALTHCARE, TML4H 2023, 2023, 13932 : 85 - 92
[36] Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World
Hong, Myung Rae
Kang, Sanghun
Lee, Jingoo
Seo, Sungchul
Han, Seungyong
Koh, Je-Sung
Kang, Daeshik
IEEE ACCESS, 2023, 11 : 95195 - 95200
[37] Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households
Gokhale, Gargya
Tiben, Niels
Verwee, Marie-Sophie
Lahariya, Manu
Claessens, Bert
Develder, Chris
PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 347 - 351
[38] Controlling Aluminum Strip Thickness by Clustered Reinforcement Learning With Real-World Dataset
Xiao, Ziqi
He, Zhili
Liang, Huanghuang
Hu, Chuang
Cheng, Dazhao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 9928 - 9938
[39] Exploring Applications of Deep Reinforcement Learning for Real-world Autonomous Driving Systems
Talpaert, Victor
Sobh, Ibrahim
Kiran, B. Ravi
Mannion, Patrick
Yogamani, Senthil
El-Sallab, Ahmad
Perez, Patrick
PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 564 - 572
[40] Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Wolczyk, Maciej
Zajac, Michal
Pascanu, Razvan
Kucinski, Lukasz
Milos, Piotr
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34

← 1 2 3 4 5 →