ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

被引:1
|
作者
Pendyala, Abhijeet [1 ]
Dettmer, Justin [1 ]
Glasmachers, Tobias [1 ]
Atamna, Asma [1 ]
机构
[1] Ruhr Univ Bochum, Bochum, Germany
关键词
Deep reinforcement learning; Real-world benchmark; Resource allocation;
D O I
10.1007/978-3-031-53969-5_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present ContainerGym, a benchmark for reinforcement learning inspired by a real-world industrial resource allocation task. The proposed benchmark encodes a range of challenges commonly encountered in real-world sequential decision making problems, such as uncertainty. It can be configured to instantiate problems of varying degrees of difficulty, e.g., in terms of variable dimensionality. Our benchmark differs from other reinforcement learning benchmarks, including the ones aiming to encode real-world difficulties, in that it is directly derived from a real-world industrial problem, which underwent minimal simplification and streamlining. It is sufficiently versatile to evaluate reinforcement learning algorithms on any real-world problem that fits our resource allocation framework. We provide results of standard baseline methods. Going beyond the usual training reward curves, our results and the statistical tools used to interpret them allow to highlight interesting limitations of well-known deep reinforcement learning algorithms, namely PPO, TRPO and DQN.
引用
收藏
页码:78 / 92
页数:15
相关论文
共 50 条
  • [31] Towards learning-based planning: The nuPlan benchmark for real-world autonomous driving
    Karnchanachari, Napat
    Geromichalos, Dimitris
    Tan, Kok Seang
    Li, Nanxiang
    Eriksen, Christopher
    Yaghoubi, Shakiba
    Mehdipour, Noushin
    Bernasconi, Gianmarco
    Fong, Whye Kit
    Guo, Yiluan
    Caesar, Holger
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 629 - 636
  • [32] Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
    Qureshi, Ahmed Hussain
    Nakamura, Yutaka
    Yoshikawa, Yuichiro
    Ishiguro, Hiroshi
    NEURAL NETWORKS, 2018, 107 : 23 - 33
  • [33] Non-blocking Asynchronous Training for Reinforcement Learning in Real-World Environments
    Bohm, Peter
    Pounds, Pauline
    Chapman, Archie C.
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10927 - 10934
  • [34] Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
    Rajapakshe, Thejan
    Rana, Rajib
    Khalifa, Sara
    Schuller, Bjoern W.
    IEEE ACCESS, 2024, 12 : 193101 - 193114
  • [35] STASIS: Reinforcement Learning Simulators for Human-Centric Real-World Environments
    Efstathiadis, Georgios
    Emedom-Nnamdi, Patrick
    Kolbeinsson, Arinbjorn
    Onnela, Jukka-Pekka
    Lu, Junwei
    TRUSTWORTHY MACHINE LEARNING FOR HEALTHCARE, TML4H 2023, 2023, 13932 : 85 - 92
  • [36] Optimizing Reinforcement Learning Control Model in Furuta Pendulum and Transferring it to Real-World
    Hong, Myung Rae
    Kang, Sanghun
    Lee, Jingoo
    Seo, Sungchul
    Han, Seungyong
    Koh, Je-Sung
    Kang, Daeshik
    IEEE ACCESS, 2023, 11 : 95195 - 95200
  • [37] Real-World Implementation of Reinforcement Learning Based Energy Coordination for a Cluster of Households
    Gokhale, Gargya
    Tiben, Niels
    Verwee, Marie-Sophie
    Lahariya, Manu
    Claessens, Bert
    Develder, Chris
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 347 - 351
  • [38] Controlling Aluminum Strip Thickness by Clustered Reinforcement Learning With Real-World Dataset
    Xiao, Ziqi
    He, Zhili
    Liang, Huanghuang
    Hu, Chuang
    Cheng, Dazhao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (08) : 9928 - 9938
  • [39] Exploring Applications of Deep Reinforcement Learning for Real-world Autonomous Driving Systems
    Talpaert, Victor
    Sobh, Ibrahim
    Kiran, B. Ravi
    Mannion, Patrick
    Yogamani, Senthil
    El-Sallab, Ahmad
    Perez, Patrick
    PROCEEDINGS OF THE 14TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 5, 2019, : 564 - 572
  • [40] Continual World: A Robotic Benchmark For Continual Reinforcement Learning
    Wolczyk, Maciej
    Zajac, Michal
    Pascanu, Razvan
    Kucinski, Lukasz
    Milos, Piotr
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34