Improvement of systems management policies using hybrid reinforcement learning

被引:0
|
作者
Tesauro, Gerald
Jong, Nicholas K.
Das, Rajarshi
Bennani, Mohamed N.
机构
[1] IBM TJ Watson Res Ctr, Hawthorne, NY 10532 USA
[2] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[3] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement Learning (RL) holds particular promise in an emerging application domain of performance management of computing systems. In recent work, online RL yielded effective server allocation policies in a prototype Data Center, without explicit system models or built-in domain knowledge. This paper presents a substantially improved and more practical "hybrid" approach, in which RL trains offline on data collected while a queuing-theoretic policy controls the system. This approach avoids potentially poor performance in live online training. Additionally we use nonlinear function approximators instead of tabular value functions; this greatly improves scalability, and surprisingly, eliminated the need for exploratory actions. In experiments using both open-loop and closed-loop traffic as well as large switching delays, our results show significant performance improvement over state-of-art queuing model policies.
引用
收藏
页码:783 / 791
页数:9
相关论文
共 50 条
  • [1] Power management optimisation for hybrid electric systems using reinforcement learning and adaptive dynamic programming
    Sanusi, Ibrahim
    Mills, Andrew
    Konstantopoulos, George
    Dodd, Tony
    2019 AMERICAN CONTROL CONFERENCE (ACC), 2019, : 2608 - 2613
  • [2] An intelligent energy management framework for hybrid-electric propulsion systems using deep reinforcement learning
    Wu, Peng
    Partridge, Julius
    Anderlini, Enrico
    Liu, Yuanchang
    Bucknall, Richard
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2025, 106 : 282 - 294
  • [3] Energy and Emission Management of Hybrid Electric Vehicles using Reinforcement Learning
    Hofstetter, Johannes
    Bauer, Hans
    Li, Wenbin
    Wachtmeister, Georg
    IFAC PAPERSONLINE, 2019, 52 (29): : 19 - 24
  • [4] Energy Management System for a Hybrid Electric Vehicle Using Reinforcement Learning
    Chen, Syuan-Yi
    Lo, Hsiang-Yu
    Tsao, Tung-Yao
    Lai, I-Wei
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,
  • [5] Energy Management System for a Hybrid Electric Vehicle Using Reinforcement Learning
    Chen, Syuan-Yi
    Lo, Hsiang-Yu
    Tsao, Tung-Yao
    Lai, I-Wei
    IEEE ISPCE-ASIA 2021: IEEE INTERNATIONAL SYMPOSIUM ON PRODUCT COMPLIANCE ENGINEERING - ASIA, 2021,
  • [6] Using Reinforcement Learning for Dialogue Management Policies: Towards Understanding MDP Violations and Convergence
    Heeman, Peter A.
    Fryer, Jordan
    Lunsford, Rebecca
    Rueckert, Andrew
    Selfridge, Ethan O.
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 746 - 749
  • [7] Learning Urban Driving Policies using Deep Reinforcement Learning
    Agarwal, Tanmay
    Arora, Hitesh
    Schneider, Jeff
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 607 - 614
  • [8] Hybrid reinforcement learning in autonomous mobile systems
    Youssef, SM
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL, 2005, : 460 - 464
  • [9] Energy Management for Hybrid Electric Vehicles Using Safe Hybrid-Action Reinforcement Learning
    Xu, Jinming
    Lin, Yuan
    MATHEMATICS, 2024, 12 (05)
  • [10] Power management in hybrid electric vehicles using deep recurrent reinforcement learning
    Sun, Mengshu
    Zhao, Pu
    Lin, Xue
    ELECTRICAL ENGINEERING, 2022, 104 (03) : 1459 - 1471