Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

被引:2
|
作者
Yao, Zhiyuan [1 ,2 ]
Ding, Zihan [3 ]
Clausen, Thomas [1 ]
机构
[1] Ecole Polytech, Paris, France
[2] Cisco Syst, Paris, France
[3] Princeton Univ, Princeton, NJ USA
关键词
MARL; load balancing; distributed systems;
D O I
10.1145/3511808.3557133
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents the network load balancing problem, a challenging real-world task for multi-agent reinforcement learning (MARL) methods. Conventional heuristic solutions like Weighted-Cost Multi-Path (WCMP) and Local Shortest Queue (LSQ) are less flexible to the changing workload distributions and arrival rates, with a poor balance among multiple load balancers. The cooperative network load balancing task is formulated as a Dec-POMDP problem, which naturally induces the MARL methods. To bridge the reality gap for applying learning-based methods, all models are directly trained and evaluated on a real-world system from moderateto large-scale setups. Experimental evaluations show that the independent and "selfish" load balancing strategies are not necessarily the globally optimal ones, while the proposed MARL solution has a superior performance over different realistic settings. Additionally, the potential difficulties of the application and deployment of MARL methods for network load balancing are analysed, which helps draw the attention of the learning and network communities to such challenges.
引用
收藏
页码:3594 / 3603
页数:10
相关论文
共 50 条
  • [31] Hierarchical multi-agent reinforcement learning
    Ghavamzadeh, Mohammad
    Mahadevan, Sridhar
    Makar, Rajbala
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
  • [32] Multi-agent Exploration with Reinforcement Learning
    Sygkounas, Alkis
    Tsipianitis, Dimitris
    Nikolakopoulos, George
    Bechlioulis, Charalampos P.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
  • [33] Partitioning in multi-agent reinforcement learning
    Sun, R
    Peterson, T
    FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
  • [34] The Dynamics of Multi-Agent Reinforcement Learning
    Dickens, Luke
    Broda, Krysia
    Russo, Alessandra
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
  • [35] Multi-agent reinforcement learning: A survey
    Busoniu, Lucian
    Babuska, Robert
    De Schutter, Bart
    2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
  • [36] Mobility-aware load Balancing for Reliable Self-Organization Networks: Multi-agent Deep Reinforcement Learning
    Mohajer, Amin
    Bavaghar, Maryam
    Farrokhi, Hamid
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 202
  • [37] Learning to Routing in UAV Swarm Network: A Multi-Agent Reinforcement Learning Approach
    Wang, Zunliang
    Yao, Haipeng
    Mai, Tianle
    Xiong, Zehui
    Wu, Xiaohua
    Wu, Di
    Guo, Song
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 6611 - 6624
  • [38] Data-Driven Load Frequency Control Based on Multi-Agent Reinforcement Learning With Attention Mechanism
    Yang, Fan
    Huang, DongHua
    Li, Dongdong
    Lin, Shunfu
    Muyeen, S. M.
    Zhai, Haibao
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (06) : 5560 - 5569
  • [39] Multi-agent Reinforcement Learning and Its Application to Wireless Network Communication
    Pochaba, Sabrina
    Dorfinger, Peter
    Herlich, Matthias
    Kwitt, Roland
    Hirlaender, Simon
    COMBINING, MODELLING AND ANALYZING IMPRECISION, RANDOMNESS AND DEPENDENCE, SMPS 2024, 2024, 1458 : 363 - 370
  • [40] Multi-agent Reinforcement Learning for Swarm Retrieval with Evolving Neural Network
    Vaughan, Neil
    BIOMIMETIC AND BIOHYBRID SYSTEMS, 2018, 10928 : 522 - 526