Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

Cited by: 2
Authors
Yao, Zhiyuan [1 ,2 ]
Ding, Zihan [3 ]
Clausen, Thomas [1 ]
Affiliations
[1] Ecole Polytech, Paris, France
[2] Cisco Syst, Paris, France
[3] Princeton Univ, Princeton, NJ USA
Keywords
MARL; load balancing; distributed systems
DOI
10.1145/3511808.3557133
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents the network load balancing problem, a challenging real-world task for multi-agent reinforcement learning (MARL) methods. Conventional heuristic solutions like Weighted-Cost Multi-Path (WCMP) and Local Shortest Queue (LSQ) are less flexible to the changing workload distributions and arrival rates, with a poor balance among multiple load balancers. The cooperative network load balancing task is formulated as a Dec-POMDP problem, which naturally induces the MARL methods. To bridge the reality gap for applying learning-based methods, all models are directly trained and evaluated on a real-world system from moderate- to large-scale setups. Experimental evaluations show that the independent and "selfish" load balancing strategies are not necessarily the globally optimal ones, while the proposed MARL solution has a superior performance over different realistic settings. Additionally, the potential difficulties of the application and deployment of MARL methods for network load balancing are analysed, which helps draw the attention of the learning and network communities to such challenges.
Pages: 3594 - 3603
Number of pages: 10
Related papers
50 items in total
  • [41] Towards Multi-agent Reinforcement Learning for Wireless Network Protocol Synthesis
    Dutta, Hrishikesh
    Biswas, Subir
    2021 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2021, : 614 - 622
  • [42] Distributed Multi-agent Reinforcement Learning for Directional UAV Network Control
    He, Linsheng
    Zhao, Jiamiao
    Hu, Fei
    PROCEEDINGS OF THE 32ND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2023, 2023, : 317 - 318
  • [43] Load balancing in distributed multi-agent computing systems
    Metawei, Maha A.
    Ghoneim, Salma A.
    Haggag, Sahar M.
    Nassar, Salwa M.
    AIN SHAMS ENGINEERING JOURNAL, 2012, 3 (03) : 237 - 249
  • [44] Dynamic Load Balancing in Multi-Agent Spatial Simulation
    Mistry, Bhargav
    Fukuda, Munehiro
    2015 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2015, : 141 - 146
  • [45] Accelerated Decentralized Load Balancing in Multi-Agent Networks
    Erofeeva, Victoria
    Granichin, Oleg
    Volodina, Elena
    IEEE ACCESS, 2024, 12 : 161954 - 161967
  • [46] Image-Based Multi-Agent Reinforcement Learning for Demand-Capacity Balancing
    Mas-Pujol, Sergi
    Salami, Esther
    Pastor, Enric
    AEROSPACE, 2022, 9 (10)
  • [47] AdaptAUG: Adaptive Data Augmentation Framework for Multi-Agent Reinforcement Learning
    Yu, Xin
    Tian, Yongkai
    Wang, Li
    Feng, Pu
    Wu, Wenjun
    Shi, Rongye
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 10814 - 10820
  • [48] Coordinated Load Balancing in Mobile Edge Computing Network: a Multi-Agent DRL Approach
    Ma, Manyou
    Wu, Di
    Xu, Yi Tian
    Li, Jimmy
    Jang, Seowoo
    Liu, Xue
    Dudek, Gregory
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 619 - 624
  • [49] TEAM POLICY LEARNING FOR MULTI-AGENT REINFORCEMENT LEARNING
    Cassano, Lucas
    Alghunaim, Sulaiman A.
    Sayed, Ali H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3062 - 3066
  • [50] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551