Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

被引：2

作者：

Yao, Zhiyuan ^{[1
,2
]}

Ding, Zihan ^{[3
]}

Clausen, Thomas ^{[1
]}

机构：

[1] Ecole Polytech, Paris, France

[2] Cisco Syst, Paris, France

[3] Princeton Univ, Princeton, NJ USA

来源：

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022 | 2022年

关键词：

MARL; load balancing; distributed systems;

D O I：

10.1145/3511808.3557133

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents the network load balancing problem, a challenging real-world task for multi-agent reinforcement learning (MARL) methods. Conventional heuristic solutions like Weighted-Cost Multi-Path (WCMP) and Local Shortest Queue (LSQ) are less flexible to the changing workload distributions and arrival rates, with a poor balance among multiple load balancers. The cooperative network load balancing task is formulated as a Dec-POMDP problem, which naturally induces the MARL methods. To bridge the reality gap for applying learning-based methods, all models are directly trained and evaluated on a real-world system from moderateto large-scale setups. Experimental evaluations show that the independent and "selfish" load balancing strategies are not necessarily the globally optimal ones, while the proposed MARL solution has a superior performance over different realistic settings. Additionally, the potential difficulties of the application and deployment of MARL methods for network load balancing are analysed, which helps draw the attention of the learning and network communities to such challenges.

引用

页码：3594 / 3603

页数：10

共 50 条

[31] Hierarchical multi-agent reinforcement learning
Ghavamzadeh, Mohammad
Mahadevan, Sridhar
Makar, Rajbala
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2006, 13 (02) : 197 - 229
[32] Multi-agent Exploration with Reinforcement Learning
Sygkounas, Alkis
Tsipianitis, Dimitris
Nikolakopoulos, George
Bechlioulis, Charalampos P.
2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 630 - 635
[33] Partitioning in multi-agent reinforcement learning
Sun, R
Peterson, T
FROM ANIMALS TO ANIMATS 6, 2000, : 325 - 332
[34] The Dynamics of Multi-Agent Reinforcement Learning
Dickens, Luke
Broda, Krysia
Russo, Alessandra
ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
[35] Multi-agent reinforcement learning: A survey
Busoniu, Lucian
Babuska, Robert
De Schutter, Bart
2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
[36] Mobility-aware load Balancing for Reliable Self-Organization Networks: Multi-agent Deep Reinforcement Learning
Mohajer, Amin
Bavaghar, Maryam
Farrokhi, Hamid
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 202
[37] Learning to Routing in UAV Swarm Network: A Multi-Agent Reinforcement Learning Approach
Wang, Zunliang
Yao, Haipeng
Mai, Tianle
Xiong, Zehui
Wu, Xiaohua
Wu, Di
Guo, Song
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 6611 - 6624
[38] Data-Driven Load Frequency Control Based on Multi-Agent Reinforcement Learning With Attention Mechanism
Yang, Fan
Huang, DongHua
Li, Dongdong
Lin, Shunfu
Muyeen, S. M.
Zhai, Haibao
IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (06) : 5560 - 5569
[39] Multi-agent Reinforcement Learning and Its Application to Wireless Network Communication
Pochaba, Sabrina
Dorfinger, Peter
Herlich, Matthias
Kwitt, Roland
Hirlaender, Simon
COMBINING, MODELLING AND ANALYZING IMPRECISION, RANDOMNESS AND DEPENDENCE, SMPS 2024, 2024, 1458 : 363 - 370
[40] Multi-agent Reinforcement Learning for Swarm Retrieval with Evolving Neural Network
Vaughan, Neil
BIOMIMETIC AND BIOHYBRID SYSTEMS, 2018, 10928 : 522 - 526

← 1 2 3 4 5 →