Scalable Reinforcement Learning for Dynamic Overlay Selection in SD-WANs

Cited by: 4
Authors
Botta, Alessio [1]
Canonico, Roberto [1]
Navarro, Annalisa [1]
Stanco, Giovanni [1]
Ventre, Giorgio [1]
Affiliations
[1] Univ Napoli Federico II, DIETI Dept, Naples, Italy
Keywords
SDN; SD-WAN; Traffic Engineering; Reinforcement Learning; Scalability
DOI
10.23919/IFIPNetworking57963.2023.10186399
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
SD-WAN promises to let distributed enterprises meet their dynamic communication requirements over the public Internet at substantially lower cost and with better performance than dedicated lines. It builds interconnections between users or applications at remote sites by exploiting all available transport connections (e.g., Internet, MPLS), but how to combine them to enhance communication performance is still an open challenge. Previous work investigated the use of Reinforcement Learning in the SD-WAN control logic to solve this problem, but it only considered simple scenarios consisting of two sites connected by two paths. In this paper we move a step forward and ask whether this promising approach can scale to WANs spanning multiple distributed sites connected through several paths. We first conduct an analytical study of the complexity of Reinforcement Learning that considers how the action and state spaces grow as the number of sites and paths increases. We then propose a solution based on Multi-Agent Reinforcement Learning (MARL) that helps reduce the overall complexity by assigning one agent to each site. Finally, we demonstrate the effectiveness of our solution through real experiments in an emulated environment, showing that it is not only viable but also reduces network policy violations, latency, and transit costs in a multi-site scenario.
Pages: 9