Scalable Reinforcement Learning for Dynamic Overlay Selection in SD-WANs

被引:4
|
作者
Botta, Alessio [1 ]
Canonico, Roberto [1 ]
Navarro, Annalisa [1 ]
Stanco, Giovanni [1 ]
Ventre, Giorgio [1 ]
机构
[1] Univ Napoli Federico II, DIETI Dept, Naples, Italy
关键词
SDN; SD-WAN; Traffic Engineering; Reinforcement Learning; Scalability;
D O I
10.23919/IFIPNetworking57963.2023.10186399
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
SD-WAN promises distributed enterprises to satisfy their dynamic communication requirements over the public Internet with a substantial cost reduction and enhanced performance compared to dedicated lines. It builds interconnections between users or applications in remote sites by exploiting all available transport connections (e.g. Internet, MPLS,...), but how to combine them to enhance communication performance is still an open challenge. Previous work investigated the use of Reinforcement Learning in the SD-WAN control logic to solve this problem, but they only considered simple scenarios consisting of two sites connected by two paths. In this paper we move a step forward and pose the question of whether such a promising approach can scale to WANs spanning multiple distributed sites connected through several paths. We first conduct an analytical study of the complexity of Reinforcement Learning that considers the increase of action and state spaces when the number of sites and paths grows. We then propose a solution based on Multi-Agent Reinforcement Learning (MARL) that helps reducing the overall complexity by leveraging an agent for each site. Finally, we show the effectiveness of our solution with real experiments in an emulated environment, showing that not only it is viable, but it also achieves a reduction in network policy violations, latency, and transit costs in a multi-site scenario.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Dynamic channel selection with reinforcement learning for cognitive WLAN over fiber
    Li, Yi
    Ji, Hong
    Li, Xi
    Leung, Victor C. M.
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2012, 25 (08) : 1077 - 1090
  • [32] Dynamic node selection in camera networks based on approximate reinforcement learning
    Li, Qian
    Sun, Zhengxing
    Chen, Songle
    Xia, Shiming
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (24) : 17393 - 17419
  • [33] Continual portfolio selection in dynamic environments via incremental reinforcement learning
    Shu Liu
    Bo Wang
    Huaxiong Li
    Chunlin Chen
    Zhi Wang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 269 - 279
  • [34] Complexity of Spectrum Activity and Benefits of Reinforcement Learning for Dynamic Channel Selection
    Macaluso, Irene
    Finn, Danny
    Oezguel, Baris
    DaSilva, Luiz A.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2013, 31 (11) : 2237 - 2248
  • [35] Dynamic node selection in camera networks based on approximate reinforcement learning
    Qian Li
    Zhengxing Sun
    Songle Chen
    Shiming Xia
    Multimedia Tools and Applications, 2016, 75 : 17393 - 17419
  • [36] Continual portfolio selection in dynamic environments via incremental reinforcement learning
    Liu, Shu
    Wang, Bo
    Li, Huaxiong
    Chen, Chunlin
    Wang, Zhi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (01) : 269 - 279
  • [37] ACDRL: A constraint-guided dynamic reinforcement learning for index selection
    Yan, Yu
    Geng, Jian
    Wang, Hongzhi
    Zhang, Kaixin
    Wang, Yuzhuo
    INFORMATION SCIENCES, 2025, 689
  • [38] Scalable Multitask Policy Gradient Reinforcement Learning
    El Bsat, Salam
    Ammar, Haitham Bou
    Taylor, Matthew E.
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1847 - 1853
  • [39] Model selection in reinforcement learning
    Farahmand, Amir-massoud
    Szepesvari, Csaba
    MACHINE LEARNING, 2011, 85 (03) : 299 - 332
  • [40] Scalable Reinforcement Learning for Multiagent Networked Systems
    Qu, Guannan
    Wierman, Adam
    Li, Na
    OPERATIONS RESEARCH, 2022, 70 (06) : 3601 - 3628