On Deep Reinforcement Learning for Traffic Engineering in SD-WAN

被引:51
|
作者
Troia, Sebastian [1 ,2 ]
Sapienza, Federico [1 ,3 ]
Vare, Leonardo [1 ,3 ]
Maier, Guido [1 ,2 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn DEIB, I-20133 Milan, Italy
[2] SWAN Networks, I-20124 Milan, Italy
[3] Huawei Italia, I-20147 Milan, Italy
关键词
Software-Defined Networking (SDN); Software-Defined Wide Area Network (SD-WAN); deep reinforcement learning; Enterprise Networking; NETWORKING;
D O I
10.1109/JSAC.2020.3041385
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The demand for reliable and efficient Wide Area Networks (WANs) from business customers is continuously increasing. Companies and enterprises use WANs to exchange critical data between headquarters, far-off business branches and cloud data centers. Many WANs solutions have been proposed over the years, such as: leased lines, Frame Relay, Multi-Protocol Label Switching (MPLS), Virtual Private Networks (VPN). Each solution positions differently in the trade-off between reliability, Quality of Service (QoS) and cost. Today, the emerging technology for WAN is Software-Defined Wide Area Networking (SD-WAN) that introduces the Software-Defined Networking (SDN) paradigm into the enterprise-network market. SD-WAN can support differentiated services over public WAN by dynamically reconfiguring in real-time network devices at the edge of the network according to network measurements and service requirements. On the one hand, SD-WAN reduces the high costs of guaranteed QoS WAN solutions (as MPLS), without giving away reliability in practical scenarios. On the other, it brings numerous technical challenges, such as the implementation of Traffic Engineering (TE) methods. TE is critically important for enterprises not only to efficiently orchestrate network traffic among the edge devices, but also to keep their services always available. In this work, we develop different kind of TE algorithms with the aim of improving the performance of an SD-WAN based network in terms of service availability. We first evaluate the performance of baseline TE algorithms. Then, we implement different deep Reinforcement Learning (deep-RL) algorithms to overcome the limitations of the baseline approaches. Specifically, we implement three kinds of deep-RL algorithms, which are: policy gradient, TD-lambda and deep Q-learning. Results show that a deep-RL algorithm with a well-designed reward function is capable of increasing the overall network availability and guaranteeing network protection and restoration in SD-WAN.
引用
收藏
页码:2198 / 2212
页数:15
相关论文
共 50 条
  • [41] 云架构下的SD-WAN技术探讨
    李德伟
    通讯世界, 2020, 27 (02) : 13 - 14
  • [43] SD-WAN在企业中的应用分析
    陈伟锋
    黄树鑫
    周自强
    刘青梅
    朱靖雯
    曹志源
    网络安全技术与应用, 2023, (08) : 1 - 2
  • [44] SD-WAN技术优势及应用分析
    王林
    周崇杰
    科技风, 2018, (01) : 60+67 - 60
  • [45] 部署SD-WAN的六大优势
    AP
    电脑知识与技术(经验技巧), 2017, (12) : 114 - 115
  • [46] SD-WAN在企业组网中的运用
    苏彪
    中国新通信, 2021, 23 (13) : 78 - 79
  • [47] Software-Defined Virtual Private Network for SD-WAN
    Fu, Chunle
    Wang, Bailing
    Liu, Hongri
    Wang, Wei
    ELECTRONICS, 2024, 13 (13)
  • [48] SD-WAN在企业组网中的运用
    伍远娟
    杨梦皎
    陈宁
    电声技术, 2023, 47 (09) : 65 - 69
  • [50] SD-WAN多云聚合平台接入方案研究
    牛佳
    颜永明
    赵乾艮
    闵佳俊
    祁松阳
    电信科学, 2022, 38 (02) : 130 - 138