On Deep Reinforcement Learning for Traffic Engineering in SD-WAN

Cited by: 51
Authors
Troia, Sebastian [1, 2]
Sapienza, Federico [1, 3]
Vare, Leonardo [1, 3]
Maier, Guido [1, 2]
Affiliations
[1] Politecnico di Milano, Dipartimento di Elettronica, Informazione e Bioingegneria (DEIB), I-20133 Milan, Italy
[2] SWAN Networks, I-20124 Milan, Italy
[3] Huawei Italia, I-20147 Milan, Italy
Keywords
Software-Defined Networking (SDN); Software-Defined Wide Area Network (SD-WAN); deep reinforcement learning; Enterprise Networking; NETWORKING
DOI
10.1109/JSAC.2020.3041385
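Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract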
The demand for reliable and efficient Wide Area Networks (WANs) from business customers is continuously increasing. Companies and enterprises use WANs to exchange critical data between headquarters, remote branches and cloud data centers. Many WAN solutions have been proposed over the years, such as leased lines, Frame Relay, Multi-Protocol Label Switching (MPLS) and Virtual Private Networks (VPNs). Each solution strikes a different balance between reliability, Quality of Service (QoS) and cost. Today, the emerging WAN technology is Software-Defined Wide Area Networking (SD-WAN), which introduces the Software-Defined Networking (SDN) paradigm into the enterprise-network market. SD-WAN can support differentiated services over public WANs by dynamically reconfiguring the devices at the edge of the network in real time, according to network measurements and service requirements. On the one hand, SD-WAN reduces the high costs of guaranteed-QoS WAN solutions (such as MPLS) without sacrificing reliability in practical scenarios. On the other hand, it brings numerous technical challenges, such as the implementation of Traffic Engineering (TE) methods. TE is critically important for enterprises, not only to orchestrate network traffic efficiently among the edge devices, but also to keep their services always available. In this work, we develop different kinds of TE algorithms with the aim of improving the performance of an SD-WAN-based network in terms of service availability. We first evaluate the performance of baseline TE algorithms. Then, we implement different deep Reinforcement Learning (deep-RL) algorithms to overcome the limitations of the baseline approaches. Specifically, we implement three kinds of deep-RL algorithms: policy gradient, TD-lambda and deep Q-learning. Results show that a deep-RL algorithm with a well-designed reward function is capable of increasing the overall network availability and guaranteeing network protection and restoration in SD-WAN.
Pages: 2198-2212
Number of pages: 15
Related papers
50 in total
  • [11] Key Technologies of SD-WAN
    柴瑶琳
    穆琙博
    马军锋
    中兴通讯技术, 2019, 25 (02) : 15 - 19
  • [12] SD-WAN Empowers Digital Government
    顾玮
    软件和集成电路, 2018, (04) : 78 - 79
  • [13] Impact Analysis of Tunnel Probing Protocol on SD-WAN's Mainstream Traffic
    Iddalagi, Pavan
    Mishra, Amrita
    2023 15TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS, COMSNETS, 2023,
  • [14] A predictive SD-WAN traffic management method for IoT networks in multi-datacenters using deep RNN
    Absardi, Zeinab Nazemi
    Javidan, Reza
    IET COMMUNICATIONS, 2024, 18 (18) : 1151 - 1165
  • [16] Analysis of SD-WAN Transmission Rate-Limiting Strategies
    赵纯熙
    童博
    刘锦波
    邮电设计技术, 2021, (10) : 78 - 82
  • [17] SD-WAN Is on the Rise: Who Will Take the Lead?
    梅雅鑫
    通信世界, 2021, (14) : 34 - 35
  • [18] SD-WAN Improves Enterprise Agility
    顾玮
    软件和集成电路, 2016, (01) : 38 - 39
  • [19] Request delay and survivability optimization for software defined-wide area networking (SD-WAN) using multi-agent deep reinforcement learning
    Ouamri, Mohamed Amine
    Azni, Mohamed
    Singh, Daljeet
    Almughalles, Waleed
    Muthanna, Mohammed Saleh Ali
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2023, 34 (07)
  • [20] Global QoS Policy Optimization in SD-WAN
    Quang, Pham Tran Anh
    Leguay, Jeremie
    Xu Gong
    Xu Huiying
    2023 IEEE 9TH INTERNATIONAL CONFERENCE ON NETWORK SOFTWARIZATION, NETSOFT, 2023, : 202 - 206