On Deep Reinforcement Learning for Traffic Engineering in SD-WAN

被引:51
|
作者
Troia, Sebastian [1 ,2 ]
Sapienza, Federico [1 ,3 ]
Vare, Leonardo [1 ,3 ]
Maier, Guido [1 ,2 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn DEIB, I-20133 Milan, Italy
[2] SWAN Networks, I-20124 Milan, Italy
[3] Huawei Italia, I-20147 Milan, Italy
关键词
Software-Defined Networking (SDN); Software-Defined Wide Area Network (SD-WAN); deep reinforcement learning; Enterprise Networking; NETWORKING;
D O I
10.1109/JSAC.2020.3041385
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The demand for reliable and efficient Wide Area Networks (WANs) from business customers is continuously increasing. Companies and enterprises use WANs to exchange critical data between headquarters, far-off business branches and cloud data centers. Many WANs solutions have been proposed over the years, such as: leased lines, Frame Relay, Multi-Protocol Label Switching (MPLS), Virtual Private Networks (VPN). Each solution positions differently in the trade-off between reliability, Quality of Service (QoS) and cost. Today, the emerging technology for WAN is Software-Defined Wide Area Networking (SD-WAN) that introduces the Software-Defined Networking (SDN) paradigm into the enterprise-network market. SD-WAN can support differentiated services over public WAN by dynamically reconfiguring in real-time network devices at the edge of the network according to network measurements and service requirements. On the one hand, SD-WAN reduces the high costs of guaranteed QoS WAN solutions (as MPLS), without giving away reliability in practical scenarios. On the other, it brings numerous technical challenges, such as the implementation of Traffic Engineering (TE) methods. TE is critically important for enterprises not only to efficiently orchestrate network traffic among the edge devices, but also to keep their services always available. In this work, we develop different kind of TE algorithms with the aim of improving the performance of an SD-WAN based network in terms of service availability. We first evaluate the performance of baseline TE algorithms. Then, we implement different deep Reinforcement Learning (deep-RL) algorithms to overcome the limitations of the baseline approaches. Specifically, we implement three kinds of deep-RL algorithms, which are: policy gradient, TD-lambda and deep Q-learning. Results show that a deep-RL algorithm with a well-designed reward function is capable of increasing the overall network availability and guaranteeing network protection and restoration in SD-WAN.
引用
收藏
页码:2198 / 2212
页数:15
相关论文
共 50 条
  • [31] Dynamic QoS for High Quality SD-WAN Overlays
    Quang, Pham Tran Anh
    Leguay, Jeremie
    Zeng, Feng
    Hou, Jianqiang
    Yu, Boyuan
    Restivo, Davide
    PROCEEDINGS OF 2024 IEEE/IFIP NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM, NOMS 2024, 2024,
  • [32] SD-WAN在物联网应用探索
    董炳泉
    广东通信技术, 2019, 39 (12) : 30 - 31+38
  • [33] SD-WAN组网产品发展实践浅析
    姚赟
    电信科学, 2020, 36(S1) (S1) : 152 - 158
  • [34] Distributed and Adaptive Traffic Engineering with Deep Reinforcement Learning
    Geng, Nan
    Xu, Mingwei
    Yang, Yuan
    Liu, Chenyi
    Yang, Jiahai
    Li, Qi
    Zhang, Shize
    2021 IEEE/ACM 29TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2021,
  • [35] Leveraging Deep Reinforcement Learning for Traffic Engineering: A Survey
    Xiao, Yang
    Liu, Jun
    Wu, Jiawei
    Ansari, Nirwan
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2021, 23 (04): : 2064 - 2097
  • [36] Intent-Based Policy Optimization in SD-WAN
    Pham Tran Anh Quang
    Martin, Sebastien
    Leguay, Jeremie
    Gong, Xu
    Zeng, Feng
    PROCEEDINGS OF THE 2021 SIGCOMM 2021 POSTER AND DEMO SESSIONS, SIGCOMM 2021 DEMOS AND POSTERS, 2024, : 74 - 75
  • [37] 基于SD-WAN构建SASE模型思路浅析
    李长连
    马季春
    蔺旋
    邮电设计技术, 2021, (06) : 78 - 83
  • [38] Improving SD-WAN Resilience: From Vertical Handoff to WAN-Aware MPTCP
    Zhang, Yang
    Tourrilhes, Jean
    Zhang, Zhi-Li
    Sharma, Puneet
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2021, 18 (01): : 347 - 361
  • [39] SD-WAN Flood Tracer: Tracking the entry points of DDoS attack flows in WAN
    Dayal, Neelam
    Srivastava, Shashank
    COMPUTER NETWORKS, 2021, 186
  • [40] DQR: An Efficient Deep Q-Based Routing Approach in Multi-Controller Software Defined WAN (SD-WAN)
    Majdoub, Manel
    El Kamel, Ali
    Youssef, Habib
    JOURNAL OF INTERCONNECTION NETWORKS, 2020, 20 (04)