Breaking the Inertial Thinking: Non-Blocking Multipath Congestion Control Based on the Single-Subflow Reinforcement Learning Model

被引:0
|
作者
Wei, Dehui [1 ]
Zhang, Jiao [1 ,2 ]
Li, Haozhe [1 ]
Liu, Yuanjie [1 ]
Zhang, Xuan [1 ]
Pan, Tian [1 ,2 ]
Huang, Tao [1 ,2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100088, Peoples R China
[2] Purple Mt Labs, Nanjing 211111, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Training; Bandwidth; Adaptation models; Throughput; Packet loss; Linux; Kernel; Multipath TCP; congestion control; reinforcement learning; non-blocking; TCP; FAIRNESS; DESIGN; BBR;
D O I
10.1109/TNSM.2024.3380049
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Multipath TCP (MPTCP) protocol has received more attention due to the increasing number of terminals with multiple network interfaces. To meet the higher network performance demand of terminal services, many researches leverage reinforcement learning (RL) for MPTCP congestion control (CC) algorithms to improve the performance of MPTCP. However, we observe two limitations of existing RL-based mechanisms that make them impractical: 1) Fail to break the restriction of the input and output dimensions of RL, making the mechanisms unadaptable to the varying number of subflows. 2) Frequent model decisions block packet transmission, leading to under-utilization of bandwidth. This paper breaks the inertial thinking By "inertial thinking" here, we are referring to the initial reaction of others when dealing with CC in MPTCP. Given the interdependence between MPTCP subflows, scholars have traditionally opted for coupled CC. However, we have challenged this conventional thinking by independently handling the CC of different subflows in a single MPTCP flow and ensuring fairness. to overcome the above limitations and proposes Maggey, a non-blocking CC mechanism that applies the single-subflow model to multipath transmission. To this end, Maggey employs loosely coupled design principles and a unique reward function to ensure the fairness of the algorithm. Additionally, Maggey introduces iterative training to ensure the accuracy of training of the single-subflow model. Furthermore, a mode transition framework is artfully designed to avoid blocking, preserving the flexibility of RL-based CCs. These two features enhance the practicability of Maggey and the paper analyze the stability of Maggey. We implement Maggey in the Linux kernel and evaluate the performance of Maggey through extensive emulation and live experiments. The evaluation results show that Maggey boosts 26% throughput over DRL-CC at high bandwidth and improves 2%-60% throughput over traditional algorithms under different network conditions. Besides, Maggey maintains fairness in different scenarios.
引用
收藏
页码:2876 / 2887
页数:12
相关论文
共 7 条
  • [1] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
    Xu, Tao
    Gong, Lina
    Zhang, Wei
    Li, Xuhong
    Wang, Xia
    Pan, Wenwen
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
  • [2] Fault Estimate and Reinforcement Learning Based Optimal Output Feedback Control for Single-Link Robot Arm Model
    Liu, Sihan
    Yan, Hailong
    Zhao, Lixia
    Gao, Dongxiang
    ENGINEERING LETTERS, 2025, 33 (01) : 21 - 28
  • [3] Fault Estimate and Reinforcement Learning Based Optimal Output Feedback Control for Single-Link Robot Arm Model
    School of Information Engineering, Liaoning Institute of Science and Engineering, Jinzhou
    121010, China
    不详
    114051, China
    Eng. Lett., 2025, 33 (01): : 21 - 28
  • [4] Model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on reinforcement learning
    Guo, Lei
    Zhao, Han
    IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (02): : 223 - 239
  • [5] Research on Path-Following Technology of a Single-Outboard-Motor Unmanned Surface Vehicle Based on Deep Reinforcement Learning and Model Predictive Control Algorithm
    Cui, Bin
    Chen, Yuanming
    Hong, Xiaobin
    Luo, Hao
    Chen, Guanqiao
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (12)
  • [6] An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration
    Guo, Lei
    Xiong, Wenbo
    Song, Yuan
    Gan, Dongming
    IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (06): : 748 - 763
  • [7] Application and Clinical Value of Machine Learning-Based Cervical Cancer Diagnosis and Prediction Model in Adjuvant Chemotherapy for Cervical Cancer: A Single-Center, Controlled, Non-Arbitrary Size Case-Control Study
    Wang, Yang
    Shen, Lidan
    Jin, Jun
    Wang, Guohua
    CONTRAST MEDIA & MOLECULAR IMAGING, 2022, 2022