Breaking the Inertial Thinking: Non-Blocking Multipath Congestion Control Based on the Single-Subflow Reinforcement Learning Model

被引：0

作者：

Wei, Dehui ^{[1
]}

Zhang, Jiao ^{[1
,2
]}

Li, Haozhe ^{[1
]}

Liu, Yuanjie ^{[1
]}

Zhang, Xuan ^{[1
]}

Pan, Tian ^{[1
,2
]}

Huang, Tao ^{[1
,2
]}

机构：

[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100088, Peoples R China

[2] Purple Mt Labs, Nanjing 211111, Peoples R China

来源：

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT | 2024年 / 21卷 / 03期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Training; Bandwidth; Adaptation models; Throughput; Packet loss; Linux; Kernel; Multipath TCP; congestion control; reinforcement learning; non-blocking; TCP; FAIRNESS; DESIGN; BBR;

D O I：

10.1109/TNSM.2024.3380049

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The Multipath TCP (MPTCP) protocol has received more attention due to the increasing number of terminals with multiple network interfaces. To meet the higher network performance demand of terminal services, many researches leverage reinforcement learning (RL) for MPTCP congestion control (CC) algorithms to improve the performance of MPTCP. However, we observe two limitations of existing RL-based mechanisms that make them impractical: 1) Fail to break the restriction of the input and output dimensions of RL, making the mechanisms unadaptable to the varying number of subflows. 2) Frequent model decisions block packet transmission, leading to under-utilization of bandwidth. This paper breaks the inertial thinking By "inertial thinking" here, we are referring to the initial reaction of others when dealing with CC in MPTCP. Given the interdependence between MPTCP subflows, scholars have traditionally opted for coupled CC. However, we have challenged this conventional thinking by independently handling the CC of different subflows in a single MPTCP flow and ensuring fairness. to overcome the above limitations and proposes Maggey, a non-blocking CC mechanism that applies the single-subflow model to multipath transmission. To this end, Maggey employs loosely coupled design principles and a unique reward function to ensure the fairness of the algorithm. Additionally, Maggey introduces iterative training to ensure the accuracy of training of the single-subflow model. Furthermore, a mode transition framework is artfully designed to avoid blocking, preserving the flexibility of RL-based CCs. These two features enhance the practicability of Maggey and the paper analyze the stability of Maggey. We implement Maggey in the Linux kernel and evaluate the performance of Maggey through extensive emulation and live experiments. The evaluation results show that Maggey boosts 26% throughput over DRL-CC at high bandwidth and improves 2%-60% throughput over traditional algorithms under different network conditions. Besides, Maggey maintains fairness in different scenarios.

引用

页码：2876 / 2887

页数：12

共 7 条

[1] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
Xu, Tao
Gong, Lina
Zhang, Wei
Li, Xuhong
Wang, Xia
Pan, Wenwen
ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955
[2] Fault Estimate and Reinforcement Learning Based Optimal Output Feedback Control for Single-Link Robot Arm Model
Liu, Sihan
Yan, Hailong
Zhao, Lixia
Gao, Dongxiang
ENGINEERING LETTERS, 2025, 33 (01) : 21 - 28
[3] Fault Estimate and Reinforcement Learning Based Optimal Output Feedback Control for Single-Link Robot Arm Model
School of Information Engineering, Liaoning Institute of Science and Engineering, Jinzhou
121010, China
不详
114051, China
Eng. Lett., 2025, 33 (01): : 21 - 28
[4] Model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on reinforcement learning
Guo, Lei
Zhao, Han
IET CONTROL THEORY AND APPLICATIONS, 2023, 17 (02): : 223 - 239
[5] Research on Path-Following Technology of a Single-Outboard-Motor Unmanned Surface Vehicle Based on Deep Reinforcement Learning and Model Predictive Control Algorithm
Cui, Bin
Chen, Yuanming
Hong, Xiaobin
Luo, Hao
Chen, Guanqiao
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (12)
[6] An efficient model-free adaptive optimal control of continuous-time nonlinear non-zero-sum games based on integral reinforcement learning with exploration
Guo, Lei
Xiong, Wenbo
Song, Yuan
Gan, Dongming
IET CONTROL THEORY AND APPLICATIONS, 2024, 18 (06): : 748 - 763
[7] Application and Clinical Value of Machine Learning-Based Cervical Cancer Diagnosis and Prediction Model in Adjuvant Chemotherapy for Cervical Cancer: A Single-Center, Controlled, Non-Arbitrary Size Case-Control Study
Wang, Yang
Shen, Lidan
Jin, Jun
Wang, Guohua
CONTRAST MEDIA & MOLECULAR IMAGING, 2022, 2022

← 1 →