Multi-Agent Reinforcement Learning for Dynamic Topology Optimization of Mesh Wireless Networks

被引：0

作者：

Sun, Wei ^{[1
,2
]}

Lv, Qiushuo ^{[1
,2
]}

Xiao, Yang ^{[3
]}

Liu, Zhi ^{[4
]}

Tang, Qingwei ^{[1
,2
]}

Li, Qiyue ^{[1
,2
]}

Mu, Daoming ^{[1
,2
]}

机构：

[1] Hefei Univ Technol, Sch Elect & Automat Engn, Hefei 230009, Anhui, Peoples R China

[2] Anhui Engn Technol Res Ctr Ind Automat, Hefei 230009, Peoples R China

[3] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA

[4] Univ Electrocommun, Dept Comp & Network Engn, Tokyo 1828585, Japan

来源：

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS | 2024年 / 23卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Delays; Trajectory; Topology; Network topology; Vectors; Wireless networks; Logic gates; Actor-critic; mesh wireless network; reinforcement learning; topology optimization; ad hoc wireless network; IEEE-802.11; SCHEME;

D O I：

10.1109/TWC.2024.3372694

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In Mesh Wireless Networks (MWNs), the network coverage is extended by connecting Access Points (APs) in a mesh topology, where transmitting frames by multi-hop routing has to sustain the performances, such as end-to-end (E2E) delay and channel efficiency. Several recent studies have focused on minimizing E2E delay, but these methods are unable to adapt to the dynamic nature of MWNs. Meanwhile, reinforcement-learning-based methods offer better adaptability to dynamics but suffer from the problem of high-dimensional action spaces, leading to slower convergence. In this paper, we propose a multi-agent actor-critic reinforcement learning (MACRL) algorithm to optimize multiple objectives, specifically the minimization of E2E delay and the enhancement of channel efficiency. First, to reduce the action space and speed up the convergence in the dynamical optimization process, a centralized-critic-distributed-actor scheme is proposed. Then, a multi-objective reward balancing method is designed to dynamically balance the MWNs' performances between the E2E delay and the channel efficiency. Finally, the trained MACRL algorithm is deployed in the QaulNet simulator to verify its effectiveness.

引用

页码：10501 / 10513

页数：13

共 50 条

[1] Multi-agent reinforcement learning based dynamic self-coordinated topology optimization for wireless mesh networks
Tanga, Qingwei
Sun, Wei
Liu, Zhi
Li, Qiyue
Yuan, Xiaohui
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2025, 239
[2] Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks
Nasir, Yasar Sinan
Guo, Dongning
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2239 - 2250
[3] Dynamic Multi-Agent Reinforcement Learning for Control Optimization
Fagan, Derek
Meier, Rene
PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 99 - 104
[4] Efficient Communications for Multi-Agent Reinforcement Learning in Wireless Networks
Lv, Zefang
Du, Yousong
Chen, Yifan
Xiao, Liang
Han, Shuai
Ji, Xiangyang
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 583 - 588
[5] Conjectural Variations in Multi-agent Reinforcement Learning for Energy-Efficient Cognitive Wireless Mesh Networks
Chen, Xianfu
Zhao, Zhifeng
Zhang, Honggang
Chen, Tao
2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012,
[6] Multi-Agent Reinforcement Learning for Wireless Networks Against Adversarial Communications
Lv, Zefang
Chen, Yifan
Xiao, Liang
Yang, Helin
Ji, Xiangyang
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3409 - 3414
[7] Online Multi-Agent Reinforcement Learning for Multiple Access in Wireless Networks
Xiao, Jianbin
Chen, Zhenyu
Sun, Xinghua
Zhan, Wen
Wang, Xijun
Chen, Xiang
IEEE COMMUNICATIONS LETTERS, 2023, 27 (12) : 3250 - 3254
[8] Emergent Communication in Multi-Agent Reinforcement Learning for Future Wireless Networks
Chafii M.
Naoumi S.
Alami R.
Almazrouei E.
Bennis M.
Debbah M.
IEEE Internet of Things Magazine, 2023, 6 (04): : 18 - 24
[9] DynAMO: Multi-agent reinforcement learning for dynamic anticipatory mesh optimization with applications to hyperbolic conservation laws
Dzanic, T.
Mittal, K.
Kim, D.
Yang, J.
Petrides, S.
Keith, B.
Anderson, R.
JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 506
[10] Multi-agent reinforcement learning based dynamic optimization algorithm of CRE offset for heterogeneous networks
Zhang C.
Zhu J.
Liu Z.
Huang Y.
Tongxin Xuebao/Journal on Communications, 2023, 44 (12): : 86 - 98

← 1 2 3 4 5 →