Multi-Agent Reinforcement Learning for Dynamic Topology Optimization of Mesh Wireless Networks

被引:0
|
作者
Sun, Wei [1 ,2 ]
Lv, Qiushuo [1 ,2 ]
Xiao, Yang [3 ]
Liu, Zhi [4 ]
Tang, Qingwei [1 ,2 ]
Li, Qiyue [1 ,2 ]
Mu, Daoming [1 ,2 ]
机构
[1] Hefei Univ Technol, Sch Elect & Automat Engn, Hefei 230009, Anhui, Peoples R China
[2] Anhui Engn Technol Res Ctr Ind Automat, Hefei 230009, Peoples R China
[3] Univ Alabama, Dept Comp Sci, Tuscaloosa, AL 35487 USA
[4] Univ Electrocommun, Dept Comp & Network Engn, Tokyo 1828585, Japan
基金
中国国家自然科学基金;
关键词
Delays; Trajectory; Topology; Network topology; Vectors; Wireless networks; Logic gates; Actor-critic; mesh wireless network; reinforcement learning; topology optimization; ad hoc wireless network; IEEE-802.11; SCHEME;
D O I
10.1109/TWC.2024.3372694
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In Mesh Wireless Networks (MWNs), the network coverage is extended by connecting Access Points (APs) in a mesh topology, where transmitting frames by multi-hop routing has to sustain the performances, such as end-to-end (E2E) delay and channel efficiency. Several recent studies have focused on minimizing E2E delay, but these methods are unable to adapt to the dynamic nature of MWNs. Meanwhile, reinforcement-learning-based methods offer better adaptability to dynamics but suffer from the problem of high-dimensional action spaces, leading to slower convergence. In this paper, we propose a multi-agent actor-critic reinforcement learning (MACRL) algorithm to optimize multiple objectives, specifically the minimization of E2E delay and the enhancement of channel efficiency. First, to reduce the action space and speed up the convergence in the dynamical optimization process, a centralized-critic-distributed-actor scheme is proposed. Then, a multi-objective reward balancing method is designed to dynamically balance the MWNs' performances between the E2E delay and the channel efficiency. Finally, the trained MACRL algorithm is deployed in the QaulNet simulator to verify its effectiveness.
引用
收藏
页码:10501 / 10513
页数:13
相关论文
共 50 条
  • [1] Multi-agent reinforcement learning based dynamic self-coordinated topology optimization for wireless mesh networks
    Tanga, Qingwei
    Sun, Wei
    Liu, Zhi
    Li, Qiyue
    Yuan, Xiaohui
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2025, 239
  • [2] Multi-Agent Deep Reinforcement Learning for Dynamic Power Allocation in Wireless Networks
    Nasir, Yasar Sinan
    Guo, Dongning
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2239 - 2250
  • [3] Dynamic Multi-Agent Reinforcement Learning for Control Optimization
    Fagan, Derek
    Meier, Rene
    PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 99 - 104
  • [4] Efficient Communications for Multi-Agent Reinforcement Learning in Wireless Networks
    Lv, Zefang
    Du, Yousong
    Chen, Yifan
    Xiao, Liang
    Han, Shuai
    Ji, Xiangyang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 583 - 588
  • [5] Conjectural Variations in Multi-agent Reinforcement Learning for Energy-Efficient Cognitive Wireless Mesh Networks
    Chen, Xianfu
    Zhao, Zhifeng
    Zhang, Honggang
    Chen, Tao
    2012 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2012,
  • [6] Multi-Agent Reinforcement Learning for Wireless Networks Against Adversarial Communications
    Lv, Zefang
    Chen, Yifan
    Xiao, Liang
    Yang, Helin
    Ji, Xiangyang
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 3409 - 3414
  • [7] Online Multi-Agent Reinforcement Learning for Multiple Access in Wireless Networks
    Xiao, Jianbin
    Chen, Zhenyu
    Sun, Xinghua
    Zhan, Wen
    Wang, Xijun
    Chen, Xiang
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (12) : 3250 - 3254
  • [8] Emergent Communication in Multi-Agent Reinforcement Learning for Future Wireless Networks
    Chafii M.
    Naoumi S.
    Alami R.
    Almazrouei E.
    Bennis M.
    Debbah M.
    IEEE Internet of Things Magazine, 2023, 6 (04): : 18 - 24
  • [9] DynAMO: Multi-agent reinforcement learning for dynamic anticipatory mesh optimization with applications to hyperbolic conservation laws
    Dzanic, T.
    Mittal, K.
    Kim, D.
    Yang, J.
    Petrides, S.
    Keith, B.
    Anderson, R.
    JOURNAL OF COMPUTATIONAL PHYSICS, 2024, 506
  • [10] Multi-agent reinforcement learning based dynamic optimization algorithm of CRE offset for heterogeneous networks
    Zhang C.
    Zhu J.
    Liu Z.
    Huang Y.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (12): : 86 - 98