Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

被引:0
|
作者
Liu, Dongyu [1 ]
Li, Zexu [1 ]
Hu, Zeyu [1 ]
Li, Yong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Wireless Signal Proc & Network Lab, Beijing 100876, Peoples R China
来源
2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS) | 2018年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.
引用
收藏
页码:282 / 286
页数:5
相关论文
共 50 条
  • [41] Application of reinforcement learning to routing in distributed wireless networks: a review
    Al-Rawi, Hasan A. A.
    Ng, Ming Ann
    Yau, Kok-Lim Alvin
    ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (03) : 381 - 416
  • [42] An Achievable Throughput Scaling Law of Wireless Device-to-Device Caching Networks With Distributed MIMO and Hierarchical Cooperations
    Guo, Jiajia
    Yuan, Jinhong
    Zhang, Jian
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (01) : 492 - 505
  • [43] Device-Aware Routing and Scheduling In Multi-Hop Device-to-Device Networks
    Xing, Yuxuan
    Seferoglu, Hulya
    2017 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2017,
  • [44] Distributed Learning for Optimal Spectrum Access in Dense Device-to-Device Ad-Hoc Networks
    Boyarski, Tomer
    Wang, Wenbo
    Leshem, Amir
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 3149 - 3163
  • [45] A Quality-of-Service Routing Protocol with Supplementary Cooperation for Wireless Ad Hoc Networks
    Su, Szu-Lin
    Tsai, Yuan-Chun
    Yang, Yuan-Hung
    WIRELESS PERSONAL COMMUNICATIONS, 2015, 84 (03) : 1627 - 1645
  • [46] A Quality-of-Service Routing Protocol with Supplementary Cooperation for Wireless Ad Hoc Networks
    Szu-Lin Su
    Yuan-Chun Tsai
    Yuan-Hung Yang
    Wireless Personal Communications, 2015, 84 : 1627 - 1645
  • [47] Cooperative Reinforcement Learning for Adaptive Power Allocation in Device-to-Device Communication
    Khan, Muhidul Islam
    Alam, Muhammad Mahtab
    Le Moullec, Yannick
    Yaacoub, Elias
    2018 IEEE 4TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2018, : 476 - 481
  • [48] Reinforcement Learning Assisted Impersonation Attack Detection in Device-to-Device Communications
    Tu, Shanshan
    Waqas, Muhammad
    Rehman, Sadaqat Ur
    Mir, Talha
    Abbas, Ghulam
    Abbas, Ziaul Haq
    Halim, Zahid
    Ahmad, Iftekhar
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1474 - 1479
  • [49] Failure recovery in wireless content distribution networks with device-to-device cooperation
    Sharafeddine, Sanaa
    Jahed, Karim
    Farhat, Omar
    Dawy, Zaher
    COMPUTER NETWORKS, 2017, 128 : 108 - 122
  • [50] Wireless Device-to-Device Caching Networks: Basic Principles and System Performance
    Ji, Mingyue
    Caire, Giuseppe
    Molisch, Andreas F.
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2016, 34 (01) : 176 - 189