Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

被引：0

作者：

Liu, Dongyu ^{[1
]}

Li, Zexu ^{[1
]}

Hu, Zeyu ^{[1
]}

Li, Yong ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Wireless Signal Proc & Network Lab, Beijing 100876, Peoples R China

来源：

2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS) | 2018年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.

引用

页码：282 / 286

页数：5

共 50 条

[41] Application of reinforcement learning to routing in distributed wireless networks: a review
Al-Rawi, Hasan A. A.
Ng, Ming Ann
Yau, Kok-Lim Alvin
ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (03) : 381 - 416
[42] An Achievable Throughput Scaling Law of Wireless Device-to-Device Caching Networks With Distributed MIMO and Hierarchical Cooperations
Guo, Jiajia
Yuan, Jinhong
Zhang, Jian
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (01) : 492 - 505
[43] Device-Aware Routing and Scheduling In Multi-Hop Device-to-Device Networks
Xing, Yuxuan
Seferoglu, Hulya
2017 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2017,
[44] Distributed Learning for Optimal Spectrum Access in Dense Device-to-Device Ad-Hoc Networks
Boyarski, Tomer
Wang, Wenbo
Leshem, Amir
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 3149 - 3163
[45] A Quality-of-Service Routing Protocol with Supplementary Cooperation for Wireless Ad Hoc Networks
Su, Szu-Lin
Tsai, Yuan-Chun
Yang, Yuan-Hung
WIRELESS PERSONAL COMMUNICATIONS, 2015, 84 (03) : 1627 - 1645
[46] A Quality-of-Service Routing Protocol with Supplementary Cooperation for Wireless Ad Hoc Networks
Szu-Lin Su
Yuan-Chun Tsai
Yuan-Hung Yang
Wireless Personal Communications, 2015, 84 : 1627 - 1645
[47] Cooperative Reinforcement Learning for Adaptive Power Allocation in Device-to-Device Communication
Khan, Muhidul Islam
Alam, Muhammad Mahtab
Le Moullec, Yannick
Yaacoub, Elias
2018 IEEE 4TH WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2018, : 476 - 481
[48] Reinforcement Learning Assisted Impersonation Attack Detection in Device-to-Device Communications
Tu, Shanshan
Waqas, Muhammad
Rehman, Sadaqat Ur
Mir, Talha
Abbas, Ghulam
Abbas, Ziaul Haq
Halim, Zahid
Ahmad, Iftekhar
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (02) : 1474 - 1479
[49] Failure recovery in wireless content distribution networks with device-to-device cooperation
Sharafeddine, Sanaa
Jahed, Karim
Farhat, Omar
Dawy, Zaher
COMPUTER NETWORKS, 2017, 128 : 108 - 122
[50] Wireless Device-to-Device Caching Networks: Basic Principles and System Performance
Ji, Mingyue
Caire, Giuseppe
Molisch, Andreas F.
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2016, 34 (01) : 176 - 189

← 1 2 3 4 5 →