Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

被引：0

作者：

Liu, Dongyu ^{[1
]}

Li, Zexu ^{[1
]}

Hu, Zeyu ^{[1
]}

Li, Yong ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Wireless Signal Proc & Network Lab, Beijing 100876, Peoples R China

来源：

2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS) | 2018年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.

引用

页码：282 / 286

页数：5

共 50 条

[21] On the design of a quality-of-service driven routing protocol for wireless cooperative networks
Sheng, Zhengguo
Ding, Zhiguo
Leung, Kin K.
2008 IEEE 67TH VEHICULAR TECHNOLOGY CONFERENCE-SPRING, VOLS 1-7, 2008, : 624 - 628
[22] On Power and Quality of Service Tradeoff in Device-to-Device Communication
Sun, Fengyou
Jiang, Yuming
2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION WORKSHOP (ICCW), 2015, : 614 - 619
[23] Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks
Asheralieva, Alia
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (08) : 5016 - 5032
[24] Distributed resource optimisation using the Q-learning algorithm, in device-to-device communication: A reinforcement learning paradigm
Jayakumar, Steffi
Nandakumar, S.
RESULTS IN ENGINEERING, 2024, 23
[25] On Some Sufficient Conditions for Distributed Quality-of-Service Support in Wireless Networks
Ganesan, Ashwin
2009 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & COMMUNICATIONS (NETCOM 2009), 2009, : 301 - 306
[26] Performance of sufficient conditions for distributed quality-of-service support in wireless networks
Ashwin Ganesan
Wireless Networks, 2014, 20 : 1321 - 1334
[27] On some sufficient conditions for distributed quality-of-service support in wireless networks
Department of Information Technology, K. J. Somaiya College of Engineering, Vidyavihar, Mumbai-77, India
Int. Conf. Netw. Commun., NetCoM, 1600, (301-306):
[28] Social Network Aware Device-to-Device Communication in Wireless Networks
Zhang, Yanru
Pan, Erte
Song, Lingyang
Saad, Walid
Dawy, Zaher
Han, Zhu
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (01) : 177 - 190
[29] Distributed Resource Allocation in Device-to-Device Enhanced Cellular Networks
Ye, Qiaoyang
Al-Shalash, Mazin
Caramanis, Constantine
Andrews, Jeffrey G.
IEEE TRANSACTIONS ON COMMUNICATIONS, 2015, 63 (02) : 441 - 454
[30] Evaluation of Building Wireless Performance for Indoor Device-to-Device Networks
Lin, Jixuan
Hu, Haonan
Huang, Yixin
Zhang, Jiliang
Zhang, Jie
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2024, 8 (01): : 150 - 161

← 1 2 3 4 5 →