Distributed Reinforcement Learning for Quality-of-Service Routing in Wireless Device-to-device Networks

被引:0
|
作者
Liu, Dongyu [1 ]
Li, Zexu [1 ]
Hu, Zeyu [1 ]
Li, Yong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Minist Educ, Key Lab Universal Wireless Commun, Wireless Signal Proc & Network Lab, Beijing 100876, Peoples R China
来源
2018 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC WORKSHOPS) | 2018年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we aim to determine the multi-hop route between a device-to-device (D2D) source-destination pair which meets the quality-of-service (QoS) of services. We model this QoS routing problem in D2D as a Markov decision process (MDP) and proposes a distributed multi-agent reinforcement learning routing algorithm. We consider the QoS requirements in terms of bandwidth, delay, and packet loss rate, and allocate the routing path according to link information averaged over time in dynamic network environments. By decomposing the Q-function into multiple local Q-functions, each agent can compute its own optimal strategy based on local observations, which greatly reduces the costs of learning and searching in large-scale multi-state systems. The simulation results show that the proposed algorithm can significantly reduce the average end-to-end delay, the average packet loss rate and service rejection rate compared with both the minimum hop algorithm and the traditional routing algorithm which only considers static parameters.
引用
收藏
页码:282 / 286
页数:5
相关论文
共 50 条
  • [21] On the design of a quality-of-service driven routing protocol for wireless cooperative networks
    Sheng, Zhengguo
    Ding, Zhiguo
    Leung, Kin K.
    2008 IEEE 67TH VEHICULAR TECHNOLOGY CONFERENCE-SPRING, VOLS 1-7, 2008, : 624 - 628
  • [22] On Power and Quality of Service Tradeoff in Device-to-Device Communication
    Sun, Fengyou
    Jiang, Yuming
    2015 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION WORKSHOP (ICCW), 2015, : 614 - 619
  • [23] Bayesian Reinforcement Learning-Based Coalition Formation for Distributed Resource Sharing by Device-to-Device Users in Heterogeneous Cellular Networks
    Asheralieva, Alia
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2017, 16 (08) : 5016 - 5032
  • [24] Distributed resource optimisation using the Q-learning algorithm, in device-to-device communication: A reinforcement learning paradigm
    Jayakumar, Steffi
    Nandakumar, S.
    RESULTS IN ENGINEERING, 2024, 23
  • [25] On Some Sufficient Conditions for Distributed Quality-of-Service Support in Wireless Networks
    Ganesan, Ashwin
    2009 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & COMMUNICATIONS (NETCOM 2009), 2009, : 301 - 306
  • [26] Performance of sufficient conditions for distributed quality-of-service support in wireless networks
    Ashwin Ganesan
    Wireless Networks, 2014, 20 : 1321 - 1334
  • [27] On some sufficient conditions for distributed quality-of-service support in wireless networks
    Department of Information Technology, K. J. Somaiya College of Engineering, Vidyavihar, Mumbai-77, India
    Int. Conf. Netw. Commun., NetCoM, 1600, (301-306):
  • [28] Social Network Aware Device-to-Device Communication in Wireless Networks
    Zhang, Yanru
    Pan, Erte
    Song, Lingyang
    Saad, Walid
    Dawy, Zaher
    Han, Zhu
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, 14 (01) : 177 - 190
  • [29] Distributed Resource Allocation in Device-to-Device Enhanced Cellular Networks
    Ye, Qiaoyang
    Al-Shalash, Mazin
    Caramanis, Constantine
    Andrews, Jeffrey G.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2015, 63 (02) : 441 - 454
  • [30] Evaluation of Building Wireless Performance for Indoor Device-to-Device Networks
    Lin, Jixuan
    Hu, Haonan
    Huang, Yixin
    Zhang, Jiliang
    Zhang, Jie
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2024, 8 (01): : 150 - 161