A Reinforcement Learning Based Low-Delay Scheduling With Adaptive Transmission

被引:0
|
作者
Zhao, Yu [1 ]
Lee, Joohyun [1 ]
机构
[1] Hanyang Univ, Div Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Reinforcement Learning; delay-power tradeoff; adaptive transmission; infinite-horizon Markov Decision Process;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As modern communication systems become indispensable, the requirements for communication systems such as delay and power get more stringent. In this paper, we adopt a Reinforcement Learning (RL) based approach to obtain the optimal trade-off between delay and power consumption for a given power constraint in a communication system whose conditions (e.g., channel conditions, traffic arrival rates) can change over time. To this end, we first formulate this problem as an infinite-horizon Markov Decision Process (MDP) and then Q-learning is adopted to solve this problem. To handle the given power constraint, we apply the Lagrange multiplier method that transforms a constrained optimization problem into a non-constrained problem. Finally, via simulation, we show that Q-learning achieves the optimal policy.
引用
收藏
页码:916 / 919
页数:4
相关论文
共 50 条
  • [21] Reinforcement Learning for the Adaptive Scheduling of Educational Activities
    Bassen, Jonathan
    Balaji, Bharathan
    Schaarschmidt, Michael
    Thille, Candace
    Painter, Jay
    Zimmaro, Dawn
    Gamest, Alex
    Fast, Ethan
    Mitchell, John C.
    PROCEEDINGS OF THE 2020 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'20), 2020,
  • [22] BACKWARD ADAPTIVE LATTICE AND TRANSVERSAL PREDICTORS IN LOW-DELAY CELP CODERS
    AGUIAR, SRMF
    ALCAIM, A
    SIGNAL PROCESSING, 1994, 39 (1-2) : 201 - 214
  • [23] Efficient and Low-delay Task Scheduling for Big Data Clusters in A Theoretical Perspective
    Gao, Yuanxiang
    Yu, Hongfang
    Luo, Shouxi
    Yu, Shui
    2016 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2016,
  • [24] Reinforcement Learning Based VNF Scheduling with End-to-End Delay Guarantee
    Li, Junling
    Shi, Weisen
    Zhang, Ning
    Shen, Xuemin Sherman
    2019 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA (ICCC), 2019,
  • [25] Epidemic routing based on adaptive compression of vectors: efficient low-delay routing for opportunistic networks based on adaptive compression of vectors
    Ren, Zhi
    Peng, Shuang
    Chen, Hong
    Fang, Jun
    Chen, Qianbin
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2015, 28 (03) : 560 - 573
  • [26] LOW-DELAY SPEECH CODING
    CUPERMAN, V
    GERSHO, A
    SPEECH COMMUNICATION, 1993, 12 (02) : 193 - 204
  • [27] A Low-Delay MDCT/IMDCT
    Lee, Sangkil
    Lee, Insung
    ETRI JOURNAL, 2013, 35 (05) : 935 - 938
  • [28] Low-Delay Ultra-Small Packet Transmission With In-Network Aggregation via Distributed Stochastic Learning
    Zhang, Nannan
    Wang, Wei
    Xin, Xiaofeng
    Liu, Yuanwei
    Shan, Hangguan
    Huang, Aiping
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (05) : 2655 - 2669
  • [29] Low-delay and erior-robust wireless video transmission for video communications
    Wang, TC
    Fang, HC
    Chen, LG
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2002, 12 (12) : 1049 - 1058
  • [30] Deep Reinforcement Learning based Adaptive Transmission Control in Vehicular Networks
    Liu, Mingyuan
    Quan, Wei
    Yu, Chengxiao
    Zhang, Xue
    Gao, Deyun
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,