RL-based Transmission Completion Time Minimization with Energy Harvesting for Time-varying Channels

被引:0
|
作者
Kim, Heasung [1 ]
Shin, Wonjae [2 ]
Yang, Heecheol [3 ]
Lee, Jungwoo [1 ]
机构
[1] Seoul Natl Univ, Dept Elect & Comp Engn, Seoul, South Korea
[2] Pusan Natl Univ, Dept Elect Engn, Busan, South Korea
[3] Kumoh Natl Inst Technol, Sch Elect Engn, Gumi, South Korea
关键词
energy harvesting communications; transmission completion time minimization; reinforcement learning; DELAY MINIMIZATION;
D O I
10.1109/iccworkshops49005.2020.9145452
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we consider the problem of minimizing the transmission completion time in energy harvesting devices on time-varying channels with a reinforcement learning approach. Because of the randomness of energy arrival and fading channel in wireless communications, a reinforcement learning algorithm often converges to suboptimal points with a degraded performance. To solve this problem, we first prove that the expected discounted reward sum in the environment is an increasing function of negative time, amount of data sent, channel gain, harvested energy, and remaining battery. We leverage this proof to construct a partially monotonic network that efficiently approximates the optimal action-value function for learning. Experimental results show that our approach with the exploitation of the partial monotonicity of the desired function achieves better performance than existing power allocation policies. Further experiments show that the performance of our learning-based approach is close to the theoretical upper bound over rapidly time-varying channels.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] On the capacity of linear time-varying channels
    Barbarossa, Sergio
    Scaglione, Anna
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 5 : 2627 - 2630
  • [32] On the capacity of linear time-varying channels
    Barbarossa, S
    Scaglione, A
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 2627 - 2630
  • [33] Pilot-based estimation of time-varying multipath channels
    Baissas, MAR
    Sayeed, AM
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 2657 - 2660
  • [34] Energy-effficient Training for Antenna Selection in Time-varying Channels
    Kristem, Vinod
    Mehta, Neelesh B.
    Molisch, Andreas F.
    2011 CONFERENCE RECORD OF THE FORTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS (ASILOMAR), 2011, : 13 - 17
  • [35] Time-Varying FIR Equalization for MIMO Transmission over Doubly Selective Channels
    Barhumi, Imad
    Moonen, Marc
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [36] Analysis and optimization of adaptive multicopy transmission ARQ protocols for time-varying channels
    Annamalai, A
    Bhargava, VK
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1998, 46 (10) : 1356 - 1368
  • [37] Adaptive transmission for frequency-hop communication over time-varying channels
    Pursley, MB
    Wilkins, CS
    1998 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY - PROCEEDINGS, 1998, : 454 - 454
  • [38] Transmission range optimization for FH-CDMA networks in time-varying channels
    Sui, Haichang
    Zeidler, James R.
    2007 IEEE MILITARY COMMUNICATIONS CONFERENCE, VOLS 1-8, 2007, : 101 - 107
  • [39] Adaptive Signal Detection for Statistical Signal Transmission in Fast Time-Varying Channels
    Xu, Tianheng
    Zhang, Mengying
    Yao, Sha
    Hu, Honglin
    Chen, Hsiao-Hwa
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2017, 66 (12) : 11070 - 11085
  • [40] Time-Varying FIR Equalization for MIMO Transmission over Doubly Selective Channels
    Imad Barhumi
    Marc Moonen
    EURASIP Journal on Advances in Signal Processing, 2010