On optimality of monotone channel-aware transmission policies: A constrained Markov Decision Process approach

Cited by: 0
Authors
Ngo, Minh Hanh [1]
Krishnamurthy, Vikram [1]
Affiliations
[1] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
Keywords
Markov processes; stochastic optimal control; resource management;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A constrained Markov Decision Process (MDP) approach is deployed to prove the monotone structure of optimal channel-aware transmission policies for packet transmission over a correlated fading wireless channel subject to an average delay constraint. A transmission policy is a function mapping channel state information (CSI), buffer states and numbers of arriving packets to transmit probabilities. The objective is to minimize the average transmission energy cost subject to an average delay constraint. We use the Lagrange multiplier method to convert the constrained MDP to an unconstrained MDP and prove that the unconstrained optimal policy is threshold in the buffer state. It then follows that the constrained optimal transmission policy is a randomized mixture of two pure transmission policies that are threshold in the buffer occupancy.
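The Lagrangian step described in the abstract can be illustrated with a minimal sketch. This is not the authors' model: it assumes a two-state channel, Bernoulli arrivals, a discounted rather than average cost, buffer occupancy as a stand-in for delay, and made-up numbers throughout. The names `solve_lagrangian_mdp`, `is_threshold_in_buffer` and `mixing_probability` are hypothetical.

```python
def solve_lagrangian_mdp(lam, B=10, gamma=0.95, p_arrive=0.4,
                         energy=(1.0, 3.0),
                         P_chan=((0.8, 0.2), (0.3, 0.7)),
                         tol=1e-9, max_iter=100_000):
    """Value iteration for a toy unconstrained (Lagrangian) MDP.

    State: (channel h, buffer b). Action a: 0 = wait, 1 = transmit one packet.
    Per-stage cost: energy[h] * a + lam * b. All parameters are assumptions.
    Returns (policy, V) as nested lists indexed [h][b].
    """
    H = len(energy)
    V = [[0.0] * (B + 1) for _ in range(H)]
    policy = [[0] * (B + 1) for _ in range(H)]
    for _ in range(max_iter):
        # EV[h][b] = expected next-step value over the next channel state
        EV = [[sum(P_chan[h][k] * V[k][b] for k in range(H))
               for b in range(B + 1)] for h in range(H)]
        V_new = [[0.0] * (B + 1) for _ in range(H)]
        for h in range(H):
            for b in range(B + 1):
                q = []
                for a in (0, 1):
                    sent = a if b > 0 else 0   # nothing leaves an empty buffer
                    stay = b - sent            # next buffer level, no arrival
                    grow = min(stay + 1, B)    # one arrival, dropped if full
                    cont = ((1 - p_arrive) * EV[h][stay]
                            + p_arrive * EV[h][grow])
                    q.append(energy[h] * a + lam * b + gamma * cont)
                policy[h][b] = 0 if q[0] <= q[1] else 1
                V_new[h][b] = min(q)
        diff = max(abs(V_new[h][b] - V[h][b])
                   for h in range(H) for b in range(B + 1))
        V = V_new
        if diff < tol:
            break
    return policy, V


def is_threshold_in_buffer(policy):
    """True if, for each channel state, the action never drops as b grows."""
    return all(row[b] <= row[b + 1]
               for row in policy for b in range(len(row) - 1))


def mixing_probability(delay_lo, delay_hi, delay_max):
    """Probability q of playing the low-delay pure policy so that
    q * delay_lo + (1 - q) * delay_hi equals delay_max."""
    return (delay_hi - delay_max) / (delay_hi - delay_lo)


if __name__ == "__main__":
    policy, _ = solve_lagrangian_mdp(lam=0.5)
    for h, row in enumerate(policy):
        print("channel", h, "actions by buffer level:", row)
    print("threshold in buffer:", is_threshold_in_buffer(policy))
```

Sweeping `lam` and checking `is_threshold_in_buffer` gives an empirical view of the structure the abstract claims; the sketch itself proves nothing. `mixing_probability` shows only the arithmetic of randomizing between two pure policies whose average delays bracket the constraint.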
Pages: 621 / +
Page count: 2
Related papers
50 in total
  • [1] MIMO transmission control in fading channels - A constrained Markov decision process formulation with monotone randomized policies
    Djonin, Dejan V.
    Krishnamurthy, Vikram
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (10) : 5069 - 5083
  • [2] On the Optimality of Threshold Scheduling Policies for Video Transmission in Markovian Fading Wireless Channels with Channel-Aware ARQ
    Ngo, Minh Hanh
    Krishnamurthy, Vikram
    GLOBECOM 2006 - 2006 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2006,
  • [3] Q-learning algorithms for constrained Markov decision processes with randomized monotone policies: Application to MIMO transmission control
    Djonin, Dejan V.
    Krishnamurthy, Vikram
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (05) : 2170 - 2181
  • [4] Channel-Aware Line Code Decision in RFID
    Park, Jongho
    Lee, Tae-Jin
    IEEE COMMUNICATIONS LETTERS, 2011, 15 (12) : 1402 - 1404
  • [5] Massive MIMO Channel-Aware Decision Fusion
    Ciuonzo, Domenico
    Rossi, Pierluigi Salvo
    Dey, Subhrakanti
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2015, 63 (03) : 604 - 619
  • [6] Optimality of monotone policies for transmission control with switching costs
    Farrokh, Arsalan
    Krishnamurthy, Vikram
    PROCEEDINGS OF THE 46TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-14, 2007, : 6077 - 6082
  • [7] MONOTONE OPTIMAL POLICIES FOR MARKOV DECISION-PROCESSES
    SERFOZO, RF
    MATHEMATICAL PROGRAMMING STUDY, 1976, 6 (DEC) : 202 - 215
  • [8] A risk-aware maintenance model based on a constrained Markov decision process
    Xu, Jianyu
    Zhao, Xiujie
    Liu, Bin
    IISE TRANSACTIONS, 2022, 54 (11) : 1072 - 1083
  • [9] Robustness of policies in constrained Markov decision processes
    Zadorojniy, A
    Shwartz, A
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2006, 51 (04) : 635 - 638
  • [10] Computing monotone policies for Markov decision processes: a nearly-isotonic penalty approach
    Mattila, Robert
    Rojas, Cristian R.
    Krishnamurthy, Vikram
    Wahlberg, Bo
    IFAC PAPERSONLINE, 2017, 50 (01) : 8429 - 8434