Learning Algorithms for Minimizing Queue Length Regret

Cited by: 15
Authors
Stahlbuhk, Thomas [1 ]
Shrader, Brooke [1 ]
Modiano, Eytan [2 ]
Affiliations
[1] MIT, Lincoln Lab, Lexington, MA 02421 USA
[2] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Keywords
Statistical learning; bandit algorithms; queueing theory; network control; MULTIARMED BANDIT; ALLOCATION; THROUGHPUT; POLICIES;
DOI
10.1109/TIT.2021.3054854
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
We consider a system consisting of a single transmitter/receiver pair and N channels over which they may communicate. Packets arrive randomly at the transmitter's queue and wait to be successfully sent to the receiver. The transmitter may attempt a frame transmission on one channel at a time, where each frame includes a packet if one is in the queue. For each channel, an attempted transmission is successful with an unknown probability. The transmitter's objective is to quickly identify the best channel to minimize the number of packets in the queue over T time slots. To analyze system performance, we introduce queue length regret, which is the expected difference between the total queue length of a learning policy and that of a controller that knows the rates a priori. One approach to designing a transmission policy would be to apply algorithms from the literature that solve the closely related stochastic multi-armed bandit problem. These policies would focus on maximizing the number of successful frame transmissions over time. However, we show that these methods have Ω(log T) queue length regret. On the other hand, we show that there exists a set of queue-length-based policies that can obtain order-optimal O(1) queue length regret. We use our theoretical analysis to devise heuristic methods that are shown to perform well in simulation.
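The definition of queue length regret in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method: it uses UCB1 as a stand-in for "algorithms from the literature" on the stochastic multi-armed bandit problem, and it does not implement the paper's queue-length-based policies. The function names, the Bernoulli arrival model, the choice to record the queue length at the end of each slot, and all numeric parameters are assumptions made for illustration only.

```python
import math
import random


def simulate_queue(policy, arrival_rate, success_probs, horizon, seed=0):
    """Return the queue length summed over `horizon` slots (assumed model)."""
    rng = random.Random(seed)
    n = len(success_probs)
    pulls = [0] * n  # frames attempted on each channel
    wins = [0] * n   # successful frames on each channel
    queue = 0
    total = 0
    for t in range(1, horizon + 1):
        # Assumption: at most one packet arrives per slot (Bernoulli arrivals).
        if rng.random() < arrival_rate:
            queue += 1
        # A frame is attempted every slot, even when the queue is empty,
        # so the policy keeps observing channel outcomes.
        channel = policy(t, pulls, wins)
        success = rng.random() < success_probs[channel]
        pulls[channel] += 1
        wins[channel] += success
        if success and queue > 0:
            queue -= 1
        # Assumption: queue length is recorded at the end of the slot.
        total += queue
    return total


def ucb1(t, pulls, wins):
    """UCB1 index policy, used here only as an example bandit algorithm."""
    for i, count in enumerate(pulls):
        if count == 0:
            return i  # try every channel once first
    return max(
        range(len(pulls)),
        key=lambda i: wins[i] / pulls[i] + math.sqrt(2 * math.log(t) / pulls[i]),
    )


def queue_length_regret(arrival_rate, success_probs, horizon, runs=20):
    """Monte Carlo estimate of queue length regret of UCB1 versus a genie."""
    best = max(range(len(success_probs)), key=lambda i: success_probs[i])

    def genie(t, pulls, wins):
        return best  # controller that knows the rates a priori

    diff = 0
    for seed in range(runs):
        # The same seed gives both controllers identical arrival and channel
        # randomness, which reduces the variance of the estimate.
        learner = simulate_queue(ucb1, arrival_rate, success_probs, horizon, seed)
        oracle = simulate_queue(genie, arrival_rate, success_probs, horizon, seed)
        diff += learner - oracle
    return diff / runs


if __name__ == "__main__":
    # Hypothetical parameters: arrival rate 0.3, three channels.
    print(queue_length_regret(0.3, [0.4, 0.6, 0.8], horizon=5000))
```

Because both controllers share the same random draws, the genie's queue is never longer than the learner's on any sample path in this sketch, so the estimate is nonnegative. Running it for several horizons T gives a rough feel for how the regret of a throughput-oriented bandit policy behaves, but it is not a substitute for the bounds stated in the abstract.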
Pages: 1759 - 1781
Page count: 23
Related papers
50 in total
  • [21] Minimizing Average Regret Ratio in Database
    Zeighami, Sepanta
    Wong, Raymond Chi-Wing
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2265 - 2266
  • [22] Minimizing Regret in Dynamic Decision Problems
    Halpern, Joseph Y.
    Leung, Samantha
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2015, 2015, 9161 : 3 - 13
  • [23] Regret-minimizing Bayesian persuasion
    Babichenko, Yakov
    Talgam-Cohen, Inbal
    Xu, Haifeng
    Zabarnyi, Konstantin
    GAMES AND ECONOMIC BEHAVIOR, 2022, 136 : 226 - 248
  • [24] Minimizing regret with label efficient prediction
    Cesa-Bianchi, N
    Lugosi, G
    Stoltz, G
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (06) : 2152 - 2162
  • [25] On the Guarantees of Minimizing Regret in Receding Horizon
    Martin, Andrea
    Furieri, Luca
    Dorfler, Florian
    Lygeros, John
    Ferrari-Trecate, Giancarlo
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (03) : 1547 - 1562
  • [26] Regret-Minimizing Project Choice
    Guo, Yingni
    Shmaya, Eran
    ECONOMETRICA, 2023, 91 (05) : 1567 - 1593
  • [27] Minimizing regret in dynamic decision problems
    Joseph Y. Halpern
    Samantha Leung
    Theory and Decision, 2016, 81 : 123 - 151
  • [28] Regret-Minimizing Representative Databases
    Nanongkai, Danupon
    Das Sarma, Atish
    Lall, Ashwin
    Lipton, Richard J.
    Xu, Jun
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) : 1114 - 1124
  • [29] Minimizing regret in dynamic decision problems
    Halpern, Joseph Y.
    Leung, Samantha
    THEORY AND DECISION, 2016, 81 (01) : 123 - 151
  • [30] Minimizing regret with label efficient prediction
    Cesa-Bianchi, N
    Lugosi, G
    Stoltz, G
    LEARNING THEORY, PROCEEDINGS, 2004, 3120 : 77 - 92