Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret

被引:204
|
作者
Anandkumar, Animashree [1 ]
Michael, Nithin [2 ]
Tang, Kevin [2 ]
Swami, Ananthram [3 ]
机构
[1] Univ Calif Irvine, Ctr Pervas Commun & Comp, Dept Elect Engn & Comp Sci, Irvine, CA 92697 USA
[2] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14853 USA
[3] USA, Res Lab, Adelphi, MD 20783 USA
关键词
Cognitive medium access control; multi-armed bandits; distributed algorithms; logarithmic regret; MULTIARMED BANDIT PROBLEM; EFFICIENT ALLOCATION RULES; MULTIPLE PLAYS; REWARDS;
D O I
10.1109/JSAC.2011.110406
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The problem of distributed learning and channel access is considered in a cognitive network with multiple secondary users. The availability statistics of the channels are initially unknown to the secondary users and are estimated using sensing decisions. There is no explicit information exchange or prior agreement among the secondary users and sensing and access decisions are undertaken by them in a completely distributed manner. We propose policies for distributed learning and access which achieve order-optimal cognitive system throughput (number of successful secondary transmissions) under self play, i.e., when implemented at all the secondary users. Equivalently, our policies minimize the sum regret in distributed learning and access, which is the loss in secondary throughput due to learning and distributed access. For the scenario when the number of secondary users is known to the policy, we prove that the total regret is logarithmic in the number of transmission slots. This policy achieves order-optimal regret based on a logarithmic lower bound for regret under any uniformly-good learning and access policy. We then consider the case when the number of secondary users is fixed but unknown, and is estimated at each user through feedback. We propose a policy whose sum regret grows only slightly faster than logarithmic in the number of transmission slots.
引用
收藏
页码:731 / 745
页数:15
相关论文
共 50 条
  • [41] Distributed Reinforcement Learning for scalable wireless medium access in IoTs and sensor networks
    Dutta, Hrishikesh
    Biswas, Subir
    COMPUTER NETWORKS, 2022, 202
  • [42] Sub-logarithmic distributed algorithms for metric facility location
    James W. Hegeman
    Sriram V. Pemmaraju
    Distributed Computing, 2015, 28 : 351 - 374
  • [43] Sub-logarithmic distributed algorithms for metric facility location
    Hegeman, James W.
    Pemmaraju, Sriram V.
    DISTRIBUTED COMPUTING, 2015, 28 (05) : 351 - 374
  • [44] Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning
    Fei, Yingjie
    Xu, Ruitu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [45] Achieving Logarithmic Regret via Hints in Online Learning of Noisy LQR Systems
    Akbari, Mohammad
    Gharesifard, Bahman
    Linder, Tamas
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 4700 - 4705
  • [46] Learning to Cache and Caching to Learn: Regret Analysis of Caching Algorithms
    Bura, Archana
    Rengarajan, Desik
    Kalathil, Dileep
    Shakkottai, Srinivas
    Chamberland, Jean-Francois
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2022, 30 (01) : 18 - 31
  • [47] Regret Minimization for Primary/Secondary Access to Satellite Resources With Cognitive Interference
    Sagduyu, Yalin E.
    Shi, Yi
    MacKenzie, Allen B.
    Hou, Y. Thomas
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2018, 17 (05) : 3512 - 3523
  • [48] Distributed Lifelong Reinforcement Learning with Sub-Linear Regret
    Tutunov, Rasul
    El-Zini, Julia
    Bou-Ammar, Haitham
    Jadbabaie, Ali
    2017 IEEE 56TH ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2017,
  • [49] Learning How to Configure LoRa Networks With No Regret: A Distributed Approach
    Toro-Betancur, Veronica
    Premsankar, Gopika
    Liu, Chen-Feng
    Slabicki, Mariusz
    Bennis, Mehdi
    Di Francesco, Mario
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (04) : 5633 - 5644
  • [50] Performance evaluation of algorithms for wireless medium access
    Bharghavan, V
    IEEE INTERNATIONAL COMPUTER PERFORMANCE AND DEPENDABILITY SYMPOSIUM -PROCEEDINGS, 1998, : 86 - 95