Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret

被引:204
|
作者
Anandkumar, Animashree [1 ]
Michael, Nithin [2 ]
Tang, Kevin [2 ]
Swami, Ananthram [3 ]
机构
[1] Univ Calif Irvine, Ctr Pervas Commun & Comp, Dept Elect Engn & Comp Sci, Irvine, CA 92697 USA
[2] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14853 USA
[3] USA, Res Lab, Adelphi, MD 20783 USA
关键词
Cognitive medium access control; multi-armed bandits; distributed algorithms; logarithmic regret; MULTIARMED BANDIT PROBLEM; EFFICIENT ALLOCATION RULES; MULTIPLE PLAYS; REWARDS;
D O I
10.1109/JSAC.2011.110406
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The problem of distributed learning and channel access is considered in a cognitive network with multiple secondary users. The availability statistics of the channels are initially unknown to the secondary users and are estimated using sensing decisions. There is no explicit information exchange or prior agreement among the secondary users and sensing and access decisions are undertaken by them in a completely distributed manner. We propose policies for distributed learning and access which achieve order-optimal cognitive system throughput (number of successful secondary transmissions) under self play, i.e., when implemented at all the secondary users. Equivalently, our policies minimize the sum regret in distributed learning and access, which is the loss in secondary throughput due to learning and distributed access. For the scenario when the number of secondary users is known to the policy, we prove that the total regret is logarithmic in the number of transmission slots. This policy achieves order-optimal regret based on a logarithmic lower bound for regret under any uniformly-good learning and access policy. We then consider the case when the number of secondary users is fixed but unknown, and is estimated at each user through feedback. We propose a policy whose sum regret grows only slightly faster than logarithmic in the number of transmission slots.
引用
收藏
页码:731 / 745
页数:15
相关论文
共 50 条
  • [21] Algorithms for Dynamic Spectrum Access With Learning for Cognitive Radio
    Unnikrishnan, Jayakrishnan
    Veeravalli, Venugopal V.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (02) : 750 - 760
  • [22] Logarithmic Regret for Reinforcement Learning with Linear Function Approximation
    He, Jiafan
    Zhou, Dongruo
    Gu, Quanquan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [23] Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently
    Cassel, Asaf
    Cohen, Alon
    Koren, Tomer
    25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,
  • [24] Distributed Algorithms for Opportunistic Spectrum Access in Decentralized Cognitive Radio Network
    Kumar, Rohit
    2018 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2018, : 516 - 518
  • [25] Channel Sensing Order for Distributed Cognitive Radio Networks Using No-Regret Learning
    Li, Li-Wang
    Ge, Jin-Cheng
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATION AND SENSOR NETWORKS (WCSN 2016), 2016, 44 : 674 - 679
  • [26] Distributed Learning Algorithms for Coordination in a Cognitive Network in Presence of Jammers
    Sawant, Suneet
    Hanawal, Manjesh K.
    Darak, Sumit
    Kumar, Rohit
    2018 16TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT), 2018,
  • [27] No-Regret Learning with Unbounded Losses: The Case of Logarithmic Pooling
    Neyman, Eric
    Roughgarden, Tim
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [28] Learning Algorithms for Minimizing Queue Length Regret
    Stahlbuhk, Thomas
    Shrader, Brooke
    Modiano, Eytan
    2018 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2018, : 1001 - 1005
  • [29] Learning Algorithms for Minimizing Queue Length Regret
    Stahlbuhk, Thomas
    Shrader, Brooke
    Modiano, Eytan
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (03) : 1759 - 1781
  • [30] Logarithmic Regret for Distributed Online Subgradient Method over Unbalanced Directed Networks
    Yamashita, Makoto
    Hayashi, Naoki
    Hatanaka, Takeshi
    Takai, Shigemasa
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2021, E104A (08) : 1019 - 1026