Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret

Cited by: 204
Authors
Anandkumar, Animashree [1]
Michael, Nithin [2]
Tang, Kevin [2]
Swami, Ananthram [3]
Affiliations
[1] Univ Calif Irvine, Ctr Pervas Commun & Comp, Dept Elect Engn & Comp Sci, Irvine, CA 92697 USA
[2] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14853 USA
[3] US Army Res Lab, Adelphi, MD 20783 USA
Keywords
Cognitive medium access control; multi-armed bandits; distributed algorithms; logarithmic regret; MULTIARMED BANDIT PROBLEM; EFFICIENT ALLOCATION RULES; MULTIPLE PLAYS; REWARDS;
DOI
10.1109/JSAC.2011.110406
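Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Subject classification code
0808; 0809
Abstract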
The problem of distributed learning and channel access is considered in a cognitive network with multiple secondary users. The availability statistics of the channels are initially unknown to the secondary users and are estimated using sensing decisions. There is no explicit information exchange or prior agreement among the secondary users and sensing and access decisions are undertaken by them in a completely distributed manner. We propose policies for distributed learning and access which achieve order-optimal cognitive system throughput (number of successful secondary transmissions) under self play, i.e., when implemented at all the secondary users. Equivalently, our policies minimize the sum regret in distributed learning and access, which is the loss in secondary throughput due to learning and distributed access. For the scenario when the number of secondary users is known to the policy, we prove that the total regret is logarithmic in the number of transmission slots. This policy achieves order-optimal regret based on a logarithmic lower bound for regret under any uniformly-good learning and access policy. We then consider the case when the number of secondary users is fixed but unknown, and is estimated at each user through feedback. We propose a policy whose sum regret grows only slightly faster than logarithmic in the number of transmission slots.
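The setting described in the abstract can be illustrated with a small simulation. What follows is a minimal sketch under stated assumptions, not the authors' policy: it assumes i.i.d. Bernoulli channel availability, a UCB1-style index for learning, and a rule in which each user targets the channel with its rank-th highest index and re-draws that rank uniformly after a collision. The function name `simulate` and its parameters `mu`, `n_users`, `n_slots`, and `seed` are hypothetical and chosen for illustration only. The number of secondary users is taken as known, matching the first scenario in the abstract.

```python
import math
import random


def simulate(mu, n_users, n_slots, seed=0):
    """Simulate n_users secondary users learning channel availabilities mu.

    Returns (successes, regret). Regret is measured against a genie that
    knows mu and places the users on the n_users best channels without
    collisions (expected throughput minus realized throughput).
    """
    assert 0 < n_users <= len(mu), "need at least one channel per user"
    rng = random.Random(seed)
    n_ch = len(mu)
    # Per-user sensing statistics: times sensed, times found free.
    counts = [[0] * n_ch for _ in range(n_users)]
    free = [[0] * n_ch for _ in range(n_users)]
    # Each user targets the channel with its rank-th highest index
    # (illustrative assumption; ranks are 0-based here).
    rank = [rng.randrange(n_users) for _ in range(n_users)]
    successes = 0

    for t in range(1, n_slots + 1):
        choice = []
        for u in range(n_users):
            def index(c, u=u, t=t):
                if counts[u][c] == 0:
                    return float("inf")  # sense every channel at least once
                mean = free[u][c] / counts[u][c]
                # UCB1-style exploration bonus (an assumption of this sketch).
                return mean + math.sqrt(2 * math.log(t) / counts[u][c])

            ordered = sorted(range(n_ch), key=index, reverse=True)
            choice.append(ordered[rank[u]])

        # Channel states for this slot: True means free of primary users.
        state = [rng.random() < p for p in mu]
        for u, c in enumerate(choice):
            counts[u][c] += 1
            free[u][c] += state[c]
            if not state[c]:
                continue  # channel busy, so the user does not transmit
            if choice.count(c) == 1:
                successes += 1  # sole transmitter on a free channel
            else:
                rank[u] = rng.randrange(n_users)  # collision: re-draw rank

    best = sum(sorted(mu, reverse=True)[:n_users]) * n_slots
    return successes, best - successes


if __name__ == "__main__":
    for horizon in (1000, 10000):
        print(horizon, simulate([0.9, 0.8, 0.5, 0.2], 2, horizon))
```

Running the sketch over increasing horizons gives a rough feel for how the sum regret grows with the number of transmission slots. A single seeded run is noisy, so averaging over several seeds is advisable before drawing any conclusion about the growth rate.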
Pages: 731-745
Page count: 15