Information Sharing in Distributed Stochastic Bandits

Cited by: 0
Authors
Buccapatnam, Swapna [1, 2]
Tan, Jian [2]
Zhang, Li [2]
Affiliations
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
MULTIARMED BANDIT;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Information sharing is an important issue for stochastic bandit problems in a distributed setting. Consider N players facing the same multi-armed bandit problem. All players receive requests simultaneously and must choose one of M actions for each request. Sharing information among these N players can decrease the regret of each of them, but it also incurs cooperation and communication overhead. In this setting, we study how cooperation and communication affect system performance, measured by regret and communication cost. For both scenarios, we establish a uniform lower bound on the regret of the entire system as a function of time and network size. Concerning cooperation, we study the problem from a game-theoretic perspective. When each player's actions and payoffs are immediately visible to all others, we identify strategies for all players under which cooperative exploration is ensured. Regarding the communication cost, we consider incomplete information sharing, in which a player's payoffs and actions are not entirely available to others. The players communicate observations to each other to reduce their regret, but at a cost. We show that a logarithmic communication cost is necessary to achieve the optimal regret. For Bernoulli arrivals, we specify a policy that achieves the optimal regret with a logarithmic communication cost. Our work opens a novel direction towards understanding information sharing for active learning in a distributed environment.
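The trade-off described in the abstract can be illustrated with a small simulation. The sketch below is not the policy from the paper: it assumes standard UCB1 learners, Bernoulli rewards, and two extreme regimes (every observation broadcast to all players, or no sharing at all). The function name `run_ucb` and all of its parameters are hypothetical and chosen only for illustration.

```python
import math
import random


def run_ucb(num_players, means, horizon, share, seed=0):
    """Simulate num_players UCB1 learners on the same Bernoulli bandit.

    share=True: every observation is broadcast, so all players use pooled
    statistics. share=False: each player learns only from its own pulls.
    Returns (total expected regret over all players, messages sent).

    Simplification (an assumption of this sketch): within a round, players
    act one after another, so with sharing a player already sees the
    observations made earlier in the same round.
    """
    rng = random.Random(seed)
    num_arms = len(means)
    best = max(means)
    # With sharing there is one pooled table; otherwise one table per player.
    tables = 1 if share else num_players
    counts = [[0] * num_arms for _ in range(tables)]
    sums = [[0.0] * num_arms for _ in range(tables)]
    regret = 0.0
    messages = 0

    for _ in range(horizon):
        for p in range(num_players):
            t = 0 if share else p
            total = sum(counts[t])
            # Pull each arm once before using the UCB index.
            untried = [a for a in range(num_arms) if counts[t][a] == 0]
            if untried:
                arm = untried[0]
            else:
                arm = max(
                    range(num_arms),
                    key=lambda a: sums[t][a] / counts[t][a]
                    + math.sqrt(2.0 * math.log(total) / counts[t][a]),
                )
            reward = 1.0 if rng.random() < means[arm] else 0.0
            counts[t][arm] += 1
            sums[t][arm] += reward
            regret += best - means[arm]
            if share:
                # One message to each of the other players per observation.
                messages += num_players - 1
    return regret, messages
```

Calling `run_ucb(4, [0.3, 0.5, 0.7], 2000, share=True)` and then the same with `share=False` gives the two extremes to compare. Naive broadcasting costs a number of messages that grows linearly with time, which is exactly the overhead the abstract's logarithmic communication cost result improves on.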
Pages: 9