Information Sharing in Distributed Stochastic Bandits

被引:0
|
作者
Buccapatnam, Swapna [1 ,2 ]
Tan, Jian [2 ]
Zhang, Li [2 ]
机构
[1] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
关键词
MULTIARMED BANDIT;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Information sharing is an important issue for stochastic bandit problems in a distributed setting. Consider N players dealing with the same multi-armed bandit problem. All players receive requests simultaneously and must choose one of M actions for each request. Sharing information among these N players can decrease the regret for each of them but also incurs cooperation and communication overhead. In this setting, we study how cooperation and communication can impact the system performance measured by regret and communication cost. For both scenarios, we establish a uniform lower bound to the regret for the entire system as a function of time and network size. Concerning cooperation, we study the problem from a game-theoretic perspective. When each player's actions and payoffs are immediately visible to all others, we identify strategies for all players under which co-operative exploration is ensured. Regarding the communication cost, we consider incomplete information sharing such that a player's payoffs and actions are not entirely available to others. The players communicate observations to each other to reduce their regret, however with a cost. We show that a logarithmic communication cost is necessary to achieve the optimal regret. For Bernoulli arrivals, we specify a policy that achieves the optimal regret with a logarithmic communication cost. Our work opens a novel direction towards understanding information sharing for active learning in a distributed environment.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Distributed Stochastic Optimization under Imperfect Information
    Kannan, Aswin
    Nedic, Angelia
    Shanbhag, Uday V.
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 400 - 405
  • [42] Stochastic clustering for organizing distributed information sources
    Shyu, ML
    Chen, SC
    Rubin, SH
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (05): : 2035 - 2047
  • [43] An Analysis of the Value of Information When Exploring Stochastic, Discrete Multi-Armed Bandits
    Sledge, Isaac J.
    Principe, Jose C.
    ENTROPY, 2018, 20 (03)
  • [44] Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis
    Jose, Sharu Theresa
    Moothedath, Shana
    ENTROPY, 2024, 26 (07)
  • [45] A healthcare information sharing scheme in distributed cloud networks
    Sunyoung Kang
    TaeYeon Kim
    Hongseok Jeon
    Wonhyuk Lee
    Seungae Kang
    Cluster Computing, 2015, 18 : 1405 - 1410
  • [46] A distributed feature selection scheme with partial information sharing
    Aida Brankovic
    Luigi Piroddi
    Machine Learning, 2019, 108 : 2009 - 2034
  • [47] InfoSleuth - An emerging technology for sharing distributed environmental information
    Pitts, G
    Fowler, J
    INFORMATION SYSTEMS AND THE ENVIRONMENT, 2001, : 159 - 172
  • [48] Online Distributed Interdependency Estimation with Partial Information Sharing
    Gasparri, Andrea
    Iovino, Francesco
    Oliva, Gabriele
    Panzieri, Stefano
    2010 COMPLEXITY IN ENGINEERING: COMPENG 2010, PROCEEDINGS, 2010, : 82 - 84
  • [49] Distributed Content Backup and Sharing Using Social Information
    Jiang, Jin
    Casetti, Claudio E.
    NETWORKING 2012, PT I, 2012, 7289 : 68 - 81
  • [50] Incidents Information Sharing Platform for Distributed Attack Detection
    Fotiadou, Konstantina
    Velivassaki, Terpsichori-Helen
    Voulkidis, Artemis
    Railis, Konstantinos
    Trakadas, Panagiotis
    Zahariadis, Theodore
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2020, 1 : 593 - 605