Best-of-Both-Worlds Algorithms for Partial Monitoring

被引:0
|
作者
Tsuchiya, Taira [1 ,2 ]
Ito, Shinji [3 ]
Honda, Junya [1 ,2 ]
机构
[1] Kyoto Univ, Kyoto, Japan
[2] RIKEN AIP, Tokyo, Japan
[3] NEC Corp Ltd, Tokyo, Japan
关键词
partial monitoring; best-of-both-worlds; follow-the-regularized-leader; stochastic regime with adversarial corruptions; REGRET;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study considers the partial monitoring problem with k-actions and d-outcomes and provides the first best-of-both-worlds algorithms, whose regrets are favorably bounded both in the stochastic and adversarial regimes. In particular, we show that for non-degenerate locally observable games, the regret is O(m(2)k(4) log(T) log(k.T)/.min) in the stochastic regime and O(mk(3/2) root T log(T) log k Pi) in the adversarial regime, where T is the number of rounds, m is the maximum number of distinct observations per action,.min is the minimum suboptimality gap, and k. is the number of Pareto optimal actions. Moreover, we show that for globally observable games, the regret is O(m(2)k(4) log(T) log(k(Pi)T)/Delta(min)) in the stochastic regime and O(mk(3/2)root Tlog(T) log(k(Pi))) in the adversarial regime, where cG is a game-dependent constant. We also provide regret bounds for a stochastic regime with adversarial corruptions. Our algorithms are based on the follow-theregularized-leader framework and are inspired by the approach of exploration by optimization and the adaptive learning rate in the field of online learning with feedback graphs.
引用
收藏
页码:1484 / 1515
页数:32
相关论文
共 50 条
  • [21] The best of both worlds
    Valero, Greg
    Adhesives Age, 2003, 46 (05): : 11 - 12
  • [22] Best of both worlds
    Frisk, Richard
    EP Electronic Production (London), 2000, 29 (07):
  • [23] The best of both worlds
    Pitcher, Graham
    New Electronics, 2002, 35 (06): : 33 - 34
  • [24] The best of both worlds
    Place, J
    SOAP & COSMETICS, 2000, 76 (04): : 31 - +
  • [25] The best of both worlds
    Thackery, Oscar H.
    Hagberg, Ted S.
    Hydrocarbon Engineering, 2002, 7 (04):
  • [26] Best of both worlds
    Eleni Diamanti
    Elham Kashefi
    Nature Physics, 2017, 13 : 3 - 4
  • [27] The best of both worlds
    Penhune, J
    FORBES, 1998, : 24 - +
  • [28] The Best of Both Worlds
    Waters, Hannah
    SCIENTIST, 2012, 26 (04): : 69 - 71
  • [29] Best of both worlds
    Chen, M
    Zak, C
    Cusack, T
    Magee, W
    ADHESIVES AGE, 2003, 46 (01): : 19 - 22
  • [30] Best of both worlds?
    Atkinson, S
    IEE REVIEW, 2003, 49 (03): : 4 - 4