Best-of-Both-Worlds Algorithms for Partial Monitoring

Cited by: 0

Authors:

Tsuchiya, Taira [1,2]

Ito, Shinji [3]

Honda, Junya [1,2]

Affiliations:

[1] Kyoto Univ, Kyoto, Japan

[2] RIKEN AIP, Tokyo, Japan

[3] NEC Corp Ltd, Tokyo, Japan

Source:

INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201 | 2023 / Vol. 201

Keywords:

partial monitoring; best-of-both-worlds; follow-the-regularized-leader; stochastic regime with adversarial corruptions; REGRET;

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This study considers the partial monitoring problem with $k$ actions and $d$ outcomes and provides the first best-of-both-worlds algorithms, whose regrets are favorably bounded in both the stochastic and adversarial regimes. In particular, we show that for non-degenerate locally observable games, the regret is $O(m^2 k^4 \log(T) \log(k_{\Pi} T) / \Delta_{\min})$ in the stochastic regime and $O(m k^{3/2} \sqrt{T \log(T) \log k_{\Pi}})$ in the adversarial regime, where $T$ is the number of rounds, $m$ is the maximum number of distinct observations per action, $\Delta_{\min}$ is the minimum suboptimality gap, and $k_{\Pi}$ is the number of Pareto optimal actions. Moreover, we show that for globally observable games, the regret is $O(c_{\mathcal{G}}^2 \log(T) \log(k_{\Pi} T) / \Delta_{\min}^2)$ in the stochastic regime and $O((c_{\mathcal{G}}^2 \log(T) \log(k_{\Pi} T))^{1/3} T^{2/3})$ in the adversarial regime, where $c_{\mathcal{G}}$ is a game-dependent constant. We also provide regret bounds for a stochastic regime with adversarial corruptions. Our algorithms are based on the follow-the-regularized-leader framework and are inspired by the approach of exploration by optimization and the adaptive learning rate in the field of online learning with feedback graphs.

Citation

Pages: 1484-1515

Number of pages: 32
