Regret minimization under partial monitoring

被引：0

作者：

Cesa-Bianchi, Nicolo ^{[1
]}

Lugosi, Gabor ^{[2
]}

Stoltz, Gilles ^{[3
]}

机构：

[1] Univ Milan, Dipartimento Sci Informaz, I-20135 Milan, Italy

[2] Pompeu Fabra Univ, Dept Econ, Barcelona, Spain

[3] Ecole Normale Super, Dept Math Appl, Paris, France

来源：

2006 IEEE INFORMATION THEORY WORKSHOP | 2006年

关键词：

D O I：

10.1109/ITW.2006.1633784

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number of game rounds goes to infinity. We prove a general lower bound for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan consistent player exists.

引用

页码：72 / +

页数：2

共 50 条

[21] Internal Regret with Partial Monitoring: Calibration-Based Optimal Algorithms
Perchet, Vianney
JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1893 - 1921
[22] Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring
Komiyama, Junpei
Honda, Junya
Nakagawa, Hiroshi
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
[23] Iterated Regret Minimization in Game Graphs
Filiot, Emmanuel
Le Gall, Tristan
Raskin, Jean-Francois
MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2010, 2010, 6281 : 342 - 354
[24] Regret Minimization and the Price of Total Anarchy
Blum, Avrim
Hajiaghayi, MohammadTaghi
Ligett, Katrina
Roth, Aaron
STOC'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL SYMPOSIUM ON THEORY OF COMPUTING, 2008, : 373 - +
[25] A New Model of Random Regret Minimization
Chorus, Caspar G.
EUROPEAN JOURNAL OF TRANSPORT AND INFRASTRUCTURE RESEARCH, 2010, 10 (02): : 181 - 196
[26] Decision making using minimization of regret
Yager, RR
INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2004, 36 (02) : 109 - 128
[27] Pseudonorm Approachability and Applications to Regret Minimization
Dann, Christoph
Mansour, Yishay
Mohri, Mehryar
Schneider, Jon
Sivan, Balubramanian
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 471 - 509
[28] Hybrid Regret Minimization: A Submodular Approach
Zheng, Jiping
Meng, Fanxu
Wang, Yanhao
Wang, Xiaoyang
Wang, Sheng
Ma, Yuan
Hao, Zhiyang
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3151 - 3165
[29] Evolutionary Dynamics and Φ-Regret Minimization in Games
Piliouras G.
Rowland M.
Omidshaflei S.
Elie R.
Hennes D.
Connor J.
Tuyls K.
Journal of Artificial Intelligence Research, 2022, 74 : 1125 - 1158
[30] Evolutionary Dynamics and Φ-Regret Minimization in Games
Piliouras, Georgios
Rowland, Mark
Omidshafiei, Shayegan
Elie, Romuald
Hennes, Daniel
Connor, Jerome
Tuyls, Karl
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1125 - 1158

← 1 2 3 4 5 →