Regret minimization under partial monitoring

被引:0
|
作者
Cesa-Bianchi, Nicolo [1 ]
Lugosi, Gabor [2 ]
Stoltz, Gilles [3 ]
机构
[1] Univ Milan, Dipartimento Sci Informaz, I-20135 Milan, Italy
[2] Pompeu Fabra Univ, Dept Econ, Barcelona, Spain
[3] Ecole Normale Super, Dept Math Appl, Paris, France
关键词
D O I
10.1109/ITW.2006.1633784
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider repeated games in which the player, instead of observing the action chosen by the opponent in each game round, receives a feedback generated by the combined choice of the two players. We study Hannan consistent players for these games, that is, randomized playing strategies whose per-round regret vanishes with probability one as the number of game rounds goes to infinity. We prove a general lower bound for the convergence rate of the regret, and exhibit a specific strategy that attains this rate for any game for which a Hannan consistent player exists.
引用
收藏
页码:72 / +
页数:2
相关论文
共 50 条
  • [21] Internal Regret with Partial Monitoring: Calibration-Based Optimal Algorithms
    Perchet, Vianney
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1893 - 1921
  • [22] Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring
    Komiyama, Junpei
    Honda, Junya
    Nakagawa, Hiroshi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [23] Iterated Regret Minimization in Game Graphs
    Filiot, Emmanuel
    Le Gall, Tristan
    Raskin, Jean-Francois
    MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE 2010, 2010, 6281 : 342 - 354
  • [24] Regret Minimization and the Price of Total Anarchy
    Blum, Avrim
    Hajiaghayi, MohammadTaghi
    Ligett, Katrina
    Roth, Aaron
    STOC'08: PROCEEDINGS OF THE 2008 ACM INTERNATIONAL SYMPOSIUM ON THEORY OF COMPUTING, 2008, : 373 - +
  • [25] A New Model of Random Regret Minimization
    Chorus, Caspar G.
    EUROPEAN JOURNAL OF TRANSPORT AND INFRASTRUCTURE RESEARCH, 2010, 10 (02): : 181 - 196
  • [26] Decision making using minimization of regret
    Yager, RR
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2004, 36 (02) : 109 - 128
  • [27] Pseudonorm Approachability and Applications to Regret Minimization
    Dann, Christoph
    Mansour, Yishay
    Mohri, Mehryar
    Schneider, Jon
    Sivan, Balubramanian
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 471 - 509
  • [28] Hybrid Regret Minimization: A Submodular Approach
    Zheng, Jiping
    Meng, Fanxu
    Wang, Yanhao
    Wang, Xiaoyang
    Wang, Sheng
    Ma, Yuan
    Hao, Zhiyang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3151 - 3165
  • [29] Evolutionary Dynamics and Φ-Regret Minimization in Games
    Piliouras G.
    Rowland M.
    Omidshaflei S.
    Elie R.
    Hennes D.
    Connor J.
    Tuyls K.
    Journal of Artificial Intelligence Research, 2022, 74 : 1125 - 1158
  • [30] Evolutionary Dynamics and Φ-Regret Minimization in Games
    Piliouras, Georgios
    Rowland, Mark
    Omidshafiei, Shayegan
    Elie, Romuald
    Hennes, Daniel
    Connor, Jerome
    Tuyls, Karl
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1125 - 1158