Mechanisms for a No-Regret Agent: Beyond the Common Prior

被引:10
|
作者
Camara, Modibo K. [1 ]
Hartline, Jason D.
Johnsen, Aleck
机构
[1] Northwestern Univ, Dept Econ, Evanston, IL 60208 USA
关键词
IMPLEMENTATION;
D O I
10.1109/FOCS46700.2020.00033
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
A rich class of mechanism design problems can be understood as incomplete-information games between a principal who commits to a policy and an agent who responds, with payoffs determined by an unknown state of the world. Traditionally, these models require strong and often-impractical assumptions about beliefs (a common prior over the state). In this paper, we dispense with the common prior. Instead, we consider a repeated interaction where both the principal and the agent may learn over time from the state history. We reformulate mechanism design as a reinforcement learning problem and develop mechanisms that attain natural benchmarks without any assumptions on the state-generating process. Our results make use of novel behavioral assumptions for the agent - based on counterfactual internal regret - that capture the spirit of rationality without relying on beliefs.(1)
引用
收藏
页码:259 / 270
页数:12
相关论文
共 50 条
  • [21] On the convergence of no-regret learning in selfish routing
    Krichene, Walid
    Drighes, Benjamin
    Bayen, Alexandre
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 163 - 171
  • [22] No-Regret Learning in Dynamic Stackelberg Games
    Lauffer, Niklas
    Ghasemi, Mahsa
    Hashemi, Abolfazl
    Savas, Yagiz
    Topcu, Ufuk
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (03) : 1418 - 1431
  • [23] Acceleration through Optimistic No-Regret Dynamics
    Wang, Jun-Kun
    Abernethy, Jacob
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Unifying convergence and no-regret in multiagent learning
    Banerjee, Bikramjit
    Peng, Jing
    LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 100 - 114
  • [25] Calibration and Internal No-Regret with Random Signals
    Perchet, Vianney
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2009, 5809 : 68 - 82
  • [26] No-regret Algorithms for Fair Resource Allocation
    Sinha, Abhishek
    Joshi, Ativ
    Bhattacharjee, Rajarshi
    Musco, Cameron
    Hajiesmaili, Mohammad
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [27] Manipulation Game Considering No-Regret Strategies
    Clempner, Julio B.
    MATHEMATICS, 2025, 13 (02)
  • [28] No-regret bayesian optimization with unknown hyperparameters
    Berkenkamp, Felix
    Schoellig, Angela P.
    Krause, Andreas
    Journal of Machine Learning Research, 2019, 20
  • [29] No-Regret Bayesian Optimization with Unknown Hyperparameters
    Berkenkamp, Felix
    Schoellig, Angela P.
    Krause, Andreas
    JOURNAL OF MACHINE LEARNING RESEARCH, 2019, 20
  • [30] Weighted Voting Via No-Regret Learning
    Haghtalab, Nika
    Noothigattu, Ritesh
    Procaccia, Ariel D.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1055 - 1062