Regret Analysis of Repeated Delegated Choice

被引:0
|
作者
Hajiaghayi, MohammadTaghi [1 ]
Mahdavi, Mohammad [1 ]
Rezaei, Keivan [1 ]
Shin, Suho [1 ]
机构
[1] Univ Maryland, College Pk, MD 20742 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a study on a repeated delegated choice problem, which is the first to consider an online learning variant of Kleinberg and Kleinberg, EC'18. In this model, a principal interacts repeatedly with an agent who possesses an exogenous set of solutions to search for efficient ones. Each solution can yield varying utility for both the principal and the agent, and the agent may propose a solution to maximize its own utility in a selfish manner. To mitigate this behavior, the principal announces an eligible set which screens out a certain set of solutions. The principal, however, does not have any information on the distribution of solutions nor the number of solutions in advance. Therefore, the principal dynamically announces various eligible sets to efficiently learn the distribution. The principal's objective is to minimize cumulative regret compared to the optimal eligible set in hindsight. We explore two dimensions of the problem setup, whether the agent behaves myopically or strategizes across the rounds, and whether the solutions yield deterministic or stochastic utility. We obtain sublinear regret upper bounds in various regimes, and derive corresponding lower bounds which implies the tightness of the results. Overall, we bridge a well-known problem in economics to the evolving area of online learning, and present a comprehensive study in this problem.
引用
收藏
页码:9757 / 9764
页数:8
相关论文
共 50 条
  • [1] Simple Delegated Choice
    Khodabakhsh, Ali
    Pountourakis, Emmanouil
    Taggart, Samuel
    PROCEEDINGS OF THE 2024 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2024, : 569 - 590
  • [2] EFFECTS OF DELEGATED CHOICE ON PERFORMANCE
    COSIER, RA
    APLIN, JC
    PERSONNEL PSYCHOLOGY, 1980, 33 (03) : 581 - 593
  • [3] A Model of Delegated Project Choice
    Armstrong, Mark
    Vickers, John
    ECONOMETRICA, 2010, 78 (01) : 213 - 244
  • [4] News media and delegated information choice
    Nimark, Kristoffer P.
    Pitschner, Stefan
    JOURNAL OF ECONOMIC THEORY, 2019, 181 : 160 - 196
  • [5] Regret, choice, and outcome
    Matarazzo, Olimpia
    Abbamonte, Lucia
    World Academy of Science, Engineering and Technology, 2009, 40 : 420 - 428
  • [6] Convergence Analysis of No-Regret Bidding Algorithms in Repeated Auctions
    Feng, Zhe
    Guruganesh, Guru
    Liaw, Christopher
    Mehta, Aranyak
    Sethi, Abhishek
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 5399 - 5406
  • [7] Policy Regret in Repeated Games
    Arora, Raman
    Dinitz, Michael
    Marinov, Teodor V.
    Mohri, Mehryar
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [8] Expanding the applicability of random regret minimization for route choice analysis
    Prato, Carlo Giacomo
    TRANSPORTATION, 2014, 41 (02) : 351 - 375
  • [9] Expanding the applicability of random regret minimization for route choice analysis
    Carlo Giacomo Prato
    Transportation, 2014, 41 : 351 - 375
  • [10] Regret theory and the tyranny of choice
    Irons, Ben
    Hepburn, Cameron
    ECONOMIC RECORD, 2007, 83 (261) : 191 - 203