Achieving fairness in the stochastic multi-armed bandit problem

被引:0
|
作者
Patil, Vishakha [1 ]
Ghalme, Ganesh [2 ]
Nair, Vineet [3 ]
Narahari, Y. [4 ]
机构
[1] Patil, Vishakha
[2] Ghalme, Ganesh
[3] Nair, Vineet
[4] Narahari, Y.
来源
| 1600年 / Microtome Publishing卷 / 22期
关键词
Reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:1 / 31
相关论文
共 50 条
  • [1] Achieving Fairness in the Stochastic Multi-Armed Bandit Problem
    Patil, Vishakha
    Ghalme, Ganesh
    Nair, Vineet
    Narahari, Y.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [2] Achieving Fairness in the Stochastic Multi-Armed Bandit Problem
    Patil, Vishakha
    Ghalme, Ganesh
    Nair, Vineet
    Narahari, Y.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5379 - 5386
  • [3] The Multi-Armed Bandit With Stochastic Plays
    Lesage-Landry, Antoine
    Taylor, Joshua A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (07) : 2280 - 2286
  • [4] The non-stationary stochastic multi-armed bandit problem
    Allesiardo R.
    Féraud R.
    Maillard O.-A.
    Allesiardo, Robin (robin.allesiardo@gmail.com), 1600, Springer Science and Business Media Deutschland GmbH (03): : 267 - 283
  • [5] Achieving Privacy in the Adversarial Multi-Armed Bandit
    Tossou, Aristide C. Y.
    Dimitrakakis, Christos
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2653 - 2659
  • [6] The budgeted multi-armed bandit problem
    Madani, O
    Lizotte, DJ
    Greiner, R
    LEARNING THEORY, PROCEEDINGS, 2004, 3120 : 643 - 645
  • [7] THE MULTI-ARMED BANDIT PROBLEM WITH COVARIATES
    Perchet, Vianney
    Rigollet, Philippe
    ANNALS OF STATISTICS, 2013, 41 (02): : 693 - 721
  • [8] Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem
    Madhushani, Udari
    Leonard, Naomi Ehrich
    2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 3502 - 3507
  • [9] Achieving Complete Learning in Multi-Armed Bandit Problems
    Vakili, Sattar
    Zhao, Qing
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 1778 - 1782
  • [10] ON MULTI-ARMED BANDIT PROBLEM WITH NUISANCE PARAMETER
    孙嘉阳
    Science China Mathematics, 1986, (05) : 464 - 475