Mean-Variance and Value at Risk in Multi-Armed Bandit Problems

被引：0

作者：

Vakili, Sattar ^{[1
]}

Zhao, Qing ^{[1
]}

机构：

[1] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14850 USA

来源：

2015 53RD ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON) | 2015年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We study risk-averse multi-armed bandit problems under different risk measures. We consider three risk mitigation models. In the first model, the variations in the reward values obtained at different times are considered as risk and the objective is to minimize the mean-variance of the observed rewards. In the second and the third models, the quantity of interest is the total reward at the end of the time horizon, and the objective is to minimize the mean-variance and maximize the value at risk of the total reward, respectively. We develop risk-averse online learning policies and analyze their regret performance. We also provide tight lower bounds on regret under the model of mean-variance of observations.

引用

页码：1330 / 1335

页数：6

共 50 条

[41] Maximal Expectation as Upper Confidence Bound for Multi-armed Bandit Problems
Kao, Kuo-Yuan
Chen, I-Hao
2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 325 - 329
[42] Empirical Gittins index strategies with ?-explorations for multi-armed bandit problems
Li, Xiao
Li, Yuqiang
Wu, Xianyi
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 180
[43] Thompson Sampling Based Mechanisms for Stochastic Multi-Armed Bandit Problems
Ghalme, Ganesh
Jain, Shweta
Gujar, Sujit
Narahari, Y.
AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 87 - 95
[44] Modeling Choice Variation in Search Strategies with Multi-armed Bandit Problems
Sharma, Neha
Dutt, Varun
2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND DATA SCIENCE (MLDS 2017), 2017, : 91 - 97
[45] Solving multi-armed bandit problems using a chaotic microresonator comb
Cuevas, Jonathan
Iwami, Ryugo
Uchida, Atsushi
Minoshima, Kaoru
Kuse, Naoya
APL PHOTONICS, 2024, 9 (03)
[46] ON MULTI-ARMED BANDIT PROBLEM WITH NUISANCE PARAMETER
孙嘉阳
Science China Mathematics, 1986, (05) : 464 - 475
[47] Multi-armed bandit algorithms and empirical evaluation
Vermorel, J
Mohri, M
MACHINE LEARNING: ECML 2005, PROCEEDINGS, 2005, 3720 : 437 - 448
[48] Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds
Ito, Shinji
Tsuchiya, Taira
Honda, Junya
CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
[49] Sustainable Cooperative Coevolution with a Multi-Armed Bandit
De Rainville, Francois-Michel
Sebag, Michele
Gagne, Christian
Schoenauer, Marc
Laurendeau, Denis
GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1517 - 1524
[50] Robust control of the multi-armed bandit problem
Caro, Felipe
Das Gupta, Aparupa
ANNALS OF OPERATIONS RESEARCH, 2022, 317 (02) : 461 - 480

← 1 2 3 4 5 →