Multi-objective Bandits: Optimizing the Generalized Gini Index

Cited by: 0

Authors:

Busa-Fekete, Robert ^{[1]}

Szorenyi, Balazs ^{[2,3]}

Weng, Paul ^{[4,5]}

Mannor, Shie ^{[3]}

Affiliations:

[1] Yahoo Res, New York, NY USA

[2] Hungarian Acad Sci & Univ Szeged, Res Grp AI, Szeged, Hungary

[3] Technion Israel Inst Technol, Haifa, Israel

[4] SYSU, SEIT, SYSU CMU JIE, Guangzhou, Peoples R China

[5] SYSU CMU JRI, Shunde, Peoples R China

Source:

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70 | 2017 / Vol. 70

Funding:

European Research Council;

Keywords:

INEQUALITY; MODELS;

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We study the multi-armed bandit (MAB) problem where the agent receives vectorial feedback that encodes many possibly competing objectives to be optimized. The goal of the agent is to find a policy that optimizes these objectives simultaneously in a fair way. This multi-objective online optimization problem is formalized using the Generalized Gini Index (GGI) aggregation function. We propose an online gradient descent algorithm that exploits the convexity of the GGI aggregation function and carefully controls exploration, achieving a distribution-free regret of $\tilde{O}(T^{-1/2})$ with high probability. We test our algorithm on synthetic data as well as on an electric battery control problem, where the goal is to trade off the use of the different cells of a battery in order to balance their respective degradation rates.
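
The abstract does not spell out the GGI formula, so the following is only a minimal sketch under a commonly used definition, taken here as an assumption: the objectives are costs, the cost vector is sorted in decreasing order, and it is combined with non-increasing weights, so the worst objective receives the largest weight. The function name `ggi` and the example weights are hypothetical and chosen for illustration.

```python
def ggi(costs, weights):
    """Generalized Gini Index of a cost vector (assumed definition).

    Sorts the costs in decreasing order and takes the weighted sum with
    non-increasing weights, so the largest cost gets the largest weight.
    """
    if len(costs) != len(weights):
        raise ValueError("costs and weights must have the same length")
    # Assumption: weights are non-increasing, which makes the index convex
    # and penalizes unbalanced cost vectors.
    for i in range(len(weights) - 1):
        if weights[i] < weights[i + 1]:
            raise ValueError("weights must be non-increasing")
    sorted_costs = sorted(costs, reverse=True)
    return sum(w * c for w, c in zip(weights, sorted_costs))


# Hypothetical weights w_d = 1 / 2**(d - 1) for three objectives.
weights = [1.0, 0.5, 0.25]

balanced = ggi([2.0, 2.0, 2.0], weights)    # 3.5
unbalanced = ggi([4.0, 1.0, 1.0], weights)  # 4.75
```

Both cost vectors have the same total cost of 6, but the unbalanced one receives the higher index. Under the assumed definition, this is the sense in which minimizing the GGI favors a fair trade-off among objectives.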

Pages: 10
