Multi-objective Bandits: Optimizing the Generalized Gini Index

Authors
Busa-Fekete, Robert [1]
Szorenyi, Balazs [2, 3]
Weng, Paul [4, 5]
Mannor, Shie [3]
Affiliations
[1] Yahoo Res, New York, NY USA
[2] Hungarian Acad Sci & Univ Szeged, Res Grp AI, Szeged, Hungary
[3] Technion Israel Inst Technol, Haifa, Israel
[4] SYSU, SEIT, SYSU CMU JIE, Guangzhou, Peoples R China
[5] SYSU CMU JRI, Shunde, Peoples R China
Funding
European Research Council
Keywords
INEQUALITY; MODELS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We study the multi-armed bandit (MAB) problem in which the agent receives vectorial feedback that encodes many possibly competing objectives to be optimized. The goal of the agent is to find a policy that optimizes these objectives simultaneously in a fair way. This multi-objective online optimization problem is formalized using the Generalized Gini Index (GGI) aggregation function. We propose an online gradient descent algorithm that exploits the convexity of the GGI aggregation function and carefully controls exploration, achieving a distribution-free regret of $\tilde{O}(T^{-1/2})$ with high probability. We test our algorithm on synthetic data as well as on an electric battery control problem, where the goal is to trade off the use of the different cells of a battery in order to balance their respective degradation rates.
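The abstract does not state the GGI formula, so the following is only a minimal sketch under assumptions: it uses the form common in the fairness literature, a weighted sum of the objective values sorted in decreasing order with non-increasing weights, and it treats the objectives as costs to be minimized. The function name `ggi` and the weights `2^{-(i-1)}` are illustrative choices, not taken from the paper.

```python
import numpy as np

def ggi(costs, weights):
    """Generalized Gini Index of a cost vector (assumed standard form).

    Assumes `weights` is non-increasing, so the largest cost receives
    the largest weight and unbalanced cost vectors are penalized.
    """
    costs = np.asarray(costs, dtype=float)
    weights = np.asarray(weights, dtype=float)
    # Sort costs in decreasing order, then take the weighted sum.
    return float(np.dot(weights, np.sort(costs)[::-1]))

# Hypothetical weights w_i = 2^{-(i-1)} for three objectives.
weights = np.array([1.0, 0.5, 0.25])

# Two cost vectors with the same total cost (6.0).
balanced = ggi([2.0, 2.0, 2.0], weights)
unbalanced = ggi([4.0, 1.0, 1.0], weights)
print(balanced, unbalanced)
```

Under these assumptions the balanced vector gets the lower index even though both vectors have the same total cost, which is the sense in which minimizing the GGI favors fair solutions.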
Pages: 10