Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

被引:0
|
作者
Li, Chuanhao [1 ]
Wang, Hongning [1 ]
机构
[1] Univ Virginia, Charlottesville, VA 22903 USA
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret minimization while reducing communication cost becomes an open challenge. In this paper, we study linear contextual bandit in a federated learning setting. We propose a general framework with asynchronous model update and communication for a collection of homogeneous clients and heterogeneous clients, respectively. Rigorous theoretical analysis is provided about the regret and communication cost under this distributed learning framework; and extensive empirical evaluations demonstrate the effectiveness of our solution.
引用
收藏
页码:6529 / 6553
页数:25
相关论文
共 50 条
  • [1] Imitation Upper Confidence Bound for Bandits on a Graph
    Lupu, Andrei
    Precup, Doina
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8113 - 8114
  • [2] Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits
    Carpentier, Alexandra
    Lazaric, Alessandro
    Ghavamzadeh, Mohammad
    Munos, Remi
    Auer, Peter
    ALGORITHMIC LEARNING THEORY, 2011, 6925 : 189 - +
  • [3] A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
    He, Jiafan
    Wang, Tianhao
    Min, Yifei
    Gu, Quanquan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits
    Roy, Kaushik
    Zhang, Qi
    Gaur, Manas
    Sheth, Amit
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 35 - 50
  • [5] Federated Linear Contextual Bandits
    Huang, Ruiquan
    Wu, Weiqiang
    Yang, Jing
    Shen, Cong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Multilevel Constrained Bandits: A Hierarchical Upper Confidence Bound Approach with Safety Guarantees
    Baheri, Ali
    MATHEMATICS, 2025, 13 (01)
  • [7] Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
    Yang, Hantao
    Liu, Xutong
    Wang, Zhiyong
    Xie, Hong
    Lui, John C. S.
    Lian, Defu
    Chen, Enhong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20596 - 20603
  • [8] Byzantine-Robust Federated Linear Bandits
    Jadbabaie, Ali
    Li, Haochuan
    Qian, Jian
    Tian, Yi
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5206 - 5213
  • [9] Federated Linear Contextual Bandits with Heterogeneous Clients
    Blaser, Ethan
    Li, Chuanhao
    Wang, Hongning
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [10] Federated Linear Bandits with Finite Adversarial Actions
    Fan, Li
    Zhou, Ruida
    Tian, Chao
    Shen, Cong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,