Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Citations: 0

Authors:

Li, Chuanhao [1]

Wang, Hongning [1]

Affiliations:

[1] Univ Virginia, Charlottesville, VA 22903 USA

Source:

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022 / Vol. 151

Funding:

U.S. National Science Foundation;

Keywords:

DOI:

Not available

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret minimization while reducing communication cost becomes an open challenge. In this paper, we study linear contextual bandit in a federated learning setting. We propose a general framework with asynchronous model update and communication for a collection of homogeneous clients and heterogeneous clients, respectively. Rigorous theoretical analysis is provided about the regret and communication cost under this distributed learning framework; and extensive empirical evaluations demonstrate the effectiveness of our solution.


Pages: 6529-6553

Page count: 25

50 items in total

[1] Imitation Upper Confidence Bound for Bandits on a Graph
Lupu, Andrei
Precup, Doina
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 8113 - 8114
[2] Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits
Carpentier, Alexandra
Lazaric, Alessandro
Ghavamzadeh, Mohammad
Munos, Remi
Auer, Peter
ALGORITHMIC LEARNING THEORY, 2011, 6925 : 189 - +
[3] A Simple and Provably Efficient Algorithm for Asynchronous Federated Contextual Linear Bandits
He, Jiafan
Wang, Tianhao
Min, Yifei
Gu, Quanquan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[4] Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits
Roy, Kaushik
Zhang, Qi
Gaur, Manas
Sheth, Amit
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 35 - 50
[5] Federated Linear Contextual Bandits
Huang, Ruiquan
Wu, Weiqiang
Yang, Jing
Shen, Cong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[6] Multilevel Constrained Bandits: A Hierarchical Upper Confidence Bound Approach with Safety Guarantees
Baheri, Ali
MATHEMATICS, 2025, 13 (01)
[7] Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
Yang, Hantao
Liu, Xutong
Wang, Zhiyong
Xie, Hong
Lui, John C. S.
Lian, Defu
Chen, Enhong
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 20596 - 20603
[8] Byzantine-Robust Federated Linear Bandits
Jadbabaie, Ali
Li, Haochuan
Qian, Jian
Tian, Yi
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5206 - 5213
[9] Federated Linear Contextual Bandits with Heterogeneous Clients
Blaser, Ethan
Li, Chuanhao
Wang, Hongning
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
[10] Federated Linear Bandits with Finite Adversarial Actions
Fan, Li
Zhou, Ruida
Tian, Chao
Shen, Cong
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,