Globally Informative Thompson Sampling for Structured Bandit Problems with Application to CrowdTranscoding

Cited by: 0

Authors:

Liu, Xingchi [1]

Derakhshani, Mahsa [1]

Zhu, Ziming [2]

Lambotharan, Sangarapillai [1]

Affiliations:

[1] Loughborough Univ, Wolfson Sch Mech Elect & Mfg Engn, Signal Proc & Networks Res Grp, Loughborough LE11 3TU, Leics, England

[2] Toshiba Europe Ltd, Bristol Res & Innovat Lab, Bristol BS1 4ND, Avon, England

Source:

3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021) | 2021

Funding:

UK Engineering and Physical Sciences Research Council (EPSRC)

Keywords:

Multi-armed bandit; Thompson sampling; Structured bandit; Edge computing

DOI:

10.1109/ICAIIC51459.2021.9415255

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The multi-armed bandit is a widely studied model for sequential decision-making problems. The most studied variant in the literature is the stochastic bandit, in which the reward of each arm follows an independent distribution. However, in a wide range of applications the rewards of different alternatives are correlated to some extent. In this paper, a class of structured bandit problems is studied in which the rewards of different arms are functions of the same unknown parameter vector. To minimize the cumulative learning regret, we propose a globally informative Thompson sampling algorithm that learns and leverages the correlation among arms and can handle an unknown multidimensional parameter and non-monotonic reward functions. Our studies demonstrate that the proposed algorithm significantly improves the learning speed. In particular, the designed algorithm is used to solve an edge transcoder selection problem in crowdsourced live video streaming systems and shows superior performance compared with existing schemes.
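The setting described in the abstract, where every arm's reward is a function of one shared unknown parameter vector, can be illustrated with a minimal Thompson sampling sketch. This is not the paper's algorithm: the finite parameter grid, the Gaussian reward noise, the example reward functions, and every name below (`structured_thompson_sampling`, `mean_fns`, `theta_grid`) are assumptions made purely for illustration.

```python
import math
import random


def structured_thompson_sampling(mean_fns, theta_grid, true_theta, horizon,
                                 noise_std=0.5, seed=0):
    """Thompson sampling over a shared parameter restricted to a finite grid.

    Illustrative assumptions (not taken from the paper): a uniform prior
    over theta_grid, Gaussian reward noise with known noise_std, and known
    mean reward functions mean_fns[k](theta).
    """
    rng = random.Random(seed)
    log_post = [0.0] * len(theta_grid)  # unnormalized log posterior
    best_mean = max(f(true_theta) for f in mean_fns)
    pulls = [0] * len(mean_fns)
    regret = 0.0
    for _ in range(horizon):
        # Sample one candidate parameter from the current posterior.
        top = max(log_post)
        weights = [math.exp(lp - top) for lp in log_post]
        theta = rng.choices(theta_grid, weights=weights, k=1)[0]
        # Play the arm that looks best under the sampled parameter.
        arm = max(range(len(mean_fns)), key=lambda k: mean_fns[k](theta))
        reward = mean_fns[arm](true_theta) + rng.gauss(0.0, noise_std)
        pulls[arm] += 1
        regret += best_mean - mean_fns[arm](true_theta)
        # A single observation updates the belief about the shared
        # parameter, and therefore about every arm at once.
        for i, cand in enumerate(theta_grid):
            err = reward - mean_fns[arm](cand)
            log_post[i] -= err * err / (2.0 * noise_std ** 2)
    return pulls, regret


# Hypothetical example: a two-dimensional parameter, three arms.
grid = [(a / 4, b / 4) for a in range(5) for b in range(5)]
arms = [
    lambda th: 1.0 - (th[0] - 0.5) ** 2,  # non-monotonic in th[0]
    lambda th: 0.5 + 0.5 * th[1],
    lambda th: 0.8 - abs(th[0] - th[1]),
]
pulls, regret = structured_thompson_sampling(arms, grid, (0.5, 0.25), 500)
```

The point of the sketch is the update step: because the posterior is kept over the shared parameter rather than over each arm separately, pulling one arm also sharpens the estimates of the others, which is the kind of correlation the abstract says the proposed algorithm exploits.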


Pages: 210-215

Page count: 6

