Linear Bandit Algorithms with Sublinear Time Complexity

被引:0
|
作者
Yang, Shuo [1 ]
Ren, Tongzheng [1 ]
Shakkottai, Sanjay [2 ]
Price, Eric [1 ]
Dhillon, Inderjit S. [1 ]
Sanghavi, Sujay [2 ]
机构
[1] Univ Texas Austin, Dept CS, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept ECE, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose two linear bandits algorithms with per-step complexity sublinear in the number of arms K. The algorithms are designed for applications where the arm set is extremely large and slowly changing. Our key realization is that choosing an arm reduces to a maximum inner product search (MIPS) problem, which can be solved approximately without breaking regret guarantees. Existing approximate MIPS solvers run in sublinear time. We extend those solvers and present theoretical guarantees for online learning problems, where adaptivity (i.e., a later step depends on the feedback in previous steps) becomes a unique challenge. We then explicitly characterize the tradeoff between the perstep complexity and regret. For sufficiently large K, our algorithms have sublinear per-step complexity and (O) over tilde(root T) regret. Empirically, we evaluate our proposed algorithms in a synthetic environment and a real-world online movie recommendation problem. Our proposed algorithms can deliver a more than 72 times speedup compared to the linear time baselines while retaining similar regret.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Sublinear Time Algorithms and Complexity of Approximate Maximum Matching
    Behnezhad, Soheil
    Roghani, Mohammad
    Rubinstein, Aviad
    PROCEEDINGS OF THE 55TH ANNUAL ACM SYMPOSIUM ON THEORY OF COMPUTING, STOC 2023, 2023, : 267 - 280
  • [2] Linear and sublinear time algorithms for the basis of abelian groups
    Chen, Li
    Fu, Bin
    THEORETICAL COMPUTER SCIENCE, 2011, 412 (32) : 4110 - 4122
  • [3] SUBLINEAR TIME ALGORITHMS
    Rubinfeld, Ronitt
    Shapira, Asaf
    SIAM JOURNAL ON DISCRETE MATHEMATICS, 2011, 25 (04) : 1562 - 1588
  • [4] Linear Algorithms in Sublinear Time-a Tutorial on Statistical Estimation
    Ullrich, Torsten
    Fellner, Dieter W.
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2011, 31 (02) : 58 - 66
  • [5] SUBLINEAR-TIME ALGORITHMS
    Woeginger, Gerhard J.
    Czumaj, Artur
    Sohler, Christian
    BULLETIN OF THE EUROPEAN ASSOCIATION FOR THEORETICAL COMPUTER SCIENCE, 2006, (89): : 23 - 47
  • [6] Sublinear-time Algorithms
    Czumaj, Artur
    Sohler, Christian
    PROPERTY TESTING: CURRENT RESEARCH AND SURVEYS, 2010, 6390 : 41 - +
  • [7] Sublinear Computation Paradigm: Constant-Time Algorithms and Sublinear Progressive Algorithms
    Chiba, Kyohei
    Ito, Hiro
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2022, E105A (03) : 131 - 141
  • [8] Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time
    Swamidass, S. Joshua
    Baldi, Pierre
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (02) : 302 - 317
  • [9] Metric Sublinear Algorithms via Linear Sampling
    Esfandiari, Hossein
    Mitzenmacher, Michael
    2018 IEEE 59TH ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2018, : 11 - 22
  • [10] Quantum Meets Fine-Grained Complexity: Sublinear Time Quantum Algorithms for String Problems
    François Le Gall
    Saeed Seddighin
    Algorithmica, 2023, 85 : 1251 - 1286