Linear Bandit Algorithms with Sublinear Time Complexity

被引:0
|
作者
Yang, Shuo [1 ]
Ren, Tongzheng [1 ]
Shakkottai, Sanjay [2 ]
Price, Eric [1 ]
Dhillon, Inderjit S. [1 ]
Sanghavi, Sujay [2 ]
机构
[1] Univ Texas Austin, Dept CS, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept ECE, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose two linear bandits algorithms with per-step complexity sublinear in the number of arms K. The algorithms are designed for applications where the arm set is extremely large and slowly changing. Our key realization is that choosing an arm reduces to a maximum inner product search (MIPS) problem, which can be solved approximately without breaking regret guarantees. Existing approximate MIPS solvers run in sublinear time. We extend those solvers and present theoretical guarantees for online learning problems, where adaptivity (i.e., a later step depends on the feedback in previous steps) becomes a unique challenge. We then explicitly characterize the tradeoff between the perstep complexity and regret. For sufficiently large K, our algorithms have sublinear per-step complexity and (O) over tilde(root T) regret. Empirically, we evaluate our proposed algorithms in a synthetic environment and a real-world online movie recommendation problem. Our proposed algorithms can deliver a more than 72 times speedup compared to the linear time baselines while retaining similar regret.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] SUBLINEAR TIME ALGORITHMS IN THE THEORY OF GROUPS AND SEMIGROUPS
    Shpilrain, Vladimir
    ILLINOIS JOURNAL OF MATHEMATICS, 2010, 54 (01) : 187 - 197
  • [22] TANGENT LINEAR ALGORITHMS AND COMPLEXITY
    MORGENSTERN, J
    COMPTES RENDUS HEBDOMADAIRES DES SEANCES DE L ACADEMIE DES SCIENCES SERIE A, 1973, 277 (09): : 367 - 369
  • [23] Fundamental limits and algorithms for sparse linear regression with sublinear sparsity
    Truong, Lan V.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [24] STABLE LINEAR TIME SUBLINEAR SPACE MERGING
    DVORAK, S
    DURIAN, B
    COMPUTER JOURNAL, 1987, 30 (04): : 372 - 375
  • [25] Sublinear Time Algorithms for Earth Mover’s Distance
    Khanh Do Ba
    Huy L. Nguyen
    Huy N. Nguyen
    Ronitt Rubinfeld
    Theory of Computing Systems, 2011, 48 : 428 - 442
  • [26] Sublinear Time Algorithms for Earth Mover's Distance
    Do Ba, Khanh
    Nguyen, Huy L.
    Nguyen, Huy N.
    Rubinfeld, Ronitt
    THEORY OF COMPUTING SYSTEMS, 2011, 48 (02) : 428 - 442
  • [27] Sublinear Time Algorithms for Greedy Selection in High Dimensions
    Chen, Qi
    Liu, Kai
    Yao, Ruilong
    Ding, Hu
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 346 - 356
  • [28] Sublinear-Time Algorithms for Compressive Phase Retrieval
    Li, Yi
    Nakos, Vasileios
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2020, 66 (11) : 7302 - 7310
  • [29] Sublinear-Time Algorithms for Compressive Phase Retrieval
    Li, Yi
    Nakos, Vasileios
    2018 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2018, : 2301 - 2305
  • [30] Bandit Multiclass Linear Classification: Efficient Algorithms for the Separable Case
    Beygelzimer, Alina
    Pal, David
    Szorenyi, Balazs
    Thiruvenkatachari, Devanathan
    Wei, Chen-Yu
    Zhang, Chicheng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97