Linear Bandit Algorithms with Sublinear Time Complexity

被引:0
|
作者
Yang, Shuo [1 ]
Ren, Tongzheng [1 ]
Shakkottai, Sanjay [2 ]
Price, Eric [1 ]
Dhillon, Inderjit S. [1 ]
Sanghavi, Sujay [2 ]
机构
[1] Univ Texas Austin, Dept CS, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept ECE, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose two linear bandits algorithms with per-step complexity sublinear in the number of arms K. The algorithms are designed for applications where the arm set is extremely large and slowly changing. Our key realization is that choosing an arm reduces to a maximum inner product search (MIPS) problem, which can be solved approximately without breaking regret guarantees. Existing approximate MIPS solvers run in sublinear time. We extend those solvers and present theoretical guarantees for online learning problems, where adaptivity (i.e., a later step depends on the feedback in previous steps) becomes a unique challenge. We then explicitly characterize the tradeoff between the perstep complexity and regret. For sufficiently large K, our algorithms have sublinear per-step complexity and (O) over tilde(root T) regret. Empirically, we evaluate our proposed algorithms in a synthetic environment and a real-world online movie recommendation problem. Our proposed algorithms can deliver a more than 72 times speedup compared to the linear time baselines while retaining similar regret.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Sublinear geometric algorithms
    Chazelle, B
    Liu, D
    Magen, A
    SIAM JOURNAL ON COMPUTING, 2006, 35 (03) : 627 - 646
  • [42] Improved approximation guarantees for sublinear-time Fourier algorithms
    Iwen, Mark A.
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2013, 34 (01) : 57 - 82
  • [43] COMPLEXITY OF ALGORITHMS AND THEIR OPERATION TIME
    PETRI, NV
    DOKLADY AKADEMII NAUK SSSR, 1969, 186 (01): : 30 - &
  • [44] On time complexity analysis of algorithms
    Tarek, Ahmed
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 3, 2005, : 221 - 226
  • [45] SUBLINEAR-TIME PARALLEL ALGORITHMS FOR MATCHING AND RELATED PROBLEMS
    GOLDBERG, AV
    PLOTKIN, SA
    VAIDYA, PM
    JOURNAL OF ALGORITHMS, 1993, 14 (02) : 180 - 213
  • [46] Erasure-resilient sublinear-time graph algorithms
    Levi, Amit
    Pallavoor, Ramesh Krishnan S.
    Raskhodnikova, Sofya
    Varma, Nithin
    Leibniz International Proceedings in Informatics, LIPIcs, 2021, 185
  • [47] Oracle-Efficient Algorithms for Online Linear Optimization with Bandit Feedback
    Ito, Shinji
    Hatano, Daisuke
    Sumita, Hanna
    Takemura, Kei
    Fukunaga, Takuro
    Kakimura, Naonori
    Kawarabayashi, Ken-ichi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [48] READING MANY VARIABLES IN ONE ATOMIC OPERATION - SOLUTIONS WITH LINEAR OR SUBLINEAR COMPLEXITY
    KIROUSIS, LM
    SPIRAKIS, P
    TSIGAS, P
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (07) : 688 - 696
  • [49] READING MANY VARIABLES IN ONE ATOMIC OPERATION SOLUTIONS WITH LINEAR OR SUBLINEAR COMPLEXITY
    KIROUSIS, LM
    SPIRAKIS, P
    TSIGAS, P
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 579 : 229 - 241
  • [50] On the character of words of sublinear complexity
    Zamboni, Luca Q.
    ACTA ARITHMETICA, 2018, 184 (03) : 201 - 213