Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit

被引:2
|
作者
Li, Ke [1 ]
Yang, Yun [1 ]
Narisetty, Naveen N. [1 ]
机构
[1] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2021年 / 15卷 / 02期
关键词
Contextual linear bandit; high-dimension; minimax regret; sparsity; upper confidence bound; VARIABLE SELECTION;
D O I
10.1214/21-EJS1909
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, we consider the multi-armed bandit problem with high-dimensional features. First, we prove a minimax lower bound, O((log d)(alpha+1/2) T (1-alpha/2) + logT), for the cumulative regret, in terms of horizon T, dimension d and a margin parameter alpha is an element of [0, 1], which controls the separation between the optimal and the sub-optimal arms. This new lower bound unifies existing regret bound results that have different dependencies on T due to the use of different values of margin parameter a explicitly implied by their assumptions. Second, we propose a simple and computationally efficient algorithm inspired by the general Upper Confidence Bound (UCB) strategy that achieves a regret upper bound matching the lower bound. The proposed algorithm uses a properly centered l(1)-ball as the confidence set in contrast to the commonly used ellipsoid confidence set. In addition, the algorithm does not require any forced sampling step and is thereby adaptive to the practically unknown margin parameter. Simulations and a real data analysis are conducted to compare the proposed method with existing ones in the literature.
引用
收藏
页码:5652 / 5695
页数:44
相关论文
共 50 条
  • [31] A minimax optimal approach to high-dimensional double sparse linear regression
    Zhang, Yanhang
    Li, Zhifan
    Liu, Shixiang
    Yin, Jianxin
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25 : 1 - 66
  • [32] Optimal errors and phase transitions in high-dimensional generalized linear models
    Barbier, Jean
    Krzakala, Florent
    Macris, Nicolas
    Miolane, Leo
    Zdeborova, Lenka
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (12) : 5451 - 5460
  • [33] COMPUTATIONALLY EFFICIENT AND STATISTICALLY OPTIMAL ROBUST HIGH-DIMENSIONAL LINEAR REGRESSION
    Shen, Yinan
    Li, Jingyang
    Cai, Jian-feng
    Xia, Dong
    ANNALS OF STATISTICS, 2025, 53 (01): : 374 - 399
  • [34] A linear iteration time layout algorithm for visualising high-dimensional data
    Chalmers, M
    VISUALIZATION '96, PROCEEDINGS, 1996, : 127 - +
  • [35] On sparse linear discriminant analysis algorithm for high-dimensional data classification
    Ng, Michael K.
    Liao, Li-Zhi
    Zhang, Leihong
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2011, 18 (02) : 223 - 235
  • [36] HIGH-DIMENSIONAL LINEAR REGRESSION WITH HARD THRESHOLDING REGULARIZATION: THEORY AND ALGORITHM
    Kang, Lican
    Lai, Yanming
    Liu, Yanyan
    Luo, Yuan
    Zhang, Jing
    JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION, 2023, 19 (03) : 2104 - 2122
  • [37] AN EFFICIENT GREEDY SEARCH ALGORITHM FOR HIGH-DIMENSIONAL LINEAR DISCRIMINANT ANALYSIS
    Yang, Hannan
    Lin, Danyu
    Li, Quefeng
    STATISTICA SINICA, 2023, 33 : 1343 - 1364
  • [38] Best-of-Three-Worlds Linear Bandit Algorithm with Variance-Adaptive Regret Bounds
    Ito, Shinji
    Takemura, Kei
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [39] Optimal Poisson subsampling decorrelated score for high-dimensional generalized linear models
    Shan, Junhao
    Wang, Lei
    JOURNAL OF APPLIED STATISTICS, 2024, 51 (14) : 2719 - 2743
  • [40] Optimal equivariant prediction for high-dimensional linear models with arbitrary predictor covariance
    Dicker, Lee H.
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 1806 - 1834