Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit

Cited by: 2

Authors:

Li, Ke [1]

Yang, Yun [1]

Narisetty, Naveen N. [1]

Affiliations:

[1] Univ Illinois, Dept Stat, Champaign, IL 61820 USA

Source:

ELECTRONIC JOURNAL OF STATISTICS | 2021 / Vol. 15 / No. 2

Keywords:

Contextual linear bandit; high-dimension; minimax regret; sparsity; upper confidence bound; VARIABLE SELECTION;

DOI:

10.1214/21-EJS1909

Chinese Library Classification (CLC) number:

O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];

Subject classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

In this paper, we consider the multi-armed bandit problem with high-dimensional features. First, we prove a minimax lower bound, $\mathcal{O}\big((\log d)^{\frac{\alpha+1}{2}} T^{\frac{1-\alpha}{2}} + \log T\big)$, for the cumulative regret, in terms of the horizon $T$, the dimension $d$, and a margin parameter $\alpha \in [0, 1]$, which controls the separation between the optimal and the sub-optimal arms. This new lower bound unifies existing regret bound results that have different dependencies on $T$ due to the use of different values of the margin parameter $\alpha$ explicitly implied by their assumptions. Second, we propose a simple and computationally efficient algorithm inspired by the general Upper Confidence Bound (UCB) strategy that achieves a regret upper bound matching the lower bound. The proposed algorithm uses a properly centered $\ell_1$-ball as the confidence set, in contrast to the commonly used ellipsoid confidence set. In addition, the algorithm does not require any forced sampling step and is thereby adaptive to the practically unknown margin parameter. Simulations and a real data analysis are conducted to compare the proposed method with existing ones in the literature.
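
The abstract gives no details of the algorithm beyond the shape of its confidence set, so the following is only a minimal sketch of what a UCB rule over an $\ell_1$-ball might look like, not the authors' method. It assumes a single shared parameter vector, one feature vector per arm, and that an estimate `theta_hat` and a radius `radius` are already available from some sparse estimator; all of these names are hypothetical. The only fact it relies on is that the largest value of a linear function over an $\ell_1$-ball is the value at the center plus the radius times the $\ell_\infty$-norm of the feature vector.

```python
from typing import List, Sequence


def l1_ball_ucb_index(x: Sequence[float],
                      theta_hat: Sequence[float],
                      radius: float) -> float:
    """Optimistic reward for feature vector x over an l1-ball confidence set.

    Computes max { <x, theta> : ||theta - theta_hat||_1 <= radius },
    which equals <x, theta_hat> + radius * ||x||_inf because the dual
    norm of the l1-norm is the l_inf-norm.
    """
    point_estimate = sum(xi * ti for xi, ti in zip(x, theta_hat))
    bonus = radius * max(abs(xi) for xi in x)
    return point_estimate + bonus


def select_arm(contexts: List[Sequence[float]],
               theta_hat: Sequence[float],
               radius: float) -> int:
    """Return the index of the arm with the largest optimistic reward.

    theta_hat and radius are assumed inputs (hypothetical names); how
    they are estimated or tuned is not specified in the abstract.
    """
    indices = [l1_ball_ucb_index(x, theta_hat, radius) for x in contexts]
    return max(range(len(contexts)), key=indices.__getitem__)


if __name__ == "__main__":
    arms = [[1.0, 0.0], [0.0, 4.0]]
    estimate = [1.0, 0.0]
    # With zero radius the rule is purely greedy; a positive radius adds
    # an exploration bonus that grows with the largest feature magnitude.
    print(select_arm(arms, estimate, radius=0.0))
    print(select_arm(arms, estimate, radius=0.5))
```

Unlike an ellipsoidal bonus, this bonus needs no matrix inverse: it costs one pass over the feature vector, which is one plausible reading of why the abstract calls the approach computationally efficient.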

Citation

Pages: 5652-5695

Number of pages: 44
