Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

被引:0
|
作者
Ouhamma, Reda [1 ]
Degenne, Remy [1 ]
Gaillard, Pierre [2 ]
Perchet, Vianney [3 ,4 ]
机构
[1] Univ Lille, INRIA, CNRS, Cent Lille,UMR 9189,CRIStAL, F-59000 Lille, France
[2] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[3] Ensae, Crest, Paris, France
[4] Criteo AI Lab, Paris, France
关键词
ALLOCATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the fixed budget thresholding bandit problem, an algorithm sequentially allocates a budgeted number of samples to different distributions. It then predicts whether the mean of each distribution is larger or lower than a given threshold. We introduce a large family of algorithms (containing most existing relevant ones), inspired by the Frank-Wolfe algorithm, and provide a thorough yet generic analysis of their performance. This allowed us to construct new explicit algorithms, for a broad class of problems, whose losses are within a small constant factor of the non-adaptive oracle ones. Quite interestingly, we observed that adaptive methods empirically greatly out-perform non-adaptive oracles, an uncommon behavior in standard online learning settings, such as regret minimization. We explain this surprising phenomenon on an insightful toy problem.
引用
收藏
页数:13
相关论文
共 23 条
  • [21] Online Identification and Compensation of Drive and Phase Errors for Whole-Angle Hemispherical Resonator Gyroscope Based on Sinusoidal Self-Excitation
    Chen, Zhennan
    Yan, Kaichen
    Wang, Xiaoxu
    Qu, Tianliang
    Zhou, Jinling
    Zhang, Xi
    Che, Chicheng
    Gao, Pu
    Lu, Qianbo
    IEEE SENSORS JOURNAL, 2025, 25 (02) : 2362 - 2371
  • [22] Authentication using Robust Primary PIN (Personal Identification Number), Multifactor Authentication for Credit Card Swipe and Online Transactions Security
    Vaithyasubramanian, S.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 541 - 546
  • [23] Reliable Online Parameter Identification of Li-Ion Batteries in Battery Management Systems Using the Condition Number of the Error Covariance Matrix
    Kim, Minho
    Kim, Kwangrae
    Han, Soohee
    IEEE ACCESS, 2020, 8 (08): : 189106 - 189114