Online Sign Identification: Minimization of the Number of Errors in Thresholding Bandits

被引:0
|
作者
Ouhamma, Reda [1 ]
Degenne, Remy [1 ]
Gaillard, Pierre [2 ]
Perchet, Vianney [3 ,4 ]
机构
[1] Univ Lille, INRIA, CNRS, Cent Lille,UMR 9189,CRIStAL, F-59000 Lille, France
[2] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[3] Ensae, Crest, Paris, France
[4] Criteo AI Lab, Paris, France
关键词
ALLOCATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the fixed budget thresholding bandit problem, an algorithm sequentially allocates a budgeted number of samples to different distributions. It then predicts whether the mean of each distribution is larger or lower than a given threshold. We introduce a large family of algorithms (containing most existing relevant ones), inspired by the Frank-Wolfe algorithm, and provide a thorough yet generic analysis of their performance. This allowed us to construct new explicit algorithms, for a broad class of problems, whose losses are within a small constant factor of the non-adaptive oracle ones. Quite interestingly, we observed that adaptive methods empirically greatly out-perform non-adaptive oracles, an uncommon behavior in standard online learning settings, such as regret minimization. We explain this surprising phenomenon on an insightful toy problem.
引用
收藏
页数:13
相关论文
共 23 条
  • [1] Be Greedy: How Chromatic Number meets Regret Minimization in Graph Bandits
    Shreyas, S.
    Saha, Aadirupa
    Bhattacharyya, Chiranjib
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 595 - 605
  • [2] Analytical minimization of synchronicity errors in stochastic identification
    Bernal, D.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2018, 98 : 415 - 424
  • [3] Lower bounds for online makespan minimization on a small number of related machines
    Jez, Lukasz
    Schwartz, Jarett
    Sgall, Jiri
    Bekesi, Jozsef
    JOURNAL OF SCHEDULING, 2013, 16 (05) : 539 - 547
  • [4] Lower bounds for online makespan minimization on a small number of related machines
    Łukasz Jeż
    Jarett Schwartz
    Jiří Sgall
    József Békési
    Journal of Scheduling, 2013, 16 : 539 - 547
  • [5] EFFECT OF MENU SIGN POSITION ON CUSTOMER ORDERING TIMES AND NUMBER OF FOOD-ORDERING ERRORS
    FOSTER, RS
    AAMODT, MG
    BODENMILLER, JA
    RODGERS, JG
    KOVACH, RC
    BRYAN, DA
    ENVIRONMENT AND BEHAVIOR, 1988, 20 (02) : 200 - 210
  • [6] Loss minimization vector-controlled SPMSM based on online parameter identification
    Huang, Guanghua
    Zhang, Niaona
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 6517 - 6522
  • [7] DETECTION AND IDENTIFICATION OF TOPOLOGICAL ERRORS IN ONLINE POWER-SYSTEM ANALYSIS
    SINGH, N
    GLAVITSCH, H
    IEEE TRANSACTIONS ON POWER SYSTEMS, 1991, 6 (01) : 324 - 331
  • [8] IDENTIFICATION AND ESTIMATION OF ONLINE PRICE COMPETITION WITH AN UNKNOWN NUMBER OF FIRMS
    An, Yonghong
    Baye, Michael R.
    Hu, Yingyao
    Morgan, John
    Shum, Matt
    JOURNAL OF APPLIED ECONOMETRICS, 2017, 32 (01) : 80 - 102
  • [9] Identification of medication errors through a monitoring and minimization program in outpatients in Colombia, 2018-2019
    Enrique Machado-Duque, Manuel
    Enrique Machado-Alba, Jorge
    Gaviria-Mendoza, Andros
    Fernando Valladales-Restrepo, Luis
    Yadira Parrado-Fajardo, Ilsa
    Ospina-Castellanos, Mauren
    Fernanda Rojas-Chavarro, Luisa
    Alexander Lopez-Rincon, John
    BIOMEDICA, 2021, 41 (01):
  • [10] Minimization of model representativity errors in identification of point source emission from atmospheric concentration measurements
    Sharan, Maithili
    Singh, Amit Kumar
    Singh, Sarvesh Kumar
    ATMOSPHERIC ENVIRONMENT, 2017, 169 : 267 - 277