Adaptive Active Learning as a Multi-armed Bandit Problem

被引:2
|
作者
Czarnecki, Wojciech M. [1 ]
Podolak, Igor T. [1 ]
机构
[1] Jagiellonian Univ, Fac Math & Comp Sci, Krakow, Poland
关键词
D O I
10.3233/978-1-61499-419-0-989
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a new active learning strategy whose main focus is to have the ability to adapt to the unknown (or changing) learning scenario. We introduce the learners' ensemble based approach and model it as the multi-armed bandit problem. Presented application of simple exploration-exploitation trade-off algorithms from the UCB and EXP3 families show an improvement over using the classical strategies. Evaluation on data from UCI database compare three different selection algorithms. In our tests, presented method shows promising results.
引用
收藏
页码:989 / 990
页数:2
相关论文
共 50 条
  • [41] The Multi-Armed Bandit With Stochastic Plays
    Lesage-Landry, Antoine
    Taylor, Joshua A.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (07) : 2280 - 2286
  • [42] Satisficing in Multi-Armed Bandit Problems
    Reverdy, Paul
    Srivastava, Vaibhav
    Leonard, Naomi Ehrich
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3788 - 3803
  • [43] Achieving Regular and Fair Learning in Combinatorial Multi-Armed Bandit
    Wu, Xiaoyi
    Li, Bin
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 361 - 370
  • [44] Multi-armed Bandit with Additional Observations
    Yun, Donggyu
    Proutiere, Alexandre
    Ahn, Sumyeong
    Shin, Jinwoo
    Yi, Yung
    PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2018, 2 (01)
  • [45] IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT
    POHLENZ, S
    MARKOV PROCESS AND CONTROL THEORY, 1989, 54 : 158 - 163
  • [46] Active Learning in Multi-armed Bandits
    Antos, Andras
    Grover, Varun
    Szepesvari, Csaba
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 287 - +
  • [47] MULTI-ARMED BANDIT ALLOCATION INDEXES
    JONES, PW
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1989, 40 (12) : 1158 - 1159
  • [48] Multi-armed bandit heterogeneous ensemble learning for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Yang, Jiapeng
    COMPUTATIONAL INTELLIGENCE, 2023, 39 (02) : 344 - 368
  • [49] Learning the Truth in Social Networks Using Multi-Armed Bandit
    Odeyomi, Olusola T.
    IEEE ACCESS, 2020, 8 : 137692 - 137701
  • [50] Multi-Armed Bandit Learning for Content Provisioning in Network of UAVs
    Bhuyan, Amit Kumar
    Dutta, Hrishikesh
    Biswas, Subir
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1143 - 1148