Adaptive Active Learning as a Multi-armed Bandit Problem

被引：2

作者：

Czarnecki, Wojciech M. ^{[1
]}

Podolak, Igor T. ^{[1
]}

机构：

[1] Jagiellonian Univ, Fac Math & Comp Sci, Krakow, Poland

来源：

21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014) | 2014年 / 263卷

关键词：

D O I：

10.3233/978-1-61499-419-0-989

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present a new active learning strategy whose main focus is to have the ability to adapt to the unknown (or changing) learning scenario. We introduce the learners' ensemble based approach and model it as the multi-armed bandit problem. Presented application of simple exploration-exploitation trade-off algorithms from the UCB and EXP3 families show an improvement over using the classical strategies. Evaluation on data from UCI database compare three different selection algorithms. In our tests, presented method shows promising results.

引用

页码：989 / 990

页数：2

共 50 条

[41] The Multi-Armed Bandit With Stochastic Plays
Lesage-Landry, Antoine
Taylor, Joshua A.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2018, 63 (07) : 2280 - 2286
[42] Satisficing in Multi-Armed Bandit Problems
Reverdy, Paul
Srivastava, Vaibhav
Leonard, Naomi Ehrich
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3788 - 3803
[43] Achieving Regular and Fair Learning in Combinatorial Multi-Armed Bandit
Wu, Xiaoyi
Li, Bin
IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 361 - 370
[44] Multi-armed Bandit with Additional Observations
Yun, Donggyu
Proutiere, Alexandre
Ahn, Sumyeong
Shin, Jinwoo
Yi, Yung
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2018, 2 (01)
[45] IMPROVING STRATEGIES FOR THE MULTI-ARMED BANDIT
POHLENZ, S
MARKOV PROCESS AND CONTROL THEORY, 1989, 54 : 158 - 163
[46] Active Learning in Multi-armed Bandits
Antos, Andras
Grover, Varun
Szepesvari, Csaba
ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2008, 5254 : 287 - +
[47] MULTI-ARMED BANDIT ALLOCATION INDEXES
JONES, PW
JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1989, 40 (12) : 1158 - 1159
[48] Multi-armed bandit heterogeneous ensemble learning for imbalanced data
Dai, Qi
Liu, Jian-wei
Yang, Jiapeng
COMPUTATIONAL INTELLIGENCE, 2023, 39 (02) : 344 - 368
[49] Learning the Truth in Social Networks Using Multi-Armed Bandit
Odeyomi, Olusola T.
IEEE ACCESS, 2020, 8 : 137692 - 137701
[50] Multi-Armed Bandit Learning for Content Provisioning in Network of UAVs
Bhuyan, Amit Kumar
Dutta, Hrishikesh
Biswas, Subir
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 1143 - 1148

← 1 2 3 4 5 →