Adaptive Active Learning as a Multi-armed Bandit Problem

Cited by: 2

Authors:

Czarnecki, Wojciech M. [1]

Podolak, Igor T. [1]

Affiliation:

[1] Jagiellonian Univ, Fac Math & Comp Sci, Krakow, Poland

Source:

21ST EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE (ECAI 2014) | 2014 / Vol. 263

DOI:

10.3233/978-1-61499-419-0-989

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

In this paper, we present a new active learning strategy whose main goal is to adapt to an unknown (or changing) learning scenario. We introduce an approach based on an ensemble of learners and model it as a multi-armed bandit problem. The presented application of simple exploration-exploitation trade-off algorithms from the UCB and EXP3 families shows an improvement over the classical strategies. An evaluation on data from the UCI database compares three different selection algorithms. In our tests, the presented method shows promising results.
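
As a rough illustration of the bandit framing described in the abstract, the following minimal sketch treats each candidate query strategy as an arm and picks among them with the textbook UCB1 rule. It is not the authors' implementation: the names `ucb1_select` and `run_bandit`, the reward functions, and the Bernoulli reward rates are all hypothetical stand-ins (in practice the reward might be, for instance, the change in validation accuracy after a strategy's queried label is added).

```python
import math
import random


def ucb1_select(counts, sums, t):
    """Return the index of the arm to play at round t (1-based), per UCB1."""
    # Play every arm once before trusting the confidence bounds.
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    # Score = empirical mean reward + exploration bonus.
    scores = [
        sums[a] / counts[a] + math.sqrt(2.0 * math.log(t) / counts[a])
        for a in range(len(counts))
    ]
    return max(range(len(scores)), key=scores.__getitem__)


def run_bandit(reward_fns, rounds, seed=0):
    """Run UCB1 over hypothetical arms; return how often each arm was played.

    Each entry of reward_fns stands in for one active learning strategy and
    maps a random.Random instance to a reward in [0, 1] (an assumption).
    """
    rng = random.Random(seed)
    counts = [0] * len(reward_fns)
    sums = [0.0] * len(reward_fns)
    for t in range(1, rounds + 1):
        arm = ucb1_select(counts, sums, t)
        reward = reward_fns[arm](rng)
        counts[arm] += 1
        sums[arm] += reward
    return counts


# Three stand-in strategies with made-up Bernoulli reward rates.
strategies = [lambda rng, p=p: float(rng.random() < p) for p in (0.2, 0.5, 0.8)]
pulls = run_bandit(strategies, rounds=1000)
print(pulls)  # number of times each stand-in strategy was selected
```

UCB1 assumes stationary rewards; for the changing scenarios the abstract mentions, an adversarial-bandit rule from the EXP3 family would be the natural substitute in `ucb1_select`.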

Pages: 989-990

Page count: 2
