Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards

Times cited: 0

Authors:

Tsai, Yun-Da [1]

Lin, Shou-De [1]

Affiliation:

[1] Natl Taiwan Univ, Taipei, Taiwan

Source:

TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA | 2024 / Vol. 14658

DOI:

10.1007/978-981-97-2650-9_13

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a study on the non-stationary stochastic multi-armed bandit (MAB) problem, which is relevant for addressing real-world challenges related to sequential decision-making. Our work involves a thorough analysis of state-of-the-art algorithms in dynamically changing environments. To address the limitations of existing methods, we propose the Concept Drift Adaptive Bandit (CDAB) framework, which aims to capture and predict potential future concept drift patterns in reward distribution, allowing for better adaptation in non-stationary environments. We conduct extensive numerical experiments to evaluate the effectiveness of the CDAB approach in comparison to both stationary and non-stationary state-of-the-art baselines. Our experiments involve testing on both artificial datasets and real-world data under different types of changing environments. The results show that the CDAB approach exhibits strong empirical performance, outperforming existing methods in all versions tested.

Pages: 161-173

Page count: 13
