Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards

Cited by: 0

Authors:

Tsai, Yun-Da [1]

Lin, Shou-De [1]

Affiliations:

[1] Natl Taiwan Univ, Taipei, Taiwan

Source:

TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA | 2024, Vol. 14658

Keywords:

DOI:

10.1007/978-981-97-2650-9_13

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We present a study on the non-stationary stochastic multi-armed bandit (MAB) problem, which is relevant for addressing real-world challenges related to sequential decision-making. Our work involves a thorough analysis of state-of-the-art algorithms in dynamically changing environments. To address the limitations of existing methods, we propose the Concept Drift Adaptive Bandit (CDAB) framework, which aims to capture and predict potential future concept drift patterns in reward distribution, allowing for better adaptation in non-stationary environments. We conduct extensive numerical experiments to evaluate the effectiveness of the CDAB approach in comparison to both stationary and non-stationary state-of-the-art baselines. Our experiments involve testing on both artificial datasets and real-world data under different types of changing environments. The results show that the CDAB approach exhibits strong empirical performance, outperforming existing methods in all versions tested.


Pages: 161-173

Number of pages: 13

50 items in total

[31] An Optimal Algorithm for Adversarial Bandit Problem with Multiple Plays in Non-Stationary Environments
Vural, N. Mert
Ozturk, Bugra
Kozat, Suleyman S.
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020
[32] Homogenized Model of Non-Stationary Diffusion in Porous Media with the Drift
Goncharenko, M.
Khilkova, L.
JOURNAL OF MATHEMATICAL PHYSICS ANALYSIS GEOMETRY, 2017, 13 (02) : 154 - 172
[33] Adaptive Drift Detection Mechanism for Non-Stationary Data Stream
Nagendhiran, Nalini
Kuppusamy, Lakshmanan
JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2021, 20 (01)
[34] Experiments on the non-stationary flow through labyrinths
Trutnovsky, K
ZEITSCHRIFT DES VEREINES DEUTSCHER INGENIEURE, 1942, 86 : 609 - 611
[35] Some algorithms for correlated bandits with non-stationary rewards : Regret bounds and applications
Mayekar, Prathamesh
Hemachandra, Nandyala
PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016
[36] Contextual Multi-Armed Bandit With Costly Feature Observation in Non-Stationary Environments
Ghoorchian, Saeed
Kortukov, Evgenii
Maghsudi, Setareh
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 820 - 830
[37] Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
Koulouriotis, D. E.
Xanthopoulos, A.
APPLIED MATHEMATICS AND COMPUTATION, 2008, 196 (02) : 913 - 922
[38] LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments
de Curto, J.
de Zarza, I.
Roig, Gemma
Cano, Juan Carlos
Manzoni, Pietro
Calafate, Carlos T.
ELECTRONICS, 2023, 12 (13)
[39] The concept of adjacency for stationary and non-stationary solutions of scalar semilinear parabolic PDE
Wolfrum, M
EQUADIFF 2003: INTERNATIONAL CONFERENCE ON DIFFERENTIAL EQUATIONS, 2005 : 678 - 683
[40] Solving Non-Stationary Bandit Problems by Random Sampling from Sibling Kalman Filters
Granmo, Ole-Christoffer
Berg, Stian
TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT III, PROCEEDINGS, 2010, 6098 : 199 - 208
