Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards

被引:0
|
作者
Tsai, Yun-Da [1 ]
Lin, Shou-De [1 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
来源
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA | 2024年 / 14658卷
关键词
D O I
10.1007/978-981-97-2650-9_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a study on the non-stationary stochastic multi-armed bandit (MAB) problem, which is relevant for addressing real-world challenges related to sequential decision-making. Our work involves a thorough analysis of state-of-the-art algorithms in dynamically changing environments. To address the limitations of existing methods, we propose the Concept Drift Adaptive Bandit (CDAB) framework, which aims to capture and predict potential future concept drift patterns in reward distribution, allowing for better adaptation in non-stationary environments. We conduct extensive numerical experiments to evaluate the effectiveness of the CDAB approach in comparison to both stationary and non-stationary state-of-the-art baselines. Our experiments involve testing on both artificial datasets and real-world data under different types of changing environments. The results show that the CDAB approach exhibits strong empirical performance, outperforming existing methods in all versions tested.
引用
收藏
页码:161 / 173
页数:13
相关论文
共 50 条
  • [31] An Optimal Algorithm for Adversarial Bandit Problem with Multiple Plays in Non-Stationary Environments
    Vural, N. Mert
    Ozturk, Bugra
    Kozat, Suleyman S.
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [32] Homogenized Model of Non-Stationary Diffusion in Porous Media with the Drift
    Goncharenko, M.
    Khilkova, L.
    JOURNAL OF MATHEMATICAL PHYSICS ANALYSIS GEOMETRY, 2017, 13 (02) : 154 - 172
  • [33] Adaptive Drift Detection Mechanism for Non-Stationary Data Stream
    Nagendhiran, Nalini
    Kuppusamy, Lakshmanan
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2021, 20 (01)
  • [34] Experiments on the non-stationary flow through labyrinths
    Trutnovsky, K
    ZEITSCHRIFT DES VEREINES DEUTSCHER INGENIEURE, 1942, 86 : 609 - 611
  • [35] Some algorithms for correlated bandits with non-stationary rewards : Regret bounds and applications
    Mayekar, Prathamesh
    Hemachandra, Nandyala
    PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016,
  • [36] Contextual Multi-Armed Bandit With Costly Feature Observation in Non-Stationary Environments
    Ghoorchian, Saeed
    Kortukov, Evgenii
    Maghsudi, Setareh
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 820 - 830
  • [37] Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
    Koulouriotis, D. E.
    Xanthopoulos, A.
    APPLIED MATHEMATICS AND COMPUTATION, 2008, 196 (02) : 913 - 922
  • [38] LLM-Informed Multi-Armed Bandit Strategies for Non-Stationary Environments
    de Curto, J.
    de Zarza, I.
    Roig, Gemma
    Cano, Juan Carlos
    Manzoni, Pietro
    Calafate, Carlos T.
    ELECTRONICS, 2023, 12 (13)
  • [39] The concept of adjacency for stationary and non-stationary solutions of scalar semilinear parabolic PDE
    Wolfrum, M
    EQUADIFF 2003: INTERNATIONAL CONFERENCE ON DIFFERENTIAL EQUATIONS, 2005, : 678 - 683
  • [40] Solving Non-Stationary Bandit Problems by Random Sampling from Sibling Kalman Filters
    Granmo, Ole-Christoffer
    Berg, Stian
    TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT III, PROCEEDINGS, 2010, 6098 : 199 - 208