Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards

Cited by: 0
Authors
Tsai, Yun-Da [1]
Lin, Shou-De [1]
Affiliations
[1] Natl Taiwan Univ, Taipei, Taiwan
Source
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA | 2024 / Vol. 14658
Keywords
DOI
10.1007/978-981-97-2650-9_13
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a study on the non-stationary stochastic multi-armed bandit (MAB) problem, which is relevant for addressing real-world challenges related to sequential decision-making. Our work involves a thorough analysis of state-of-the-art algorithms in dynamically changing environments. To address the limitations of existing methods, we propose the Concept Drift Adaptive Bandit (CDAB) framework, which aims to capture and predict potential future concept drift patterns in reward distribution, allowing for better adaptation in non-stationary environments. We conduct extensive numerical experiments to evaluate the effectiveness of the CDAB approach in comparison to both stationary and non-stationary state-of-the-art baselines. Our experiments involve testing on both artificial datasets and real-world data under different types of changing environments. The results show that the CDAB approach exhibits strong empirical performance, outperforming existing methods in all versions tested.
Pages: 161-173
Number of pages: 13