Handling Concept Drift in Non-stationary Bandit Through Predicting Future Rewards

Authors
Tsai, Yun-Da [1]
Lin, Shou-De [1]
Affiliation
[1] Natl Taiwan Univ, Taipei, Taiwan
Source
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2024 WORKSHOPS, RAFDA AND IWTA | 2024 / Vol. 14658
DOI
10.1007/978-981-97-2650-9_13
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a study on the non-stationary stochastic multi-armed bandit (MAB) problem, which is relevant for addressing real-world challenges related to sequential decision-making. Our work involves a thorough analysis of state-of-the-art algorithms in dynamically changing environments. To address the limitations of existing methods, we propose the Concept Drift Adaptive Bandit (CDAB) framework, which aims to capture and predict potential future concept drift patterns in reward distribution, allowing for better adaptation in non-stationary environments. We conduct extensive numerical experiments to evaluate the effectiveness of the CDAB approach in comparison to both stationary and non-stationary state-of-the-art baselines. Our experiments involve testing on both artificial datasets and real-world data under different types of changing environments. The results show that the CDAB approach exhibits strong empirical performance, outperforming existing methods in all versions tested.
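The idea of acting on a predicted future reward, rather than on a historical average, can be illustrated with a small sketch. This is not the authors' CDAB framework, whose details are not given in this record. It is a generic illustration that assumes a two-armed Bernoulli bandit whose arm means drift linearly and cross over, an epsilon-greedy policy, and a least-squares trend fitted over a sliding window. The names `predict_reward` and `run`, and all parameter values, are hypothetical.

```python
import random


def predict_reward(history, t_next, window=30):
    """Extrapolate one arm's reward to time t_next with a least-squares line.

    history is a list of (time, reward) pairs observed for that arm.
    Only the most recent `window` observations are used.
    """
    recent = history[-window:]
    n = len(recent)
    if n == 0:
        return 1.0  # optimistic default so an untried arm gets selected
    mean_t = sum(t for t, _ in recent) / n
    mean_r = sum(r for _, r in recent) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in recent)
    if var_t == 0:
        return mean_r  # a single time point: no trend can be fitted
    cov = sum((t - mean_t) * (r - mean_r) for t, r in recent)
    return mean_r + (cov / var_t) * (t_next - mean_t)


def run(horizon=2000, epsilon=0.1, use_trend=True, seed=0):
    """Simulate epsilon-greedy on an assumed linearly drifting 2-armed bandit.

    Returns the total reward collected over `horizon` steps.
    """
    rng = random.Random(seed)
    histories = [[], []]
    total = 0.0
    for t in range(horizon):
        frac = t / horizon
        # Assumed drift: arm 0 decays from 0.9 to 0.1, arm 1 rises from 0.1 to 0.9.
        means = [0.9 - 0.8 * frac, 0.1 + 0.8 * frac]
        if rng.random() < epsilon:
            arm = rng.randrange(2)  # explore
        elif use_trend:
            scores = [predict_reward(h, t) for h in histories]
            arm = max(range(2), key=lambda a: scores[a])
        else:
            # Stationary baseline: plain sample mean over the full history.
            scores = [sum(r for _, r in h) / len(h) if h else 1.0
                      for h in histories]
            arm = max(range(2), key=lambda a: scores[a])
        reward = 1.0 if rng.random() < means[arm] else 0.0
        histories[arm].append((t, reward))
        total += reward
    return total


if __name__ == "__main__":
    print("trend prediction:", run(use_trend=True))
    print("sample mean:     ", run(use_trend=False))
```

Calling `run(use_trend=True)` and `run(use_trend=False)` with the same seed gives a rough side-by-side of the two scoring rules on this toy environment. The outcome depends on the seed, the window, and the drift shape, so it says nothing about the results reported in the paper.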
Pages: 161-173
Page count: 13