Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits

被引:3
|
作者
Ghatak G. [1 ]
Mohanty H. [2 ]
Rahman A.U. [3 ]
机构
[1] Department of Electronics and Communication Engineering, Indraprastha Institute of Information Technology Delhi, New Delhi
[2] Department of Electrical Engineering, Indian Institute of Technology Kharagpur, Kharagpur
[3] CEMSE Division, King Abdullah University of Science and Technology, Thuwal
来源
关键词
5G; Edge-computing; Kolmogorov-Smirnov (KS) test; multiarmed bandits (MABs); Thompson sampling (TS);
D O I
10.1109/TAI.2021.3121653
中图分类号
学科分类号
摘要
We consider the nonstationary multiarmed bandit framework and propose a Kolmogorov-Smirnov (KS) test based Thompson sampling (TS) algorithm named TS-KS that actively detects change points and resets the TS parameters once a change is detected. In particular, for the two-armed bandit case, we derive bounds on the number of samples of the reward distribution to detect the change once it occurs. Consequently, we show that the proposed algorithm has sublinear regret. Contrary to existing works, our algorithm is able to detect a change when the underlying reward distribution changes even though the mean reward remains the same. Finally, to test the efficacy of the proposed algorithm, we employ it in the following two case-studies: first, task-offloading scenario in wireless edge-computing, and second, portfolio optimization. Our results show that the proposed TS-KS algorithm outperforms not only the static TS algorithm but also it performs better than other bandit algorithms designed for nonstationary environments. Moreover, the performance of TS-KS is at par with the state-of-the-art forecasting algorithms such as Facebook-PROPHET and ARIMA. © 2020 IEEE.
引用
收藏
页码:11 / 19
页数:8
相关论文
共 7 条
  • [1] A Change-Detection-Based Thompson Sampling Framework for Non-Stationary Bandits
    Ghatak, Gourab
    IEEE TRANSACTIONS ON COMPUTERS, 2021, 70 (10) : 1670 - 1676
  • [2] Detection of Spoofing Attacks on Global Navigation Satellite Systems Using Kolmogorov-Smirnov Test-Based Signal Quality Monitoring Method
    Zhou, Wenlong
    Lv, Zhiwei
    Li, Guangyun
    Jiao, Bo
    Wu, Wenbo
    IEEE SENSORS JOURNAL, 2024, 24 (07) : 10474 - 10490
  • [3] Conditional monitoring and fault detection of wind turbines based on Kolmogorov-Smirnov non-parametric test
    Ohunakin, Olayinka S.
    Henry, Emerald U.
    Matthew, Olaniran J.
    Ezekiel, Victor U.
    Adelekan, Damola S.
    Oyeniran, Ayodele T.
    ENERGY REPORTS, 2024, 11 : 2577 - 2591
  • [4] Thompson Sampling-Based Dynamic Spectrum Access in Non-Stationary Environments
    Ye, Shuai
    Wang, Tianyu
    Wang, Shaowei
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2023, 9 (03) : 593 - 603
  • [5] Dynamic Spectrum Access in Non-Stationary Environments: A Thompson Sampling Based Method
    Ye, Shuai
    Wang, Tianyu
    Wang, Shaowei
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3989 - 3994
  • [6] Schwarz Information Criterion Based Thompson Sampling for Dynamic Spectrum Access in Non-Stationary Environment
    He, Ming
    Jin, Ming
    Guo, Qinghua
    Xu, Weiqiang
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (03) : 737 - 741
  • [7] Multi-Robot Adaptive Sampling based on Mixture of Experts Approach to Modeling Non-Stationary Spatial Fields
    Masaba, Kizito
    Li, Alberto Quattrini
    2023 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS, MRS, 2023, : 191 - 198