Non-stationary Bandits with Heavy Tail

被引:0
|
作者
Pan, Weici [1 ]
Liu, Zhenhua [1 ]
机构
[1] Stony Brook University, United States
来源
Performance Evaluation Review | 2024年 / 52卷 / 02期
关键词
Gaussian assumption - Heavy-tailed - Heavy-tails - Multiarmed bandits (MABs) - Nonstationary - Performance - Risk neutrals - Sub-Gaussians;
D O I
10.1145/3695411.3695424
中图分类号
学科分类号
摘要
In this study, we investigate the performance of multi-armed bandit algorithms in environments characterized by heavytailed and non-stationary reward distributions, a setting that deviates from the conventional risk-neutral and sub- Gaussian assumptions. © 2024 Copyright is held by the owner/author(s).
引用
收藏
页码:33 / 35
相关论文
共 50 条
  • [31] Some algorithms for correlated bandits with non-stationary rewards : Regret bounds and applications
    Mayekar, Prathamesh
    Hemachandra, Nandyala
    PROCEEDINGS OF THE THIRD ACM IKDD CONFERENCE ON DATA SCIENCES (CODS), 2016,
  • [32] Non-Stationary Linear Bandits With Dimensionality Reduction for Large-Scale Recommender Systems
    Ghoorchian, Saeed
    Kortukov, Evgenii
    Maghsudi, Setareh
    IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2024, 5 : 548 - 558
  • [33] Implementation of Exploration in TONIC Using Non-stationary Volatile Multi-arm Bandits
    Shaha, Aditya
    Arya, Dhruv
    Tripathy, B. K.
    SOFT COMPUTING FOR PROBLEM SOLVING, SOCPROS 2018, VOL 1, 2020, 1048 : 239 - 250
  • [34] A Risk-Averse Framework for Non-Stationary Stochastic Multi-Armed Bandits
    Alami, Reda
    Mahfoud, Mohammed
    Achab, Mastane
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 272 - 280
  • [35] Contextual Multi-Armed Bandits for Non-Stationary Heterogeneous Mobile Edge Computing
    Wirth, Maximilian
    Ortiz, Andrea
    Klein, Anja
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 5599 - 5604
  • [36] A New Algorithm for Non-stationary Contextual Bandits: Efficient, Optimal, and Parameter-free
    Chen, Yifang
    Lee, Chung-Wei
    Luo, Haipeng
    Wei, Chen-Yu
    CONFERENCE ON LEARNING THEORY, VOL 99, 2019, 99
  • [37] An Optimization-based Algorithm for Non-stationary Kernel Bandits without Prior Knowledge
    Hong, Kihyuk
    Li, Yuhang
    Tewari, Ambuj
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [38] Minimax Optimal Bandits for Heavy Tail Rewards
    Lee, Kyungjae
    Lim, Sungbin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 35 (04) : 4899 - 4901
  • [39] Risk-Aware Bandits for Digital Twin Placement in Non-Stationary Mobile Edge Computing
    Wirth, Maximilian
    Ortiz, Andrea
    Klein, Anja
    2024 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS, ICC WORKSHOPS 2024, 2024, : 13 - 18
  • [40] Stochastic Multi-Armed Bandits with Non-Stationary Rewards Generated by a Linear Dynamical System
    Gornet, Jonathan
    Hosseinzadeh, Mehdi
    Sinopoli, Bruno
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 1460 - 1465