Shortest Path Learning in Non-Stationary Enviroments via Online Convex Optimization

被引:0
|
作者
Vural, N. Mert [1 ]
Altas, Burak [2 ]
Ilhan, Fatih [1 ,2 ]
Kozat, Suleyman S. [1 ,2 ]
机构
[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey
[2] DataBoss AS, Ankara, Turkey
关键词
on-line learning shortest path; non-stationary environment; multi-armed bandit problem;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we study the online shortest path learning problem under semi-bandit feedback in adversarial and non-stationary environments. To develop an efficient algorithm, we use the online convex optimization framework. We introduce an optimal online shortest path algorithm that guarantees to obtain the performance of the shortest path sequence. Since we do not have any statistical assumptions on the path delays, the results in the paper are guaranteed to hold in an individual sequence manner. Hence, our algorithm can be used for a wide range of practical network optimization problems that require exploration and exploitation at the same time.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Online Shortest Path Learning via Convex Optimization
    Vural, N. Mert
    Altas, Burak
    Ilhan, Fatih
    Kozat, Suleyman S.
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [2] Online Learning with Non-Convex Losses and Non-Stationary Regret
    Gao, Xiang
    Li, Xiaobo
    Zhang, Shuzhong
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 84, 2018, 84
  • [3] Bandit Convex Optimization in Non-stationary Environments
    Zhao, Peng
    Wang, Guanghui
    Zhang, Lijun
    Zhou, Zhi-Hua
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1508 - 1517
  • [4] Bandit Convex Optimization in Non-stationary Environments
    Zhao, Peng
    Wang, Guanghui
    Zhang, Lijun
    Zhou, Zhi-Hua
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [5] Online learning of non-stationary sequences
    Monteleoni, C
    Jaakkola, T
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 1093 - 1100
  • [6] Efficient Methods for Non-stationary Online Learning
    Zhao, Peng
    Xie, Yan-Feng
    Zhang, Lijun
    Zhou, Zhi-Hua
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Cascading Non-Stationary Bandits: Online Learning to Rank in the Non-Stationary Cascade Model
    Li, Chang
    de Rijke, Maarten
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2859 - 2865
  • [8] Online Learning for Non-Stationary A/B Tests
    Medina, Andres Munoz
    Vassilvitiskii, Sergei
    Yin, Dong
    CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 317 - 326
  • [9] A heterogeneous online learning ensemble for non-stationary environments
    Idrees, Mobin M.
    Minku, Leandro L.
    Stahl, Frederic
    Badii, Atta
    KNOWLEDGE-BASED SYSTEMS, 2020, 188
  • [10] Bilevel Online Deep Learning in Non-stationary Environment
    Han, Ya-nan
    Liu, Jian-wei
    Xiao, Bing-biao
    Wang, Xin-Tan
    Luo, Xiong-lin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 347 - 358