Shortest Path Learning in Non-Stationary Enviroments via Online Convex Optimization

被引:0
|
作者
Vural, N. Mert [1 ]
Altas, Burak [2 ]
Ilhan, Fatih [1 ,2 ]
Kozat, Suleyman S. [1 ,2 ]
机构
[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey
[2] DataBoss AS, Ankara, Turkey
关键词
on-line learning shortest path; non-stationary environment; multi-armed bandit problem;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we study the online shortest path learning problem under semi-bandit feedback in adversarial and non-stationary environments. To develop an efficient algorithm, we use the online convex optimization framework. We introduce an optimal online shortest path algorithm that guarantees to obtain the performance of the shortest path sequence. Since we do not have any statistical assumptions on the path delays, the results in the paper are guaranteed to hold in an individual sequence manner. Hence, our algorithm can be used for a wide range of practical network optimization problems that require exploration and exploitation at the same time.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Online Learning and Online Convex Optimization
    Shalev-Shwartz, Shai
    FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (02): : 107 - 194
  • [32] Online Optimization in the Non-Stationary Cloud: Change Point Detection for Resource Provisioning
    Maghakian, Jessica
    Comden, Joshua
    Liu, Zhenhua
    2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
  • [33] Measures for non-stationary optimization tasks
    Trojanowski, K
    Obuchowicz, A
    ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, 2001, : 244 - 247
  • [34] Non-stationary kriging for design optimization
    Toal, D. J. J.
    Keane, A. J.
    ENGINEERING OPTIMIZATION, 2012, 44 (06) : 741 - 765
  • [35] Deep Reinforcement Learning for inventory optimization with non-stationary uncertain demand
    Dehaybe, Henri
    Catanzaro, Daniele
    Chevalier, Philippe
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 314 (02) : 433 - 445
  • [36] An Online Learning Framework for UAV Target Search Missions in Non-Stationary Environments
    Khial, Noor
    Mhaisen, Naram
    Mabrok, Mohamed
    Mohamed, Amr
    2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 753 - 758
  • [37] Adaptive stepsize selection for online Q-learning in a non-stationary environment
    Levy, Kim
    Vazquez-Abad, Felisa J.
    Costa, Andre
    WODES 2006: EIGHTH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, PROCEEDINGS, 2006, : 372 - +
  • [38] Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams
    De lange, Matthias
    Tuytelaars, Tinne
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8230 - 8239
  • [39] Learning for non-stationary Dirichlet processes
    Quinn, A.
    Karny, M.
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2007, 21 (10) : 827 - 855
  • [40] Social Learning in non-stationary environments
    Boursier, Etienne
    Perchet, Vianney
    Scarsini, Marco
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167