Shortest Path Learning in Non-Stationary Enviroments via Online Convex Optimization

被引：0

作者：

Vural, N. Mert ^{[1
]}

Altas, Burak ^{[2
]}

Ilhan, Fatih ^{[1
,2
]}

Kozat, Suleyman S. ^{[1
,2
]}

机构：

[1] Bilkent Univ, Elekt & Elekt Muhendisligi Bolumu, Ankara, Turkey

[2] DataBoss AS, Ankara, Turkey

来源：

2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU) | 2020年

关键词：

on-line learning shortest path; non-stationary environment; multi-armed bandit problem;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we study the online shortest path learning problem under semi-bandit feedback in adversarial and non-stationary environments. To develop an efficient algorithm, we use the online convex optimization framework. We introduce an optimal online shortest path algorithm that guarantees to obtain the performance of the shortest path sequence. Since we do not have any statistical assumptions on the path delays, the results in the paper are guaranteed to hold in an individual sequence manner. Hence, our algorithm can be used for a wide range of practical network optimization problems that require exploration and exploitation at the same time.

引用

页数：5

共 50 条

[31] Online Learning and Online Convex Optimization
Shalev-Shwartz, Shai
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (02): : 107 - 194
[32] Online Optimization in the Non-Stationary Cloud: Change Point Detection for Resource Provisioning
Maghakian, Jessica
Comden, Joshua
Liu, Zhenhua
2019 53RD ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2019,
[33] Measures for non-stationary optimization tasks
Trojanowski, K
Obuchowicz, A
ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, 2001, : 244 - 247
[34] Non-stationary kriging for design optimization
Toal, D. J. J.
Keane, A. J.
ENGINEERING OPTIMIZATION, 2012, 44 (06) : 741 - 765
[35] Deep Reinforcement Learning for inventory optimization with non-stationary uncertain demand
Dehaybe, Henri
Catanzaro, Daniele
Chevalier, Philippe
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2024, 314 (02) : 433 - 445
[36] An Online Learning Framework for UAV Target Search Missions in Non-Stationary Environments
Khial, Noor
Mhaisen, Naram
Mabrok, Mohamed
Mohamed, Amr
2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 753 - 758
[37] Adaptive stepsize selection for online Q-learning in a non-stationary environment
Levy, Kim
Vazquez-Abad, Felisa J.
Costa, Andre
WODES 2006: EIGHTH INTERNATIONAL WORKSHOP ON DISCRETE EVENT SYSTEMS, PROCEEDINGS, 2006, : 372 - +
[38] Continual Prototype Evolution: Learning Online from Non-Stationary Data Streams
De lange, Matthias
Tuytelaars, Tinne
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8230 - 8239
[39] Learning for non-stationary Dirichlet processes
Quinn, A.
Karny, M.
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2007, 21 (10) : 827 - 855
[40] Social Learning in non-stationary environments
Boursier, Etienne
Perchet, Vianney
Scarsini, Marco
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167

← 1 2 3 4 5 →