VFLH: A Following-the-Leader-History Based Algorithm for Adaptive Online Convex Optimization with Stochastic Constraints

被引：1

作者：

Yang, Yifan ^{[1
]}

Chen, Lin ^{[2
]}

Zhou, Pan ^{[3
]}

Ding, Xiaofeng ^{[2
]}

机构：

[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA

[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Cyber Sci & Engn, Wuhan, Peoples R China

来源：

2023 IEEE 35TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI | 2023年

基金：

中国国家自然科学基金;

关键词：

Adaptive regret; online convex optimization; constrained optimization;

D O I：

10.1109/ICTAI59109.2023.00033

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper considers online convex optimization (OCO) with generated i.i.d. stochastic constraints, where the performance is measured by adaptive regret. The stochastic constraints are disclosed at each round to the learner after the decision is made. Different from the previous non-adaptive constrained OCO algorithm which is directly generalized from the static online gradient descent algorithm, we propose the novel Virtual Queue-based Following-the-Leader-History (VFLH) strategy to make the constrained OCO algorithm adaptive. In this framework, the algorithm generalizes experts that deal with the static constrained optimization problem within specified time intervals. Subsequently, it combines the predictions of active experts to produce a final choice and unify the average regret and constraints virtual queue. The algorithm's performance is evaluated based on two metrics: the bounds of constraint violation and regret. To address the difficulty of proving the constraint violation bound under the adaptive setting, we first employ the multi-objective drift analysis approach to handle the constraints virtual queue. Further analysis of the regret bound and the numerical results also supports the performance of the newly proposed algorithm.

引用

页码：172 / 177

页数：6

共 50 条

[31] A low complexity algorithm with O(√T) Regret and O(1) constraint violations for online convex optimization with long term constraints
Yu, Hao
Neely, Michael J.
Journal of Machine Learning Research, 2020, 21 : 1 - 24
[32] A Low Complexity Algorithm with O(√T) Regret and O(1) Constraint Violations for Online Convex Optimization with Long Term Constraints
Yu, Hao
Neely, Michael J.
JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
[33] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
Liu, Yaxian
Lu, Hui
Cheng, Shi
Shi, Yuhui
2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
[34] Tip-tilt adaptive correction based on stochastic parallel gradient descent optimization algorithm
Ma, Huimin
Zhang, Pengfei
Zhang, Jinghui
Qiao, Chunhong
Fan, Chengyu
OPTICAL DESIGN AND TESTING IV, 2010, 7849
[35] Adaptive-partitioning-based Stochastic optimization algorithm and its application to fuzzy control design
Han, Chang-Wook
Park, Jung-Il
ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 67 - 76
[36] A Momentum-Based Adaptive Primal-Dual Stochastic Gradient Method for Non-Convex Programs with Expectation Constraints
Qi, Rulei
Xue, Dan
Zhai, Yujia
MATHEMATICS, 2024, 12 (15)
[37] Success-History Based Adaptive Differential Evolution Algorithm for Discrete Structural Optimization
Kaveh A.
Biabani Hamedani K.
Iranian Journal of Science and Technology, Transactions of Civil Engineering, 2025, 49 (1) : 409 - 431
[38] A Virtual-Queue-Based Algorithm for Constrained Online Convex Optimization With Applications to Data Center Resource Allocation
Cao, Xuanyu
Zhang, Junshan
Poor, H. Vincent
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (04) : 703 - 716
[39] ALATO: An efficient intelligent algorithm for time optimization in an economic grid based on adaptive stochastic Petri net
Shojafar, Mohammad
Pooranian, Zahra
Meybodi, Mohammad Reza
Singhal, Mukesh
JOURNAL OF INTELLIGENT MANUFACTURING, 2015, 26 (04) : 641 - 658
[40] ALATO: An efficient intelligent algorithm for time optimization in an economic grid based on adaptive stochastic Petri net
Mohammad Shojafar
Zahra Pooranian
Mohammad Reza Meybodi
Mukesh Singhal
Journal of Intelligent Manufacturing, 2015, 26 : 641 - 658

← 1 2 3 4 5 →