VFLH: A Following-the-Leader-History Based Algorithm for Adaptive Online Convex Optimization with Stochastic Constraints

被引:1
|
作者
Yang, Yifan [1 ]
Chen, Lin [2 ]
Zhou, Pan [3 ]
Ding, Xiaofeng [2 ]
机构
[1] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Cyber Sci & Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive regret; online convex optimization; constrained optimization;
D O I
10.1109/ICTAI59109.2023.00033
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper considers online convex optimization (OCO) with generated i.i.d. stochastic constraints, where the performance is measured by adaptive regret. The stochastic constraints are disclosed at each round to the learner after the decision is made. Different from the previous non-adaptive constrained OCO algorithm which is directly generalized from the static online gradient descent algorithm, we propose the novel Virtual Queue-based Following-the-Leader-History (VFLH) strategy to make the constrained OCO algorithm adaptive. In this framework, the algorithm generalizes experts that deal with the static constrained optimization problem within specified time intervals. Subsequently, it combines the predictions of active experts to produce a final choice and unify the average regret and constraints virtual queue. The algorithm's performance is evaluated based on two metrics: the bounds of constraint violation and regret. To address the difficulty of proving the constraint violation bound under the adaptive setting, we first employ the multi-objective drift analysis approach to handle the constraints virtual queue. Further analysis of the regret bound and the numerical results also supports the performance of the newly proposed algorithm.
引用
收藏
页码:172 / 177
页数:6
相关论文
共 50 条
  • [31] A low complexity algorithm with O(√T) Regret and O(1) constraint violations for online convex optimization with long term constraints
    Yu, Hao
    Neely, Michael J.
    Journal of Machine Learning Research, 2020, 21 : 1 - 24
  • [32] A Low Complexity Algorithm with O(√T) Regret and O(1) Constraint Violations for Online Convex Optimization with Long Term Constraints
    Yu, Hao
    Neely, Michael J.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [33] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
    Liu, Yaxian
    Lu, Hui
    Cheng, Shi
    Shi, Yuhui
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
  • [34] Tip-tilt adaptive correction based on stochastic parallel gradient descent optimization algorithm
    Ma, Huimin
    Zhang, Pengfei
    Zhang, Jinghui
    Qiao, Chunhong
    Fan, Chengyu
    OPTICAL DESIGN AND TESTING IV, 2010, 7849
  • [35] Adaptive-partitioning-based Stochastic optimization algorithm and its application to fuzzy control design
    Han, Chang-Wook
    Park, Jung-Il
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 3955 : 67 - 76
  • [36] A Momentum-Based Adaptive Primal-Dual Stochastic Gradient Method for Non-Convex Programs with Expectation Constraints
    Qi, Rulei
    Xue, Dan
    Zhai, Yujia
    MATHEMATICS, 2024, 12 (15)
  • [38] A Virtual-Queue-Based Algorithm for Constrained Online Convex Optimization With Applications to Data Center Resource Allocation
    Cao, Xuanyu
    Zhang, Junshan
    Poor, H. Vincent
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2018, 12 (04) : 703 - 716
  • [39] ALATO: An efficient intelligent algorithm for time optimization in an economic grid based on adaptive stochastic Petri net
    Shojafar, Mohammad
    Pooranian, Zahra
    Meybodi, Mohammad Reza
    Singhal, Mukesh
    JOURNAL OF INTELLIGENT MANUFACTURING, 2015, 26 (04) : 641 - 658
  • [40] ALATO: An efficient intelligent algorithm for time optimization in an economic grid based on adaptive stochastic Petri net
    Mohammad Shojafar
    Zahra Pooranian
    Mohammad Reza Meybodi
    Mukesh Singhal
    Journal of Intelligent Manufacturing, 2015, 26 : 641 - 658