Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-Start

被引:0
|
作者
Grazzi, Riccardo [1 ,2 ]
Pontil, Massimiliano [1 ,2 ]
Salzo, Saverio [3 ,4 ]
机构
[1] Ist Italiano Tecnol, Comp Stat & Machine Learning, Genoa, Italy
[2] UCL, London, England
[3] Univ Sapienza Roma, Rome, Italy
[4] Ist Italiano Tecnol, Comp Stat & Machine Learning, Genoa, Italy
关键词
bilevel optimization; warm-start; non-convex optimization; implicit differentiation; hypergradient; sample complexity;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We analyse a general class of bilevel problems, in which the upper-level problem consists in the minimization of a smooth objective function and the lower-level problem is to find the fixed point of a smooth contraction map. This type of problems include instances of meta-learning, equilibrium models, hyperparameter optimization and data poisoning adversarial attacks. Several recent works have proposed algorithms which warm-start the lower-level problem, i.e. they use the previous lower-level approximate solution as a staring point for the lower-level solver. This warm-start procedure allows one to improve the sample complexity in both the stochastic and deterministic settings, achieving in some cases the order-wise optimal sample complexity. However, there are situations, e.g., meta learning and equilibrium models, in which the warm-start procedure is not well-suited or ineffective. In this work we show that without warm-start, it is still possible to achieve order-wise (near) optimal sample complexity. In particular, we propose a simple method which uses (stochastic) fixed point iterations at the lower-level and projected inexact gradient descent at the upper-level, that reaches an epsilon-stationary point using O(is an element of(-2)) and O ($) over bar(is an element of(-1)) samples for the stochastic and the deterministic setting, respectively. Finally, compared to methods using warm-start, our approach yields a simpler analysis that does not need to study the coupled interactions between the upper-level and lower-level iterates.
引用
收藏
页数:37
相关论文
共 14 条
  • [1] Robust bilevel optimization for near-optimal lower-level solutions
    Besancon, Mathieu
    Anjos, Miguel F.
    Brotcorne, Luce
    JOURNAL OF GLOBAL OPTIMIZATION, 2024, 90 (04) : 813 - 842
  • [2] A METHOD FOR BILEVEL OPTIMIZATION WITH CONVEX LOWER-LEVEL PROBLEM
    Shen, Han
    Paternain, Santiago
    Liu, Gaowen
    Kompella, Ramana
    Chen, Tianyi
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 9426 - 9430
  • [3] Bilevel Optimization via Collaborations Among Lower-Level Optimization Tasks
    Huang, Pei-Qiu
    Zhang, Qingfu
    Wang, Yong
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (06) : 1837 - 1850
  • [4] A Real-Time Optimization with Warm-Start of Multiperiod AC Optimal Power Flows
    Kim, Youngdae
    Anitescu, Mihai
    ELECTRIC POWER SYSTEMS RESEARCH, 2020, 189 (189)
  • [5] Evolutionary Bilevel Optimization via Multiobjective Transformation-Based Lower-Level Search
    Chen, Lei
    Liu, Hai-Lin
    Li, Ke
    Tan, Kay Chen
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2024, 28 (03) : 733 - 747
  • [6] Projection-Free Methods for Stochastic Simple Bilevel Optimization with Convex Lower-level Problem
    Cao, Jincheng
    Jiang, Ruichen
    Abolfazli, Nazanin
    Hamedani, Erfan Yazdandoost
    Mokhtari, Aryan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [7] A Framework for Scalable Bilevel Optimization: Identifying and Utilizing the Interactions Between Upper-Level and Lower-Level Variables
    Huang, Pei-Qiu
    Wang, Yong
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2020, 24 (06) : 1150 - 1163
  • [8] Bilevel Optimization based on Kriging Approximations of Lower Level Optimal Value Function
    Sinha, Ankur
    Bedi, Samish
    Deb, Kalyanmoy
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 213 - 220
  • [9] Bilevel optimization with a multi-objective lower-level problem: risk-neutral and risk-averse formulations
    Giovannelli, T.
    Kent, G. D.
    Vicente, L. N.
    OPTIMIZATION METHODS & SOFTWARE, 2024, 39 (04): : 756 - 778
  • [10] On convex lower-level black-box constraints in bilevel optimization with an application to gas market models with chance constraints
    Holger Heitsch
    René Henrion
    Thomas Kleinert
    Martin Schmidt
    Journal of Global Optimization, 2022, 84 : 651 - 685