Theoretical Convergence of Multi-Step Model-Agnostic Meta-Learning

Authors
Ji, Kaiyi [1]
Yang, Junjie [1]
Liang, Yingbin [1]
Affiliations
[1] Ohio State Univ, Dept Elect & Comp Engn, Columbus, OH 43210 USA
Funding
National Science Foundation (USA)
Keywords
Computational complexity; convergence rate; finite-sum; meta-learning; multi-step MAML; nonconvex; resampling
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
As a popular meta-learning approach, the model-agnostic meta-learning (MAML) algorithm has been widely used due to its simplicity and effectiveness. However, the convergence of the general multi-step MAML remains unexplored. In this paper, we develop a new theoretical framework to provide such a convergence guarantee for two types of objective functions that are of interest in practice: (a) the resampling case (e.g., reinforcement learning), where loss functions take the form of an expectation and new data are sampled as the algorithm runs; and (b) the finite-sum case (e.g., supervised learning), where loss functions take the finite-sum form with given samples. For both cases, we characterize the convergence rate and the computational complexity to attain an epsilon-accurate solution for multi-step MAML in the general nonconvex setting. In particular, our results suggest that the inner-stage stepsize needs to be chosen inversely proportional to the number N of inner-stage steps in order for N-step MAML to have guaranteed convergence. From the technical perspective, we develop novel techniques to deal with the nested structure of the meta gradient for multi-step MAML, which can be of independent interest.
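
The nested structure of the meta gradient and the role of the 1/N stepsize scaling can be illustrated with a toy example. The sketch below is not taken from the paper: it assumes scalar quadratic task losses l_i(w) = 0.5 * a_i * (w - c_i)^2, so each inner gradient step contributes a factor (1 - alpha * a_i) to the chain rule, and all function names (`inner_loop`, `meta_gradient`, `jacobian_factor`) are hypothetical. It only hints at why a stepsize proportional to 1/N keeps the product of N such factors bounded; it does not reproduce the paper's analysis.

```python
import math


def inner_loop(w, a, c, alpha, n_steps):
    """Run n_steps of gradient descent on l(w) = 0.5 * a * (w - c)**2."""
    for _ in range(n_steps):
        w = w - alpha * a * (w - c)  # gradient of the toy loss is a * (w - c)
    return w


def meta_loss(w, tasks, alpha, n_steps):
    """Average task loss evaluated at the adapted parameter w_N."""
    total = 0.0
    for a, c in tasks:
        w_n = inner_loop(w, a, c, alpha, n_steps)
        total += 0.5 * a * (w_n - c) ** 2
    return total / len(tasks)


def meta_gradient(w, tasks, alpha, n_steps):
    """Meta gradient via the chain rule through all inner steps.

    Each inner step multiplies the derivative by (1 - alpha * Hessian).
    For this toy quadratic the Hessian is the constant a.
    """
    total = 0.0
    for a, c in tasks:
        w_n = inner_loop(w, a, c, alpha, n_steps)
        jac = 1.0
        for _ in range(n_steps):
            jac *= 1.0 - alpha * a  # one nested factor per inner step
        total += jac * a * (w_n - c)
    return total / len(tasks)


def jacobian_factor(curvature, alpha, n_steps):
    """Magnitude of the product of n_steps factors (1 - alpha * curvature)."""
    return abs(1.0 - alpha * curvature) ** n_steps


if __name__ == "__main__":
    # Negative curvature mimics a nonconvex region (assumed value -1.0).
    for n in (1, 5, 25):
        fixed = jacobian_factor(-1.0, 0.5, n)       # stepsize independent of N
        scaled = jacobian_factor(-1.0, 0.5 / n, n)  # stepsize proportional to 1/N
        print(n, fixed, scaled)

    # A few meta-gradient descent steps on three hypothetical tasks (a_i, c_i).
    tasks = [(1.0, -1.0), (2.0, 0.5), (0.5, 2.0)]
    n_steps, w = 5, 0.3
    alpha = 0.5 / n_steps
    for _ in range(50):
        w -= 0.1 * meta_gradient(w, tasks, alpha, n_steps)
    print(w, meta_loss(w, tasks, alpha, n_steps))
```

With a fixed stepsize the factor grows geometrically in N under negative curvature, while with alpha = 0.5 / N it stays below exp(0.5) because (1 + x/N)^N < exp(x) for x > 0.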
Pages: 41