Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning

Cited by: 0
Authors
Ce Zhang
Xiao Yao
Changfeng Shi
Min Gu
Affiliations
[1] The College of IoT Engineering
[2] Hohai University
[3] Business School of Hohai University
[4] The First People’s Hospital of Changzhou
Keywords
Machine learning; Few-shot learning; K-FAC; Second-order optimization; Adaptive learning rate;
DOI
Not available
Abstract
Among few-shot learning algorithms, model-agnostic meta-learning (MAML) stands out for its ability to adapt quickly to new tasks using only a small amount of labeled training data. However, its computational cost is high, because the second gradient update in MAML involves a large number of second-order terms. In addition, because neural-network training is non-convex, the loss landscape contains many flat regions, which slows convergence and makes training excessively long. This paper uses a second-order optimization method, Kronecker-factored Approximate Curvature (K-FAC), to approximate Natural Gradient Descent. K-FAC reduces the computational cost by approximating the large Fisher information matrix as the Kronecker product of two much smaller matrices, and it exploits second-order information to accelerate convergence. Moreover, because Natural Gradient Descent is sensitive to the learning rate, the paper proposes Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning (AK-MAML), which automatically adjusts the learning rate according to the curvature and improves training efficiency. Experimental results show that AK-MAML converges faster, requires less computation, and achieves higher accuracy on few-shot datasets.
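The Kronecker factorization mentioned in the abstract can be illustrated with a minimal NumPy sketch for a single fully connected layer. This is the standard K-FAC construction, not the paper's AK-MAML implementation: the function names `kfac_factors` and `kfac_precondition`, the `damping` parameter, and the random data are all assumptions made for illustration. The layer's Fisher block is approximated by the Kronecker product of an input-activation covariance `A` and an output-gradient covariance `S`, so the natural-gradient direction `S^{-1} G A^{-1}` needs only two small linear solves instead of one solve against the full matrix. The adaptive learning-rate rule is not detailed in the abstract and is not covered here.

```python
import numpy as np

def kfac_factors(acts, grads, damping=1e-3):
    """Estimate the two Kronecker factors for one linear layer.

    acts:  (N, d_in)  layer inputs for N samples
    grads: (N, d_out) loss gradients w.r.t. the layer's pre-activations
    damping: hypothetical Tikhonov term added to keep both factors invertible
    """
    n = acts.shape[0]
    A = acts.T @ acts / n + damping * np.eye(acts.shape[1])     # (d_in, d_in)
    S = grads.T @ grads / n + damping * np.eye(grads.shape[1])  # (d_out, d_out)
    return A, S

def kfac_precondition(grad_W, A, S):
    """Return S^{-1} @ grad_W @ A^{-1} using two small solves."""
    left = np.linalg.solve(S, grad_W)    # S^{-1} G
    return np.linalg.solve(A, left.T).T  # (S^{-1} G) A^{-1}; valid because A is symmetric

# Illustrative random data (assumed, not from the paper).
rng = np.random.default_rng(0)
acts = rng.normal(size=(64, 5))
grads = rng.normal(size=(64, 3))
grad_W = grads.T @ acts / acts.shape[0]  # (d_out, d_in) weight gradient

A, S = kfac_factors(acts, grads)
natural_grad = kfac_precondition(grad_W, A, S)

# Reference: solve against the explicit 15 x 15 Kronecker product.
# With NumPy's row-major flattening, vec(S^{-1} G A^{-1}) = (S kron A)^{-1} vec(G).
dense = np.linalg.solve(np.kron(S, A), grad_W.ravel()).reshape(grad_W.shape)
print(np.allclose(natural_grad, dense))
```

For a layer with `d_in` inputs and `d_out` outputs, the explicit Fisher block has `(d_in * d_out)^2` entries, whereas the two factors together have only `d_in^2 + d_out^2`, which is where the cost reduction comes from.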
Pages: 3169 - 3177
Page count: 8
Related papers
50 in total
  • [32] Specific Emitter Identification via Sparse Bayesian Learning Versus Model-Agnostic Meta-Learning
    He, Boxiang
    Wang, Fanggang
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 3677 - 3691
  • [33] Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
    Vuorio, Risto
    Sun, Shao-Hua
    Hu, Hexiang
    Lim, Joseph J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Specific Emitter Identification With Limited Samples: A Model-Agnostic Meta-Learning Approach
    Yang, Ning
    Zhang, Bangning
    Ding, Guoru
    Wei, Yimin
    Wei, Guofeng
    Wang, Jian
    Guo, Daoxing
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (02) : 345 - 349
  • [35] Adaptive model-agnostic meta-learning network for cross-machine fault diagnosis with limited samples
    Mu, Mingzhe
    Jiang, Hongkai
    Wang, Xin
    Dong, Yutong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 141
  • [36] On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms
    Fallah, Alireza
    Mokhtari, Aryan
    Ozdaglar, Asuman
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1082 - 1091
  • [37] Visual analysis of meteorological satellite data via model-agnostic meta-learning
    Cheng, Shiyu
    Shen, Hanwei
    Shan, Guihua
    Niu, Beifang
    Bai, Weihua
    JOURNAL OF VISUALIZATION, 2021, 24 (02) : 301 - 315
  • [38] Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning
    Kalais, Konstantinos
    Chatzis, Sotirios
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 10586 - 10597
  • [39] Few-shot RUL estimation based on model-agnostic meta-learning
    Mo, Yu
    Li, Liang
    Huang, Biqing
    Li, Xiu
    JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (05) : 2359 - 2372