Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning

被引:0
|
作者
Ce Zhang
Xiao Yao
Changfeng Shi
Min Gu
机构
[1] The College of IoT Engineering,
[2] Hohai University,undefined
[3] Business School of Hohai University,undefined
[4] The First People’s Hospital of Changzhou,undefined
关键词
Machine learning; Few-shot learning; K-FAC; Second-order optimization; Adaptive learning rate;
D O I
暂无
中图分类号
学科分类号
摘要
Model-agnostic meta-learning (MAML) highlights the ability to quickly adapt to new tasks with only a small amount of labeled training data among many few-shot learning algorithms. However, the computational complexity is high, because the MAML algorithm generates a large number of second-order parameters in the secondary gradient update. In addition, due to the non-convex nature of the neural network, the loss landscape has many flat areas, leading to slow convergence during training, and excessively long training. In this paper, a second-order optimization method called Kronecker-factored Approximate Curvature (K-FAC) is proposed to approximate Natural Gradient Descent. K-FAC reduces the computational complexity by approximating the large matrix of the Fisher information as the Kronecker product of two much smaller matrices, and the second-order parameter information is fully utilized to accelerate the convergence. Moreover, in order to solve the problem that Natural Gradient Descent is sensitive to the learning rate, this paper proposes Kronecker-factored Approximate Curvature with adaptive learning rate for optimizing model-agnostic meta-learning (AK-MAML), which automatically adjusts the learning rate according to the curvature and improves the efficiency of training. Experimental results show that AK-MAML has the ability of faster convergence, lower computation, and higher accuracy on few-shot datasets.
引用
收藏
页码:3169 / 3177
页数:8
相关论文
共 50 条
  • [41] Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning
    Kang, Jiawen
    Liu, Ruiqi
    Li, Lantian
    Cai, Yunqi
    Wang, Dong
    Zheng, Thomas Fang
    INTERSPEECH 2020, 2020, : 3825 - 3829
  • [42] SPEAKER ADAPTIVE TRAINING USING MODEL AGNOSTIC META-LEARNING
    Klejch, Ondrej
    Fainberg, Joachim
    Bell, Peter
    Renals, Steve
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 881 - 888
  • [43] Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning
    Yunpeng He
    Chuanzhi Zang
    Peng Zeng
    Qingwei Dong
    Ding Liu
    Yuqi Liu
    Neural Processing Letters, 2023, 55 : 505 - 518
  • [44] Model-Agnostic Learning to Meta-Learn
    Devos, Arnout
    Dandi, Yatin
    NEURIPS 2020 WORKSHOP ON PRE-REGISTRATION IN MACHINE LEARNING, VOL 148, 2020, 148 : 155 - 175
  • [45] Convolutional Shrinkage Neural Networks Based Model-Agnostic Meta-Learning for Few-Shot Learning
    He, Yunpeng
    Zang, Chuanzhi
    Zeng, Peng
    Dong, Qingwei
    Liu, Ding
    Liu, Yuqi
    NEURAL PROCESSING LETTERS, 2023, 55 (01) : 505 - 518
  • [46] Meta-LSTM in hydrology: Advancing runoff predictions through model-agnostic meta-learning
    Cai, Kaixuan
    He, Jinxin
    Li, Qingliang
    Wei, Shangguan
    Li, Lu
    Hu, Huiming
    JOURNAL OF HYDROLOGY, 2024, 639
  • [47] Learning From Visual Demonstrations via Replayed Task-Contrastive Model-Agnostic Meta-Learning
    Hu, Ziye
    Li, Wei
    Gan, Zhongxue
    Guo, Weikun
    Zhu, Jiwei
    Wen, James Zhiqing
    Zhou, Decheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8756 - 8767
  • [48] Model-Agnostic Meta-Learning for Fast Text-Dependent Speaker Embedding Adaptation
    Lin, Weiwei
    Mak, Man-Wai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1866 - 1876
  • [49] Few-Shot Bearing Anomaly Detection via Model-Agnostic Meta-Learning
    Zhang, Shen
    Ye, Fei
    Wang, Bingnan
    Habetler, Thomas G.
    2020 23RD INTERNATIONAL CONFERENCE ON ELECTRICAL MACHINES AND SYSTEMS (ICEMS), 2020, : 1341 - 1346
  • [50] Few-Shot Bearing Fault Diagnosis Based on Model-Agnostic Meta-Learning
    Zhang, Shen
    Ye, Fei
    Wang, Bingnan
    Habetler, Thomas G.
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2021, 57 (05) : 4754 - 4764