Model-agnostic multi-stage loss optimization meta learning

被引:7
|
作者
Yao, Xiao [1 ]
Zhu, Jianlong [1 ]
Huo, Guanying [1 ]
Xu, Ning [1 ]
Liu, Xiaofeng [1 ]
Zhang, Ce [1 ]
机构
[1] Hohai Univ, Coll IoT Engn, Changzhou, Jiangsu, Peoples R China
关键词
Meta learning; Few-shot learning; Training instability; Multi-stage loss optimization;
D O I
10.1007/s13042-021-01316-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model Agnostic Meta Learning (MAML) has become the most representative meta learning algorithm to solve few-shot learning problems. This paper mainly discusses MAML framework, focusing on the key problem of solving few-shot learning through meta learning. However, MAML is sensitive to the base model for the inner loop, and training instability occur during the training process, resulting in an increase of the training difficulty of the model in the process of training and verification process, causing degradation of model performance. In order to solve these problems, we propose a multi-stage loss optimization meta-learning algorithm. By discussing a learning mechanism for inner and outer loops, it improves the training stability and accelerates the convergence for the model. The generalization ability of MAML has been enhanced.
引用
收藏
页码:2349 / 2363
页数:15
相关论文
共 50 条
  • [41] A Model-Agnostic Approach to Mitigate Gradient Interference for Multi-Task Learning
    Chai, Heyan
    Yin, Zhe
    Ding, Ye
    Liu, Li
    Fang, Binxing
    Liao, Qing
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (12) : 7810 - 7823
  • [42] Multi-Stage Optimization of Deep Learning Model to Detect Thoracic Complications
    Ratul, Rizwanul Hoque
    Husain, Farah Anjum
    Purnata, Tajmim Hossain
    Pomil, Rifat Alam
    Khandoker, Shaima
    Parvez, Mohammad Zavid
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 3000 - 3005
  • [43] Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
    Vuorio, Risto
    Sun, Shao-Hua
    Hu, Hexiang
    Lim, Joseph J.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [44] Visual analysis of meteorological satellite data via model-agnostic meta-learning
    Shiyu Cheng
    Hanwei Shen
    Guihua Shan
    Beifang Niu
    Weihua Bai
    Journal of Visualization, 2021, 24 : 301 - 315
  • [45] On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms
    Fallah, Alireza
    Mokhtari, Aryan
    Ozdaglar, Asuman
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1082 - 1091
  • [46] Specific Emitter Identification With Limited Samples: A Model-Agnostic Meta-Learning Approach
    Yang, Ning
    Zhang, Bangning
    Ding, Guoru
    Wei, Yimin
    Wei, Guofeng
    Wang, Jian
    Guo, Daoxing
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (02) : 345 - 349
  • [47] Visual analysis of meteorological satellite data via model-agnostic meta-learning
    Cheng, Shiyu
    Shen, Hanwei
    Shan, Guihua
    Niu, Beifang
    Bai, Weihua
    JOURNAL OF VISUALIZATION, 2021, 24 (02) : 301 - 315
  • [48] Responsible model deployment via model-agnostic uncertainty learning
    Lahoti, Preethi
    Gummadi, Krishna
    Weikum, Gerhard
    MACHINE LEARNING, 2023, 112 (03) : 939 - 970
  • [49] Model-Agnostic Multi-Agent Perception Framework
    Xu, Runsheng
    Chen, Weizhe
    Xiang, Hao
    Xia, Xin
    Liu, Lantao
    Ma, Jiaqi
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 1471 - 1478
  • [50] Stochastic Deep Networks with Linear Competing Units for Model-Agnostic Meta-Learning
    Kalais, Konstantinos
    Chatzis, Sotirios
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 10586 - 10597