Model-agnostic multi-stage loss optimization meta learning

被引:7
|
作者
Yao, Xiao [1 ]
Zhu, Jianlong [1 ]
Huo, Guanying [1 ]
Xu, Ning [1 ]
Liu, Xiaofeng [1 ]
Zhang, Ce [1 ]
机构
[1] Hohai Univ, Coll IoT Engn, Changzhou, Jiangsu, Peoples R China
关键词
Meta learning; Few-shot learning; Training instability; Multi-stage loss optimization;
D O I
10.1007/s13042-021-01316-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model Agnostic Meta Learning (MAML) has become the most representative meta learning algorithm to solve few-shot learning problems. This paper mainly discusses MAML framework, focusing on the key problem of solving few-shot learning through meta learning. However, MAML is sensitive to the base model for the inner loop, and training instability occur during the training process, resulting in an increase of the training difficulty of the model in the process of training and verification process, causing degradation of model performance. In order to solve these problems, we propose a multi-stage loss optimization meta-learning algorithm. By discussing a learning mechanism for inner and outer loops, it improves the training stability and accelerates the convergence for the model. The generalization ability of MAML has been enhanced.
引用
收藏
页码:2349 / 2363
页数:15
相关论文
共 50 条
  • [21] Task-Robust Model-Agnostic Meta-Learning
    Collins, Liam
    Mokhtari, Aryan
    Shakkottai, Sanjay
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [22] Automated precision localization of peripherally inserted central catheter tip through model-agnostic multi-stage networks
    Park, Subin
    Cha, Yoon Ki
    Park, Soyoung
    Chung, Myung Jin
    Kim, Kyungsu
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 144
  • [23] Uncertainty in Model-Agnostic Meta-Learning using Variational Inference
    Nguyen, Cuong
    Do, Thanh-Toan
    Carneiro, Gustavo
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 3079 - 3089
  • [24] Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection
    Awal, Md Rabiul
    Lee, Roy Ka-Wei
    Tanwar, Eshaan
    Garg, Tanmay
    Chakraborty, Tanmoy
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1086 - 1095
  • [25] Model-Agnostic Meta-Learning for Relation Classification with Limited Supervision
    Obamuyide, Abiola
    Vlachos, Andreas
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5873 - 5879
  • [26] On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning
    Fallah, Alireza
    Georgiev, Kristian
    Mokhtari, Aryan
    Ozdaglar, Asuman
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
    Finn, Chelsea
    Abbeel, Pieter
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [28] Federated Model-Agnostic Meta-Learning With Sharpness-Aware Minimization for Internet of Things Optimization
    Wu, Qingtao
    Zhang, Yong
    Liu, Muhua
    Zhu, Junlong
    Zheng, Ruijuan
    Zhang, Mingchuan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19): : 31317 - 31330
  • [29] Personalized Federated Learning with Theoretical Guarantees: A Model-Agnostic Meta-Learning Approach
    Fallah, Alireza
    Mokhtari, Aryan
    Ozdaglar, Asuman
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [30] MAME : Model-Agnostic Meta-Exploration
    Gurumurthy, Swaminathan
    Kumar, Sumit
    Sycara, Katia
    CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100