Semi-supervised Meta-learning for Cross-domain Few-shot Intent Classification

被引:0
|
作者
Li, Judith Yue [1 ]
Zhang, Jiong [2 ]
机构
[1] Salesforce Res, Palo Alto, CA 94301 USA
[2] LinkedIn AI, Sunnyvale, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning aims to optimize the model's capability to generalize to new tasks and domains. Lacking a data-efficient way to create meta training tasks has prevented the application of meta-learning to the real-world few shot learning scenarios. Recent studies have proposed unsupervised approaches to create meta-training tasks from unlabeled data for free, e.g., the SMLMT method (Bansal et al., 2020a) constructs unsupervised multiclass classification tasks from the unlabeled text by randomly masking words in the sentence and let the meta learner choose which word to fill in the blank. This study proposes a semi-supervised meta-learning approach that incorporates both the representation power of large pre-trained language models and the generalization capability of prototypical networks enhanced by SMLMT. The semi-supervised meta training approach avoids overfitting prototypical networks on a small number of labeled training examples and quickly learns cross-domain task-specific representation only from a few supporting examples. By incorporating SMLMT with prototypical networks, the meta learner generalizes better to unseen domains and gains higher accuracy on out-of-scope examples without the heavy lifting of pre-training. We observe significant improvement in few-shot generalization after training only a few epochs on the intent classification tasks evaluated in a multi-domain setting.
引用
收藏
页码:67 / 75
页数:9
相关论文
共 50 条
  • [31] Cross-Domain Few-Shot Hyperspectral Image Classification With Cross-Modal Alignment and Supervised Contrastive Learning
    Li, Zhaokui
    Zhang, Chenyang
    Wang, Yan
    Li, Wei
    Du, Qian
    Fang, Zhuoqun
    Chen, Yushi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 19
  • [32] A Cross-Domain Semi-Supervised Zero-Shot Learning Model for the Classification of Hyperspectral Images
    Pallavi Ranjan
    Gautam Gupta
    Journal of the Indian Society of Remote Sensing, 2023, 51 : 1991 - 2005
  • [33] Few-shot fault diagnosis of turnout switch machine based on flexible semi-supervised meta-learning network
    He, Yiling
    He, Deqiang
    Lao, Zhenpeng
    Jin, Zhenzhen
    Miao, Jian
    Lai, Zhiping
    Chen, Yanjun
    KNOWLEDGE-BASED SYSTEMS, 2024, 294
  • [34] Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
    Oh, Jaehoon
    Kim, Sungnyun
    Ho, Namgyu
    Kim, Jin-Hwa
    Song, Hwanjun
    Yun, Se-Young
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [35] A Cross-Domain Semi-Supervised Zero-Shot Learning Model for the Classification of Hyperspectral Images
    Ranjan, Pallavi
    Gupta, Gautam
    JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2023, 51 (10) : 1991 - 2005
  • [36] Causal Meta-Transfer Learning for Cross-Domain Few-Shot Hyperspectral Image Classification
    Cheng, Yuhu
    Zhang, Wei
    Wang, Haoyu
    Wang, Xuesong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [37] Few-Shot Directed Meta-Learning for Image Classification
    Ouyang, Jihong
    Duan, Ganghai
    Liu, Siguang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (01)
  • [38] UNSUPERVISED AND SEMI-SUPERVISED FEW-SHOT ACOUSTIC EVENT CLASSIFICATION
    Huang, Hsin-Ping
    Puvvada, Krishna C.
    Sun, Ming
    Wang, Chao
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 331 - 335
  • [39] Unsupervised Meta-Learning for Few-Shot Image Classification
    Khodadadeh, Siavash
    Boloni, Ladislau
    Shah, Mubarak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [40] Contrastive Meta-Learning for Few-shot Node Classification
    Wang, Song
    Tan, Zhen
    Liu, Huan
    Li, Jundong
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 2386 - 2397