Less is more: A closer look at semantic-based few-shot learning

被引:0
|
作者
Zhou, Chunpeng [1 ]
Yu, Zhi [2 ]
Yuan, Xilu [1 ]
Zhou, Sheng [2 ]
Bu, Jiajun [1 ]
Wang, Haishuai [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Zhejiang Key Lab Accessible Percept & Intelligent, Hangzhou 310000, Peoples R China
[2] Zhejiang Univ, Sch Software Technol, Ningbo 310027, Peoples R China
[3] Shanghai Artificial Intelligence Lab, Shanghai 200125, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Few-shot learning; Multi-modal learning; Feature representation; Image classification;
D O I
10.1016/j.inffus.2024.102672
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot Learning (FSL) aims to learn and distinguish new categories from a scant number of available samples, presenting a significant challenge in the realm of deep learning. Recent researchers have sought to leverage the additional semantic or linguistic information of scarce categories with a pre-trained language model to facilitate learning, thus partially alleviating the problem of insufficient supervision signals. Nonetheless, the full potential of the semantic information and pre-trained language model have been underestimated in the few-shot learning till now, resulting in limited performance enhancements. To address this, we propose a straightforward and efficacious framework for few-shot learning tasks, specifically designed to exploit the semantic information and language model. Specifically, we explicitly harness the zero-shot capability of the pre-trained language model with learnable prompts. And we directly add the visual feature with the textual feature for inference without the intricate designed fusion modules as in prior studies. Additionally, we apply the self-ensemble and distillation to further enhance performance. Extensive experiments conducted across four widely used few-shot datasets demonstrate that our simple framework achieves impressive results. Particularly noteworthy is its outstanding performance in the 1-shot learning task, surpassing the current state-of-the-art by an average of 3.3% in classification accuracy. Our code will be available at https://github.com/zhouchunpong/ SimpleFewShot.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Variational Few-Shot Learning
    Zhang, Jian
    Zhao, Chenglong
    Ni, Bingbing
    Xu, Minghao
    Yang, Xiaokang
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1685 - 1694
  • [42] Fractal Few-Shot Learning
    Zhou, Fobao
    Huang, Wenkai
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16353 - 16367
  • [43] Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
    Oh, Jaehoon
    Kim, Sungnyun
    Ho, Namgyu
    Kim, Jin-Hwa
    Song, Hwanjun
    Yun, Se-Young
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] Interventional Few-Shot Learning
    Yue, Zhongqi
    Zhang, Hanwang
    Sun, Qianru
    Hua, Xian-Sheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [45] Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
    Tian, Pinzhuo
    Wu, Zhangkai
    Qi, Lei
    Wang, Lei
    Shi, Yinghuan
    Gao, Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12087 - 12094
  • [46] Quaternion-Valued Correlation Learning for Few-Shot Semantic Segmentation
    Zheng, Zewen
    Huang, Guoheng
    Yuan, Xiaochen
    Pun, Chi-Man
    Liu, Hongrui
    Ling, Wing-Kuen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2102 - 2115
  • [47] VSA: Adaptive Visual and Semantic Guided Attention on Few-Shot Learning
    Chai, Jin
    Chen, Yisheng
    Shen, Weinan
    Zhang, Tong
    Chen, C. L. Philip
    ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 280 - 292
  • [48] SIN: Semantic Inference Network for Few-Shot Streaming Label Learning
    Wang, Zhen
    Liu, Liu
    Duan, Yiqun
    Tao, Dacheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 9952 - 9965
  • [49] Learning Meta-class Memory for Few-Shot Semantic Segmentation
    Wu, Zhonghua
    Shi, Xiangxi
    Lin, Guosheng
    Cai, Jianfei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 497 - 506
  • [50] Global-Local Interplay in Semantic Alignment for Few-Shot Learning
    Hao, Fusheng
    He, Fengxiang
    Cheng, Jun
    Tao, Dacheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4351 - 4363