Less is more: A closer look at semantic-based few-shot learning

被引：0

作者：

Zhou, Chunpeng ^{[1
]}

Yu, Zhi ^{[2
]}

Yuan, Xilu ^{[1
]}

Zhou, Sheng ^{[2
]}

Bu, Jiajun ^{[1
]}

Wang, Haishuai ^{[1
,3
]}

机构：

[1] Zhejiang Univ, Coll Comp Sci, Zhejiang Key Lab Accessible Percept & Intelligent, Hangzhou 310000, Peoples R China

[2] Zhejiang Univ, Sch Software Technol, Ningbo 310027, Peoples R China

[3] Shanghai Artificial Intelligence Lab, Shanghai 200125, Peoples R China

来源：

INFORMATION FUSION | 2025年 / 114卷

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Few-shot learning; Multi-modal learning; Feature representation; Image classification;

D O I：

10.1016/j.inffus.2024.102672

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot Learning (FSL) aims to learn and distinguish new categories from a scant number of available samples, presenting a significant challenge in the realm of deep learning. Recent researchers have sought to leverage the additional semantic or linguistic information of scarce categories with a pre-trained language model to facilitate learning, thus partially alleviating the problem of insufficient supervision signals. Nonetheless, the full potential of the semantic information and pre-trained language model have been underestimated in the few-shot learning till now, resulting in limited performance enhancements. To address this, we propose a straightforward and efficacious framework for few-shot learning tasks, specifically designed to exploit the semantic information and language model. Specifically, we explicitly harness the zero-shot capability of the pre-trained language model with learnable prompts. And we directly add the visual feature with the textual feature for inference without the intricate designed fusion modules as in prior studies. Additionally, we apply the self-ensemble and distillation to further enhance performance. Extensive experiments conducted across four widely used few-shot datasets demonstrate that our simple framework achieves impressive results. Particularly noteworthy is its outstanding performance in the 1-shot learning task, surpassing the current state-of-the-art by an average of 3.3% in classification accuracy. Our code will be available at https://github.com/zhouchunpong/ SimpleFewShot.

引用

页数：12

共 50 条

[41] Variational Few-Shot Learning
Zhang, Jian
Zhao, Chenglong
Ni, Bingbing
Xu, Minghao
Yang, Xiaokang
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1685 - 1694
[42] Fractal Few-Shot Learning
Zhou, Fobao
Huang, Wenkai
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 16353 - 16367
[43] Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty
Oh, Jaehoon
Kim, Sungnyun
Ho, Namgyu
Kim, Jin-Hwa
Song, Hwanjun
Yun, Se-Young
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[44] Interventional Few-Shot Learning
Yue, Zhongqi
Zhang, Hanwang
Sun, Qianru
Hua, Xian-Sheng
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[45] Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
Tian, Pinzhuo
Wu, Zhangkai
Qi, Lei
Wang, Lei
Shi, Yinghuan
Gao, Yang
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12087 - 12094
[46] Quaternion-Valued Correlation Learning for Few-Shot Semantic Segmentation
Zheng, Zewen
Huang, Guoheng
Yuan, Xiaochen
Pun, Chi-Man
Liu, Hongrui
Ling, Wing-Kuen
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2102 - 2115
[47] VSA: Adaptive Visual and Semantic Guided Attention on Few-Shot Learning
Chai, Jin
Chen, Yisheng
Shen, Weinan
Zhang, Tong
Chen, C. L. Philip
ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 280 - 292
[48] SIN: Semantic Inference Network for Few-Shot Streaming Label Learning
Wang, Zhen
Liu, Liu
Duan, Yiqun
Tao, Dacheng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 9952 - 9965
[49] Learning Meta-class Memory for Few-Shot Semantic Segmentation
Wu, Zhonghua
Shi, Xiangxi
Lin, Guosheng
Cai, Jianfei
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 497 - 506
[50] Global-Local Interplay in Semantic Alignment for Few-Shot Learning
Hao, Fusheng
He, Fengxiang
Cheng, Jun
Tao, Dacheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4351 - 4363

← 1 2 3 4 5 →