Intra-inter channel attention for few-shot classification

被引:0
|
作者
Yang L. [1 ]
Zhang T. [1 ]
Wang Y. [1 ]
Gu X. [2 ]
机构
[1] Key Laboratory of Optoelectronic Technique & System of Ministry of Education, Chongqing University, Chongqing
[2] School of Electrical Engineering, Chongqing University of Science & Technology, Chongqing
关键词
channel attention; deep learning; few-shot classification; meta-learning; prototypical network;
D O I
10.37188/OPE.20233121.3145
中图分类号
学科分类号
摘要
As only one or a few training samples are used for few-shot classification tasks, the features extracted via a prototypical network cannot guarantee much discriminative power. Accordingly, this paper proposes an intra-inter channel attention few-shot classification (ICAFSC) method. This method uses an intra-inter channel attention module (ICAM) to calculate channel weights based on an intra-inter distance metric. The module is integrated into the prototypical network to make the embedded features more discriminative. To overcome the problems of overfitting or underfitting when directly learning the ICAM in the few-shot classification's meta-training stage, ICAFSC adds a pre-training stage before the meta-training of the prototypical network. We design adequate classification tasks with a large number of labeled samples to learn optimal parameters of the ICAM in the pre-training stage. Subsequently, in the meta-training and meta-testing stages of the prototypical network, ICAFSC first freezes the parameters of the ICAM to guarantee a stable channel attention relationship. It then achieves few-shot classification experience learning and transfer via meta-training and meta-testing. We conduct 1-shot and 5-shot few-shot classification experiments on the MiniImagenet dataset. The experimental results indicate that, compared to the prototypical network, the proposed ICAFSC method shows improvements of 1.93% and 1.15% for the 1-shot and 5-shot scenarios, respectively. © 2023 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:3145 / 3155
页数:10
相关论文
共 25 条
  • [1] JIANG K, ZHU L, SUN Q D, Joint dual-structural constrained and non-negative analysis representation learning for pattern classification[J], Applied Artificial Intelligence, 37, 1, (2023)
  • [2] AGARWAL N, SONDHI A, CHOPRA K, Et al., Transfer Learning: Survey and Classification[M], Smart Innovations in Communication and Computational Sciences, pp. 145-155, (2020)
  • [3] LI X, SUN Z, XUE J H, A concise review of recent few-shot meta-learning methods[J], Neurocomputing, 456, pp. 463-468, (2021)
  • [4] SONG Y, WANG T, MONDAL S K, A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities[EB/OL]
  • [5] LU J, GONG P, YE J, Et al., Learning from Very Few Samples: a Survey[EB/OL]
  • [6] HOSPEDALES T, ANTONIOU A, MICAELLI P, Meta-learning in neural networks: a survey[J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 44, 9, pp. 5149-5169, (2022)
  • [7] HUISMAN M, RIJN J N, PLAAT A, A survey of deep meta-learning[J], Artificial Intelligence Review, 54, 6, pp. 4483-4541, (2021)
  • [8] SNELL J, SWERSKY K, ZEMEL R, Prototypical networks for few-shot learning[C], Proceedings of the 31st International Conference on Neural Information Processing Systems, pp. 4080-4090, (2017)
  • [9] YAN S P, ZHANG S Y, HE X M, A dual attention network with semantic embedding for few-shot learning[J], Proceedings of the AAAI Conference on Artificial Intelligence, 33, 1, pp. 9079-9086, (2019)
  • [10] LEE S, MOON W, HEO J P, Task discrepancy maximization for fine-grained few-shot classification[C], 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5321-5330