AMEMD-FSL: fuse attention mechanism and earth mover's distance metric network to deep learning for few-shot image recognition

被引:1
|
作者
Liang, Yong [1 ]
Chen, Zetao [1 ]
Cui, Qi [1 ]
Li, Xinhai [1 ]
Lin, Daoqian [1 ]
Tan, Junwen [1 ]
机构
[1] Guilin Univ Technol, Coll Mech & Control Engn, Guilin, Peoples R China
关键词
attention mechanism; earth mover's distance; image recognition; few-shot; deep learning;
D O I
10.1117/1.JEI.32.6.063035
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In computer vision, image recognition is one of the classic tasks. Currently, with the foundation of big data and advanced hardware, deep learning has achieved high accuracy. However, deep learning often fails to perform well when faced with a small number of samples. Therefore, few-shot learning has become a key technology to solve this problem. The learning paradigm of few-shot learning is different from that of deep learning. It aims to learn a universal representation from multiple training categories, used for recognition in new categories. Each few-shot learning training instance consists of a group of images and an unlabeled sample. The goal is to enable the model to perform well in recognizing new categories. To achieve this, the model needs to extract representative and highly generalizable features that enable the correct recognition of new category samples. To address the problem of small sample space being unable to describe enough dataset's semantic features, we propose the attention mechanism and earth mover's distance for few-shot learning (AMEMD-FSL) method. First, we fuse the attention mechanism (AM) to deep learning to help the model extract more semantically rich features. Then we use the earth mover's distance (EMD) metric method to calculate the distance between samples, enabling better classification. Finally, we combine the deep-learning residual network and AMEMD to perform few-shot learning. We validate our algorithm on the Caltech-UCSD Birds-200-2011 dataset and the few-shot public dataset mini-ImageNet, which comes from the DeepMind team. The experimental results demonstrate that we have proposed an end-to-end and effective method in the field of few-shot image classification.
引用
收藏
页数:16
相关论文
共 35 条
  • [1] Hyperspectral Image Few-Shot Classification Network Based on the Earth Mover's Distance
    Sun, Jiaxing
    Shen, Xiaobo
    Sun, Quansen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [2] An Modified Earth Mover's Distance for Few-Shot Image Classification
    Jin, Zhiyu
    Tang, Zhuohe
    Yan, Jintao
    2022 PROGNOSTICS AND HEALTH MANAGEMENT CONFERENCE, PHM-LONDON 2022, 2022, : 405 - 409
  • [3] DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning
    Zhang, Chi
    Cai, Yujun
    Lin, Guosheng
    Shen, Chunhua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 5632 - 5648
  • [4] Multi-distance metric network for few-shot learning
    Gao, Farong
    Cai, Lijie
    Yang, Zhangyi
    Song, Shiji
    Wu, Cheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2495 - 2506
  • [5] Multi-distance metric network for few-shot learning
    Farong Gao
    Lijie Cai
    Zhangyi Yang
    Shiji Song
    Cheng Wu
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2495 - 2506
  • [6] RACP: A network with attention corrected prototype for few-shot speaker recognition using indefinite distance metric
    Wang, Xingmei
    Meng, Jiaxiang
    Wen, Bin
    Xue, Fuzhao
    NEUROCOMPUTING, 2022, 490 : 283 - 294
  • [7] HMFN-FSL: Heterogeneous Metric Fusion Network-Based Few-Shot Learning for Crop Disease Recognition
    Yan, Wenbo
    Feng, Quan
    Yang, Sen
    Zhang, Jianhua
    Yang, Wanxia
    AGRONOMY-BASEL, 2023, 13 (12):
  • [8] Multi-level Metric Learning for Few-Shot Image Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 243 - 254
  • [9] Few-Shot Hyperspectral Image Classification With Deep Fuzzy Metric Learning
    Tang, Haojin
    Zhang, Chao
    Tang, Dong
    Lin, Xin
    Yang, Xiaofei
    Xie, Weixin
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [10] Deep metric learning for few-shot image classification: A Review of recent developments
    Li, Xiaoxu
    Yang, Xiaochen
    Ma, Zhanyu
    Xue, Jing-Hao
    PATTERN RECOGNITION, 2023, 138