DeepEMD: Differentiable Earth Mover's Distance for Few-Shot Learning

被引:55
|
作者
Zhang, Chi [1 ]
Cai, Yujun [2 ]
Lin, Guosheng [3 ]
Shen, Chunhua [4 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Inst Media Innovat, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[4] Zhejiang Univ, Hangzhou 310027, Zhejiang, Peoples R China
基金
新加坡国家研究基金会;
关键词
Task analysis; Earth; Training; Optimal matching; Optimization; Neural networks; Costs; Few-shot classification; meta learning; metric learning;
D O I
10.1109/TPAMI.2022.3217373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we develop methods for few-shot image classification from a new perspective of optimal matching between image regions. We employ the Earth Mover's Distance (EMD) as a metric to compute a structural distance between dense image representations to determine image relevance. The EMD generates the optimal matching flows between structural elements that have the minimum matching cost, which is used to calculate the image distance for classification. To generate the important weights of elements in the EMD formulation, we design a cross-reference mechanism, which can effectively alleviate the adverse impact caused by the cluttered background and large intra-class appearance variations. To implement $k$k-shot classification, we propose to learn a structured fully connected layer that can directly classify dense image representations with the EMD. Based on the implicit function theorem, the EMD can be inserted as a layer into the network for end-to-end training. Our extensive experiments validate the effectiveness of our algorithm which outperforms state-of-the-art methods by a significant margin on five widely used few-shot classification benchmarks, namely, miniImageNet, tieredImageNet, Fewshot-CIFAR100 (FC100), Caltech-UCSD Birds-200-2011 (CUB), and CIFAR-FewShot (CIFAR-FS). We also demonstrate the effectiveness of our method on the image retrieval task in our experiments.
引用
收藏
页码:5632 / 5648
页数:17
相关论文
共 50 条
  • [1] An Modified Earth Mover's Distance for Few-Shot Image Classification
    Jin, Zhiyu
    Tang, Zhuohe
    Yan, Jintao
    2022 PROGNOSTICS AND HEALTH MANAGEMENT CONFERENCE, PHM-LONDON 2022, 2022, : 405 - 409
  • [2] Hyperspectral Image Few-Shot Classification Network Based on the Earth Mover's Distance
    Sun, Jiaxing
    Shen, Xiaobo
    Sun, Quansen
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] AMEMD-FSL: fuse attention mechanism and earth mover's distance metric network to deep learning for few-shot image recognition
    Liang, Yong
    Chen, Zetao
    Cui, Qi
    Li, Xinhai
    Lin, Daoqian
    Tan, Junwen
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (06)
  • [4] DiffKendall: A Novel Approach for Few-Shot Learning with Differentiable Kendall's Rank Correlation
    Zheng, Kaipeng
    Zhang, Huishuai
    Huang, Weiran
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [5] Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
    Tian, Pinzhuo
    Wu, Zhangkai
    Qi, Lei
    Wang, Lei
    Shi, Yinghuan
    Gao, Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12087 - 12094
  • [6] Few-shot regression with differentiable reference model
    Shi, Peng
    Huang, Guoyan
    He, Hongdou
    Zhao, Guyu
    Hao, Xiaobing
    Huang, Yifang
    INFORMATION SCIENCES, 2024, 658
  • [7] Few-Shot Few-Shot Learning and the role of Spatial Attention
    Lifchitz, Yann
    Avrithis, Yannis
    Picard, Sylvaine
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2693 - 2700
  • [8] Multi-distance metric network for few-shot learning
    Gao, Farong
    Cai, Lijie
    Yang, Zhangyi
    Song, Shiji
    Wu, Cheng
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (09) : 2495 - 2506
  • [9] Multi-distance metric network for few-shot learning
    Farong Gao
    Lijie Cai
    Zhangyi Yang
    Shiji Song
    Cheng Wu
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2495 - 2506
  • [10] Cross-Domain Few-Shot Fine-Grained Classification Based on Local-Global Semantic Consistency and Earth Mover's Distance
    Chen, Tong
    Liu, Jianming
    Wei, Haitao
    Liu, Xuhua
    Li, Chunyang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14866 : 286 - 297