Distilling knowledge from multiple foundation models for zero-shot image classification

被引:0
|
作者
Yin, Siqi [1 ]
Jiang, Lifan [1 ]
机构
[1] Shandong Univ Sci & Technol, Sch Comp Sci & Technol, Qingdao, Shandong, Peoples R China
来源
PLOS ONE | 2024年 / 19卷 / 09期
关键词
D O I
10.1371/journal.pone.0310730
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Zero-shot image classification enables the recognition of new categories without requiring additional training data, thereby enhancing the model's generalization capability when specific training are unavailable. This paper introduces a zero-shot image classification framework to recognize new categories that are unseen during training by distilling knowledge from foundation models. Specifically, we first employ ChatGPT and DALL-E to synthesize reference images of unseen categories from text prompts. Then, the test image is aligned with text and reference images using CLIP and DINO to calculate the logits. Finally, the predicted logits are aggregated according to their confidence to produce the final prediction. Experiments are conducted on multiple datasets, including MNIST, SVHN, CIFAR-10, CIFAR-100, and TinyImageNet. The results demonstrate that our method can significantly improve classification accuracy compared to previous approaches, achieving AUROC scores of over 96% across all test datasets. Our code is available at https://github.com/1134112149/MICW-ZIC.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Hybrid Feature Approach for Enhancing Zero-Shot Image Classification
    Khanam, Shaista
    Sonar, Poonam N.
    ARTIFICIAL INTELLIGENCE AND KNOWLEDGE PROCESSING, AIKP 2024, 2025, 2228 : 239 - 251
  • [32] Generalized Zero-Shot Image Classification Based on Reconstruction Contrast
    Xu R.
    Shao S.
    Cao W.
    Liu B.
    Tao D.
    Liu W.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (12): : 1078 - 1088
  • [33] Zero-shot personalization of speech foundation models for depressed mood monitoring
    Gerczuk, Maurice
    Triantafyllopoulos, Andreas
    Amiriparian, Shahin
    Kathan, Alexander
    Bauer, Jonathan
    Berking, Matthias
    Schuller, Bjorn W.
    PATTERNS, 2023, 4 (11):
  • [34] Text-to-Image Diffusion Models are Zero-Shot Classifiers
    Clark, Kevin
    Jaini, Priyank
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [35] Deep Multiple Instance Learning for Zero-Shot Image Tagging
    Rahman, Shafin
    Khan, Salman
    COMPUTER VISION - ACCV 2018, PT I, 2019, 11361 : 530 - 546
  • [36] Underwater Sonar Image Classification with Image Disentanglement Reconstruction and Zero-Shot Learning
    Peng, Ye
    Li, Houpu
    Zhang, Wenwen
    Zhu, Junhui
    Liu, Lei
    Zhai, Guojun
    REMOTE SENSING, 2025, 17 (01)
  • [37] Zero-shot Relation Classification from Side Information
    Gong, Jiaying
    Eldardiry, Hoda
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 576 - 585
  • [38] Prompt-based Zero-shot Text Classification with Conceptual Knowledge
    Wang, Yuqi
    Wang, Wei
    Chen, Qi
    Huang, Kaizhu
    Nguyen, Anh
    De, Suparna
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-SRW 2023, VOL 4, 2023, : 30 - 38
  • [39] Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer
    Zhuo, Junbao
    Zhu, Yan
    Cui, Shuhao
    Wang, Shuhui
    Ma, Bin
    Huang, Qingming
    Wei, Xiaoming
    Wei, Xiaolin
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5761 - 5772
  • [40] A Lightweight Framework With Knowledge Distillation for Zero-Shot Mars Scene Classification
    Tan, Xiaomeng
    Xi, Bobo
    Xu, Haitao
    Li, Jiaojiao
    Li, Yunsong
    Xue, Changbin
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62