Distilling knowledge from multiple foundation models for zero-shot image classification

Cited by: 0
Authors
Yin, Siqi [1]
Jiang, Lifan [1]
Affiliations
[1] Shandong Univ Sci & Technol, Sch Comp Sci & Technol, Qingdao, Shandong, Peoples R China
Source
PLOS ONE | 2024 / Vol. 19 / Issue 09
DOI
10.1371/journal.pone.0310730
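Chinese Library Classification (CLC)
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences]
Discipline classification codes
07; 0710; 09
Abstract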
Zero-shot image classification enables the recognition of new categories without requiring additional training data, thereby enhancing the model's generalization capability when category-specific training data are unavailable. This paper introduces a zero-shot image classification framework that recognizes categories unseen during training by distilling knowledge from foundation models. Specifically, we first employ ChatGPT and DALL-E to synthesize reference images of unseen categories from text prompts. Then, the test image is aligned with the text and the reference images using CLIP and DINO to calculate the logits. Finally, the predicted logits are aggregated according to their confidence to produce the final prediction. Experiments are conducted on multiple datasets, including MNIST, SVHN, CIFAR-10, CIFAR-100, and TinyImageNet. The results demonstrate that our method can significantly improve classification accuracy compared to previous approaches, achieving AUROC scores of over 96% across all test datasets. Our code is available at https://github.com/1134112149/MICW-ZIC.
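The abstract says the logits from the text branch and the reference-image branch are aggregated according to their confidence, but it does not give the rule. The following is a minimal sketch of one plausible reading, not the authors' implementation: each branch is weighted by its maximum softmax probability. The function names, the weighting rule, and the example logits are all assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_by_confidence(text_logits, image_logits):
    """Fuse two branches, weighting each by its peak probability.

    Assumption: "confidence" is taken to be the maximum softmax
    probability of a branch; the paper may define it differently.
    """
    p_text = softmax(text_logits)
    p_image = softmax(image_logits)
    c_text, c_image = max(p_text), max(p_image)
    w_text = c_text / (c_text + c_image)
    w_image = 1.0 - w_text
    fused = [w_text * a + w_image * b for a, b in zip(p_text, p_image)]
    return fused, fused.index(max(fused))


# Made-up logits for three candidate classes (illustrative only).
text_logits = [2.0, 1.9, 0.1]   # text branch: nearly tied between classes 0 and 1
image_logits = [0.5, 4.0, 0.2]  # image branch: strongly prefers class 1
fused, pred = aggregate_by_confidence(text_logits, image_logits)
print(pred)
```

In this example the text branch is uncertain while the image branch is confident, so the fused prediction follows the image branch. A weighting like this lets whichever model is more certain about a given test image dominate the decision.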
Pages: 12