Distilling knowledge from multiple foundation models for zero-shot image classification

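Authors
Yin, Siqi [1]
Jiang, Lifan [1]
Affiliations
[1] Shandong Univ Sci & Technol, Sch Comp Sci & Technol, Qingdao, Shandong, Peoples R China
Source
PLOS ONE | 2024 / Vol. 19 / Issue 09
Keywords
DOI
10.1371/journal.pone.0310730
Chinese Library Classification number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07; 0710; 09;
Abstract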
Zero-shot image classification enables the recognition of new categories without requiring additional training data, thereby enhancing the model's generalization capability when category-specific training data are unavailable. This paper introduces a zero-shot image classification framework that recognizes categories unseen during training by distilling knowledge from foundation models. Specifically, we first employ ChatGPT and DALL-E to synthesize reference images of unseen categories from text prompts. Then, the test image is aligned with the text and the reference images using CLIP and DINO to calculate the logits. Finally, the predicted logits are aggregated according to their confidence to produce the final prediction. Experiments are conducted on multiple datasets, including MNIST, SVHN, CIFAR-10, CIFAR-100, and TinyImageNet. The results demonstrate that our method can significantly improve classification accuracy compared to previous approaches, achieving AUROC scores of over 96% across all test datasets. Our code is available at https://github.com/1134112149/MICW-ZIC.
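The final aggregation step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how confidence is measured or how the logits are combined, so everything below is an assumption made for illustration: confidence is taken as the maximum softmax probability of each source, the sources are fused by a confidence-weighted average of probabilities, and the function name `aggregate_by_confidence` and the example logit values are hypothetical. This is not the authors' implementation.

```python
import math


def softmax(logits):
    """Convert a list of logits to probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_by_confidence(logit_sets):
    """Fuse several per-class logit vectors into one probability vector.

    Assumption (not from the paper): each source's confidence is its
    maximum softmax probability, and sources are combined by a
    confidence-weighted average of their probabilities.
    """
    probs = [softmax(logits) for logits in logit_sets]
    confidences = [max(p) for p in probs]
    total_conf = sum(confidences)
    weights = [c / total_conf for c in confidences]
    num_classes = len(probs[0])
    fused = [
        sum(w * p[k] for w, p in zip(weights, probs))
        for k in range(num_classes)
    ]
    return fused, weights


# Hypothetical logits for one test image over three candidate classes.
clip_text_logits = [2.0, 0.5, 0.1]   # test image vs. class text prompts
clip_image_logits = [1.0, 1.2, 0.3]  # test image vs. reference images (CLIP)
dino_image_logits = [3.0, 0.2, 0.1]  # test image vs. reference images (DINO)

fused, weights = aggregate_by_confidence(
    [clip_text_logits, clip_image_logits, dino_image_logits]
)
predicted_class = max(range(len(fused)), key=fused.__getitem__)
```

In this sketch a source that is nearly undecided between classes (here the second one) receives a smaller weight, so it has less influence on the fused prediction than a source with a sharply peaked distribution.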
Pages: 12