Multimodal Framework for Long-Tailed Recognition

被引:0
|
作者
Chen, Jian [1 ]
Zhao, Jianyin [1 ]
Gu, Jiaojiao [1 ]
Qin, Yufeng [1 ]
Ji, Hong [1 ]
机构
[1] Naval Aviat Univ, Coll Coastal Def Force, Yantai 264001, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 22期
关键词
long-tailed recognition; vision-language models; imbalanced classification;
D O I
10.3390/app142210572
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Long-tailed data distribution (i.e., minority classes occupy most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features, while in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Beyond the Label Distribution Prior for Long-Tailed Recognition
    Li, Ming
    Cao, Liujuan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT IV, 2023, 14089 : 792 - 803
  • [32] Balanced self-distillation for long-tailed recognition
    Ren, Ning
    Li, Xiaosong
    Wu, Yanxia
    Fu, Yan
    KNOWLEDGE-BASED SYSTEMS, 2024, 290
  • [33] Self Supervision to Distillation for Long-Tailed Visual Recognition
    Li, Tianhao
    Wang, Limin
    Wu, Gangshan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 610 - 619
  • [34] Balanced Contrastive Learning for Long-Tailed Visual Recognition
    Zhu, Jianggang
    Wang, Zheng
    Chen, Jingjing
    Chen, Yi-Ping Phoebe
    Jiang, Yu-Gang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6898 - 6907
  • [35] Inverse Image Frequency for Long-Tailed Image Recognition
    Alexandridis, Konstantinos Panagiotis
    Luo, Shan
    Nguyen, Anh
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5721 - 5736
  • [36] Exploring the auxiliary learning for long-tailed visual recognition
    Zhang, Junjie
    Liu, Lingqiao
    Wang, Peng
    Zhang, Jian
    NEUROCOMPUTING, 2021, 449 : 303 - 314
  • [37] The long-tailed rat
    Gold, AG
    ASIAN FOLKLORE STUDIES, 2004, 63 (02): : 243 - 265
  • [38] LONG-TAILED PAIR
    SCROGGIE, MG
    WIRELESS WORLD, 1968, 74 (1396): : 369 - &
  • [39] Key Point Sensitive Loss for Long-Tailed Visual Recognition
    Li, Mengke
    Cheung, Yiu-Ming
    Hu, Zhikai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4812 - 4825
  • [40] Mixing Global and Local Features for Long-Tailed Expression Recognition
    Zhou, Jiaxiong
    Li, Jian
    Yan, Yubo
    Wu, Lei
    Xu, Hao
    INFORMATION, 2023, 14 (02)