Multimodal Framework for Long-Tailed Recognition

被引:0
|
作者
Chen, Jian [1 ]
Zhao, Jianyin [1 ]
Gu, Jiaojiao [1 ]
Qin, Yufeng [1 ]
Ji, Hong [1 ]
机构
[1] Naval Aviat Univ, Coll Coastal Def Force, Yantai 264001, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 22期
关键词
long-tailed recognition; vision-language models; imbalanced classification;
D O I
10.3390/app142210572
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Long-tailed data distribution (i.e., minority classes occupy most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features, while in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] A dual progressive strategy for long-tailed visual recognition
    Hong Liang
    Guoqing Cao
    Mingwen Shao
    Qian Zhang
    Machine Vision and Applications, 2024, 35
  • [22] Class-Balanced Regularization for Long-Tailed Recognition
    Xu, Yuge
    Lyu, Chuanlong
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [23] Domain Balancing: Face Recognition on Long-Tailed Domains
    Cao, Dong
    Zhu, Xiangyu
    Huang, Xingyu
    Guo, Jianzhu
    Lei, Zhen
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5670 - 5678
  • [24] Long-tailed recognition via key attribute learning
    Fu, Yu
    Han, Jungong
    Chang, Xiang
    Chen, Changrui
    Shang, Changjing
    Shen, Qiang
    NEUROCOMPUTING, 2025, 627
  • [25] Targeted Supervised Contrastive Learning for Long-Tailed Recognition
    Li, Tianhong
    Cao, Peng
    Yuan, Yuan
    Fan, Lijie
    Yang, Yuzhe
    Feris, Rogerio
    Indyk, Piotr
    Katabi, Dina
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6908 - 6918
  • [26] A dual progressive strategy for long-tailed visual recognition
    Liang, Hong
    Cao, Guoqing
    Shao, Mingwen
    Zhang, Qian
    MACHINE VISION AND APPLICATIONS, 2024, 35 (01)
  • [27] Local pseudo-attributes for long-tailed recognition
    Kim, Dong-Jin
    Ke, Tsung-Wei
    Yu, Stella X.
    PATTERN RECOGNITION LETTERS, 2023, 172 : 51 - 57
  • [28] Towards Effective Collaborative Learning in Long-Tailed Recognition
    Xu, Zhengzhuo
    Chai, Zenghao
    Xu, Chengyin
    Yuan, Chun
    Yang, Haiqin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3754 - 3764
  • [29] Nested Collaborative Learning for Long-Tailed Visual Recognition
    Li, Jun
    Tan, Zichang
    Wan, Jun
    Lei, Zhen
    Guo, Guodong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6939 - 6948
  • [30] Probabilistic Contrastive Learning for Long-Tailed Visual Recognition
    Du, Chaoqun
    Wang, Yulin
    Song, Shiji
    Huang, Gao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (09) : 5890 - 5904