Multimodal Framework for Long-Tailed Recognition

Cited by: 0
Authors
Chen, Jian [1]
Zhao, Jianyin [1]
Gu, Jiaojiao [1]
Qin, Yufeng [1]
Ji, Hong [1]
Affiliations
[1] Naval Aviat Univ, Coll Coastal Def Force, Yantai 264001, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 22
Keywords
long-tailed recognition; vision-language models; imbalanced classification
DOI
10.3390/app142210572
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Long-tailed data distribution (i.e., a few classes account for most of the data, while most classes have very few samples) is a common problem in image classification. In this paper, we propose a novel two-stage multimodal framework for long-tailed data recognition. In the first stage, long-tailed data are used for visual-semantic contrastive learning to obtain good features; in the second stage, class-balanced data are used for classifier training. The proposed framework leverages the advantages of multimodal models and mitigates the problem of class imbalance in long-tailed data recognition. Experimental results demonstrate that the proposed framework achieves competitive performance on the CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018 datasets for image classification.
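The contrast between long-tailed data and class-balanced data in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The exponential count profile, the function names `long_tailed_counts` and `class_balanced_sample`, and the parameters `max_count` and `imbalance_ratio` are assumptions made for illustration; the abstract does not say how the class-balanced data are obtained. The sampler shown picks a class uniformly at random and then an instance within that class, which is one common way to build a class-balanced training stream.

```python
import random
from collections import Counter, defaultdict


def long_tailed_counts(num_classes, max_count, imbalance_ratio):
    """Per-class sample counts that decay exponentially from head to tail.

    Assumed profile: class 0 keeps max_count samples and the last class
    keeps roughly max_count / imbalance_ratio samples.
    """
    return [
        int(max_count * (1.0 / imbalance_ratio) ** (i / (num_classes - 1)))
        for i in range(num_classes)
    ]


def class_balanced_sample(labels, num_samples, rng):
    """Draw dataset indices so that every class is equally likely."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    classes = sorted(by_class)
    picks = []
    for _ in range(num_samples):
        chosen_class = rng.choice(classes)                 # uniform over classes
        picks.append(rng.choice(by_class[chosen_class]))   # uniform within class
    return picks


# Hypothetical 10-class dataset with an imbalance ratio of 100.
counts = long_tailed_counts(num_classes=10, max_count=5000, imbalance_ratio=100)
labels = [c for c, n in enumerate(counts) for _ in range(n)]

rng = random.Random(0)
picks = class_balanced_sample(labels, 10000, rng)
balanced_hist = Counter(labels[i] for i in picks)
```

Here `counts` is heavily skewed toward the first classes, whereas `balanced_hist` is close to uniform across classes, because tail-class samples are drawn repeatedly.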
Pages: 14