Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification

Authors
Audibert, Alexandre [1]
Gauffre, Aurelien [1]
Amini, Massih-Reza [1]
Affiliations
[1] Univ Grenoble Alpes, CNRS, LIG, 150 Pl Torrent, F-38401 St Martin Dheres, France
Keywords
Supervised Contrastive Learning; Multi-Label Text Classification
DOI
10.1007/978-3-031-70368-3_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning an effective representation in multi-label text classification (MLTC) is a significant challenge in natural language processing. This challenge arises from the inherent complexity of the task, which is shaped by two key factors: the intricate connections between labels and the widespread long-tailed distribution of the data. To address this challenge, one potential approach involves integrating supervised contrastive learning with classical supervised loss functions. Although contrastive learning has shown remarkable performance in multi-class classification, its impact in the multi-label framework has not been thoroughly investigated. In this paper, we conduct an in-depth study of supervised contrastive learning and its influence on representation in the MLTC context. We emphasize the importance of considering long-tailed data distributions to build a robust representation space, and we identify two critical challenges associated with contrastive learning: the "lack of positives" and the "attraction-repulsion imbalance". Building on these insights, we introduce a novel contrastive loss function for MLTC. It attains Micro-F1 scores that either match or surpass those obtained with other frequently employed loss functions, and demonstrates a significant improvement in Macro-F1 scores across four multi-label datasets.
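The abstract reports both Micro-F1 and Macro-F1. As a minimal sketch of why the two can diverge on long-tailed data, the snippet below computes both from binary label matrices using the standard definitions. It is not taken from the paper: the function names and the toy data (one frequent "head" label, one rare "tail" label) are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def f1(tp: int, fp: int, fn: int) -> float:
    """F1 from raw counts; defined as 0.0 when there is nothing to score."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def micro_macro_f1(
    y_true: List[List[int]], y_pred: List[List[int]]
) -> Tuple[float, float]:
    """Return (micro_f1, macro_f1) for binary multi-label indicator matrices."""
    n_labels = len(y_true[0])
    counts = []
    for j in range(n_labels):
        tp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 1)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 0 and p[j] == 1)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t[j] == 1 and p[j] == 0)
        counts.append((tp, fp, fn))
    # Micro: pool the counts over all labels, then compute a single F1.
    micro = f1(
        sum(c[0] for c in counts),
        sum(c[1] for c in counts),
        sum(c[2] for c in counts),
    )
    # Macro: compute F1 per label, then take the unweighted mean.
    macro = sum(f1(*c) for c in counts) / n_labels
    return micro, macro


# Hypothetical toy data: label 0 is frequent (head), label 1 is rare (tail).
y_true = [[1, 0], [1, 0], [1, 0], [1, 1]]
# A classifier that never predicts the tail label.
y_pred = [[1, 0], [1, 0], [1, 0], [1, 0]]

micro, macro = micro_macro_f1(y_true, y_pred)
print(f"micro-F1 = {micro:.3f}, macro-F1 = {macro:.3f}")
```

In this toy case the missed tail label barely lowers Micro-F1 (8/9) but halves Macro-F1 (0.5), because Macro-F1 weights every label equally regardless of frequency. This is why Macro-F1 is the more sensitive of the two metrics to performance on rare labels.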
Pages: 245-261
Number of pages: 17