Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification

Authors
Audibert, Alexandre [1]
Gauffre, Aurelien [1]
Amini, Massih-Reza [1]
Affiliations
[1] Univ Grenoble Alpes, CNRS, LIG, 150 Pl Torrent, F-38401 St Martin Dheres, France
Keywords
Supervised Contrastive Learning; Multi-Label Text Classification
DOI
10.1007/978-3-031-70368-3_15
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Learning an effective representation in multi-label text classification (MLTC) is a significant challenge in natural language processing. This challenge arises from the inherent complexity of the task, which is shaped by two key factors: the intricate connections between labels and the widespread long-tailed distribution of the data. One potential approach to this challenge is to combine supervised contrastive learning with classical supervised loss functions. Although contrastive learning has shown remarkable performance in multi-class classification, its impact in the multi-label framework has not been thoroughly investigated. In this paper, we conduct an in-depth study of supervised contrastive learning and its influence on representation in the MLTC context. We emphasize the importance of considering long-tailed data distributions to build a robust representation space, and we identify two critical challenges associated with contrastive learning: the "lack of positives" and the "attraction-repulsion imbalance". Building on these insights, we introduce a novel contrastive loss function for MLTC. It attains Micro-F1 scores that either match or surpass those obtained with other frequently employed loss functions, and it demonstrates a significant improvement in Macro-F1 scores across four multi-label datasets.
Pages: 245-261
Number of pages: 17