Hierarchical text classification with multi-label contrastive learning and KNN

被引:11
|
作者
Zhang, Jun [1 ]
Li, Yubin [1 ]
Shen, Fanfan [2 ]
He, Yueshun [1 ]
Tan, Hai [2 ]
He, Yanxiang [3 ]
机构
[1] East China Univ Technol, Sch Informat Engn, Nanchang 330013, Peoples R China
[2] Nanjing Audit Univ, Sch Informat Engn, Nanjing 211815, Peoples R China
[3] Wuhan Univ, Comp Sch, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Hierarchical text classification; Label hierarchy; Multi -label contrastive learning; KNN;
D O I
10.1016/j.neucom.2024.127323
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the complicated label hierarchy, hierarchical text classification (HTC) has emerged as a challenging subtask in the realm of multi -label text classification. Existing methods enhance the quality of text representations by contrastive learning, but this supervised contrastive learning is designed for single -label setting and has two main limitations. On one hand, sample pairs with completely identical labels which should be treated as positive pairs are ignored. On the other hand, a simple pair is deemed as an absolutely positive or negative pair, which lacks consideration about the situation where sample pairs share some labels while having labels unique to each sample. Therefore, we propose a method combining multi -label contrastive learning with KNN (MLCL-KNN) for HTC. The proposed multi -label contrastive learning method can make text representations of sample pairs having more shared labels closer and separate those with no labels in common. During inference, we employ KNN to retrieve several neighbor samples and regard their labels as additional prediction, which is interpolated into the model output to further improve the performance of MLCL-KNN. Compared with the strongest baseline, MLCL-KNN achieves average improvements of 0.31%, 0.76%, 0.83%, and 0.43% on Micro -F1, Macro -F1, accuracy, and HiF respectively, which demonstrates its effectiveness.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Multi-label disaster text classification via supervised contrastive learning for social media data
    Xie, Shaorong
    Hou, Chunning
    Yu, Hang
    Zhang, Zhenyu
    Luo, Xiangfeng
    Zhu, Nengjun
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 104
  • [22] An Interactive Fusion Model for Hierarchical Multi-label Text Classification
    Zhao, Xiuhao
    Li, Zhao
    Zhang, Xianming
    Wang, Jibin
    Chen, Tong
    Ju, Zhengyu
    Wang, Canjun
    Zhang, Chao
    Zhan, Yiming
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 168 - 178
  • [23] HMATC: Hierarchical multi-label Arabic text classification model using machine learning
    Aljedani, Nawal
    Alotaibi, Reem
    Taileb, Mounira
    EGYPTIAN INFORMATICS JOURNAL, 2021, 22 (03) : 225 - 237
  • [24] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [25] Effective Multi-Label Active Learning for Text Classification
    Yang, Bishan
    Sun, Jian-Tao
    Wang, Tengjiao
    Chen, Zheng
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 917 - 925
  • [26] Active Learning Strategies for Multi-Label Text Classification
    Esuli, Andrea
    Sebastiani, Fabrizio
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 102 - +
  • [27] Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework
    Zhang, Shu
    Xu, Ran
    Xiong, Caiming
    Ramaiah, Chetan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16639 - 16648
  • [28] Multi-Label Supervised Contrastive Learning
    Zhang, Pingyue
    Wu, Mengyue
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16786 - 16793
  • [29] Variational Continuous Label Distribution Learning for Multi-Label Text Classification
    Zhao, Xingyu
    An, Yuexuan
    Xu, Ning
    Geng, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2716 - 2729
  • [30] Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification
    Chatterjee, Soumya
    Maheshwari, Ayush
    Ramakrishnan, Ganesh
    Jagarlapudi, Saketha Nath
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2829 - 2841