Label prompt for multi-label text classification

被引:19
|
作者
Song, Rui [1 ]
Liu, Zelong [2 ]
Chen, Xingbing [3 ]
An, Haining [2 ]
Zhang, Zhiqi [4 ]
Wang, Xiaoguang [5 ]
Xu, Hao [6 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
[2] Jilin Univ, Coll Construct Engn, Changchun, Peoples R China
[3] Jilin Univ, Coll Elect Sci & Engn, Changchun, Peoples R China
[4] Jilin Univ, Coll Sotfware, Changchun, Peoples R China
[5] Jilin Univ, Publ Comp Educ & Res Ctr, Changchun, Peoples R China
[6] Jilin Univ, Coll Comp Sci & Technol, Key Lab Symbol Comp & Knowledge Engn, Minist Educ, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label text classification; BERT; Pormpt learning; Masked language model;
D O I
10.1007/s10489-022-03896-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label text classification has been widely concerned by scholars due to its contribution to practical applications. One of the key challenges in multi-label text classification is how to extract and leverage the correlation among labels. However, it is quite challenging to directly model the correlations among labels in a complex and unknown label space. In this paper, we propose a Label Prompt Multi-label Text Classification model (LP-MTC), which is inspired by the idea of prompt learning of pre-trained language model. Specifically, we design a set of templates for multi-label text classification, integrate labels into the input of the pre-trained language model, and jointly optimize by Masked Language Models (MLM). In this way, the correlations among labels as well as semantic information between labels and text with the help of self-attention can be captured, and thus the model performance is effectively improved. Extensive empirical experiments on multiple datasets demonstrate the effectiveness of our method. Compared with BERT, LP-MTC improved 3.4% micro-F1 on average over the four public datasets.
引用
收藏
页码:8761 / 8775
页数:15
相关论文
共 50 条
  • [31] Label Correlation Based Graph Convolutional Network for Multi-label Text Classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Manh-Tran Tien
    Van-Hau Nguyen
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [32] Label-representative graph convolutional network for multi-label text classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Minh-Hieu Pham
    Van-Quyet Nguyen
    Van-Hau Nguyen
    Applied Intelligence, 2023, 53 : 14759 - 14774
  • [33] Multi-label text classification with latent word-wise label information
    Ziheng Chen
    Jiangtao Ren
    Applied Intelligence, 2021, 51 : 966 - 979
  • [34] Research of multi-label text classification based on label attention and correlation networks
    Yuan, Ling
    Xu, Xinyi
    Sun, Ping
    Yu, Hai ping
    Wei, Yin Zhen
    Zhou, Jun jie
    PLOS ONE, 2024, 19 (09):
  • [35] Multi-label text classification with latent word-wise label information
    Chen, Ziheng
    Ren, Jiangtao
    APPLIED INTELLIGENCE, 2021, 51 (02) : 966 - 979
  • [36] Multi-label classification of legal text based on label embedding and capsule network
    Zhe Chen
    Shang Li
    Lin Ye
    Hongli Zhang
    Applied Intelligence, 2023, 53 : 6873 - 6886
  • [37] Clinical Multi-label Free Text Classification by Exploiting Disease Label Relation
    Zhao, Rui-Wei
    Li, Guo-Zheng
    Liu, Jia-Ming
    Wang, Xiao
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [38] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4425 - 4436
  • [39] Multi-label Classification with Clustering for Image and Text Categorization
    Nasierding, Gulisong
    Sajjanhar, Atul
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 869 - 874
  • [40] Multi-label legal text classification with BiLSTM and attention
    Enamoto, Liriam
    Santos, Andre R. A. S.
    Maia, Ricardo
    Weigang, Li
    Rocha Filho, Geraldo P.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (04) : 369 - 378