Accurate use of label dependency in multi-label text classification through the lens of causality

被引:3
|
作者
Fan, Caoyun [1 ]
Chen, Wenqing [2 ]
Tian, Jidong [1 ]
Li, Yitian [1 ]
He, Hao [1 ]
Jin, Yaohui [1 ]
机构
[1] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[2] Sun Yat Sen Univ, Sch Software Engn, Guangzhou, Peoples R China
关键词
Multi-label text classification; Label dependency; Correlation shortcut; Counterfactual de-bias;
D O I
10.1007/s10489-023-04623-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Label Text Classifiction (MLTC) aims to assign the most relevant labels to each given text. Existing methods demonstrate that label dependency can help to improve the model's performance. However, the introduction of label dependency may cause the model to suffer from unwanted prediction bias. In this study, we attribute the bias to the model's misuse of label dependency, i.e., the model tends to utilize the correlation shortcut in label dependency rather than fusing text information and label dependency for prediction. Motivated by causal inference, we propose a CounterFactual Text Classifier (CFTC) to eliminate the correlation bias, and make causality-based predictions. Specifically, our CFTC first adopts the predict-then-modify backbone to extract precise label information embedded in label dependency, then blocks the correlation shortcut through the counterfactual de-bias technique with the help of the human causal graph. Experimental results on three datasets demonstrate that our CFTC significantly outperforms the baselines and effectively eliminates the correlation bias in datasets.
引用
收藏
页码:21841 / 21857
页数:17
相关论文
共 50 条
  • [41] Research of multi-label text classification based on label attention and correlation networks
    Yuan, Ling
    Xu, Xinyi
    Sun, Ping
    Yu, Hai ping
    Wei, Yin Zhen
    Zhou, Jun jie
    PLOS ONE, 2024, 19 (09):
  • [42] Multi-label text classification with latent word-wise label information
    Chen, Ziheng
    Ren, Jiangtao
    APPLIED INTELLIGENCE, 2021, 51 (02) : 966 - 979
  • [43] Multi-label classification of legal text based on label embedding and capsule network
    Zhe Chen
    Shang Li
    Lin Ye
    Hongli Zhang
    Applied Intelligence, 2023, 53 : 6873 - 6886
  • [44] Clinical Multi-label Free Text Classification by Exploiting Disease Label Relation
    Zhao, Rui-Wei
    Li, Guo-Zheng
    Liu, Jia-Ming
    Wang, Xiao
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [45] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4425 - 4436
  • [46] Multi-label Classification with Clustering for Image and Text Categorization
    Nasierding, Gulisong
    Sajjanhar, Atul
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 869 - 874
  • [47] Multi-label Text Classification for Public Procurement in Spanish
    Navas-Loro, Maria
    Garijo, Daniel
    Corcho, Oscar
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2022, (69): : 73 - 82
  • [48] Multi-label Text Classification with Deep Neural Networks
    Chen, Yun
    Xiao, Bo
    Lin, Zhiqing
    Dai, Cheng
    Li, Zuochao
    Yang, Liping
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 409 - 413
  • [49] Multi-label legal text classification with BiLSTM and attention
    Enamoto, Liriam
    Santos, Andre R. A. S.
    Maia, Ricardo
    Weigang, Li
    Rocha Filho, Geraldo P.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2022, 68 (04) : 369 - 378
  • [50] Review and Prospect of Multi-Label Text Classification Research
    Zhang, Wenfeng
    Xi, Xuefeng
    Cui, Zhiming
    Zou, Yichen
    Luan, Jinquan
    Computer Engineering and Applications, 2023, 59 (18) : 28 - 48