Accurate use of label dependency in multi-label text classification through the lens of causality

Cited by: 3
Authors
Fan, Caoyun [1]
Chen, Wenqing [2]
Tian, Jidong [1]
Li, Yitian [1]
He, Hao [1]
Jin, Yaohui [1]
Affiliations
[1] Shanghai Jiao Tong University, AI Institute, MoE Key Laboratory of Artificial Intelligence, Shanghai, People's Republic of China
[2] Sun Yat-sen University, School of Software Engineering, Guangzhou, People's Republic of China
Keywords
Multi-label text classification; Label dependency; Correlation shortcut; Counterfactual de-bias
DOI
10.1007/s10489-023-04623-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-Label Text Classification (MLTC) aims to assign the most relevant labels to each given text. Existing methods demonstrate that label dependency can help to improve the model's performance. However, the introduction of label dependency may cause the model to suffer from unwanted prediction bias. In this study, we attribute the bias to the model's misuse of label dependency, i.e., the model tends to utilize the correlation shortcut in label dependency rather than fusing text information and label dependency for prediction. Motivated by causal inference, we propose a CounterFactual Text Classifier (CFTC) to eliminate the correlation bias and make causality-based predictions. Specifically, our CFTC first adopts the predict-then-modify backbone to extract precise label information embedded in label dependency, then blocks the correlation shortcut through the counterfactual de-bias technique with the help of the human causal graph. Experimental results on three datasets demonstrate that our CFTC significantly outperforms the baselines and effectively eliminates the correlation bias in datasets.
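The abstract does not give the CFTC equations, so the following is only a generic illustration of the two ideas it names, not the authors' implementation. It assumes a simple additive fusion of text logits and label-dependency logits, and a counterfactual in which the text evidence is blanked out so that only the correlation shortcut remains. The function names, the co-occurrence matrix, and the `alpha` weight are all hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dependency_logits(first_pass_probs, cooccurrence):
    # Score each label j from the first-pass probabilities of the other labels.
    # cooccurrence[i][j] is a hypothetical weight for "label i suggests label j".
    n = len(first_pass_probs)
    return [
        sum(cooccurrence[i][j] * first_pass_probs[i] for i in range(n) if i != j)
        for j in range(n)
    ]


def factual_logits(text_logits, cooccurrence):
    # Predict: first-pass label probabilities from the text alone.
    first_pass = [sigmoid(z) for z in text_logits]
    # Modify: add label-dependency evidence (assumed additive fusion).
    dep = dependency_logits(first_pass, cooccurrence)
    return [t + d for t, d in zip(text_logits, dep)]


def debiased_logits(text_logits, cooccurrence, alpha=1.0):
    first_pass = [sigmoid(z) for z in text_logits]
    dep = dependency_logits(first_pass, cooccurrence)
    factual = [t + d for t, d in zip(text_logits, dep)]
    # Counterfactual: text logits blanked to 0, so only the shortcut contributes.
    counterfactual = [0.0 + d for d in dep]
    # Subtract the shortcut-only prediction, scaled by alpha.
    return [f - alpha * c for f, c in zip(factual, counterfactual)]


if __name__ == "__main__":
    text = [2.0, -1.0]               # text supports label 0, not label 1
    co = [[0.0, 3.0], [3.0, 0.0]]    # the two labels co-occur strongly
    print(factual_logits(text, co))
    print(debiased_logits(text, co, alpha=0.5))
```

With additive fusion, `alpha=1.0` reduces the result to the text logits alone, so in this toy setting an intermediate `alpha` is what keeps some dependency information while damping the shortcut. A nonlinear fusion would behave differently.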
Pages: 21841-21857
Number of pages: 17