Few-shot Hierarchical Text Classification with Bidirectional Path Constraint by label weighting

Authors
Zhang, Mingbao [1 ,2 ]
Song, Rui [4 ]
Li, Xiang [1 ,3 ]
Tavares, Adriano [1 ]
Xu, Hao [4 ]
Affiliations
[1] Univ Minho, Braga, Portugal
[2] Neusoft Educ Technol Co Ltd, Shenyang, Peoples R China
[3] Dalian Neusoft Univ Informat, Dalian, Peoples R China
[4] Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
Keywords
Text analysis; Multi-label classification; Few-shot learning; Weakly-supervised learning
DOI
10.1016/j.patrec.2025.01.025
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Hierarchical Text Classification (HTC) organizes candidate labels into a hierarchy and uses one or more paths within that hierarchy as the ground-truth labels. It has been applied to various downstream tasks, e.g., sentiment analysis and harmful text detection. Existing works often rely on data-driven models trained on large-scale datasets. However, creating annotated datasets is labor-intensive and time-consuming. To address this issue, recent work has focused on the few-shot HTC task, where each class has only a few samples, e.g., 5. These approaches perform classification at each layer separately and leverage the prompt learning capability of pre-trained models such as BERT. However, we find that these methods always neglect the inter-layer relationships. To solve this problem, we propose a new model called Bidirectional Path Constraint by Label Weighting (BPc-LW). Its basic idea is to use a pre-defined label embedding matrix and a feed-forward neural network to propagate information between layers, together with a bidirectional label weighting method that constrains the predictions of each layer to lie on the same path in the label hierarchy. In addition, we employ a contrastive learning-based method to enhance the discriminative capacity of the hierarchical embeddings. We compare our proposed method with recent few-shot HTC baseline models across 3 benchmark datasets, and the experimental results demonstrate the effectiveness of BPc-LW.
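The abstract does not give the formulas behind the bidirectional label weighting, so the following is only a minimal sketch of the general idea of nudging per-layer predictions onto one path. It is not the BPc-LW implementation. The two-layer hierarchy, the label names, the probabilities, and the helper names (`normalize`, `bidirectional_reweight`, `argmax`) are all assumptions made for illustration. The sketch weights each child by its parent's probability (top-down) and each parent by its strongest child (bottom-up).

```python
def normalize(scores):
    """Rescale non-negative scores so they sum to 1."""
    total = sum(scores.values())
    if total <= 0:
        raise ValueError("scores must have a positive sum")
    return {label: value / total for label, value in scores.items()}


def bidirectional_reweight(parent_probs, child_probs, parent_of):
    """Illustrative two-layer reweighting (hypothetical, not the paper's method)."""
    # Top-down: a child label is only as plausible as its parent allows.
    child_scores = {
        child: prob * parent_probs[parent_of[child]]
        for child, prob in child_probs.items()
    }
    # Bottom-up: a parent label is supported by its most probable child.
    best_child = {}
    for child, prob in child_probs.items():
        parent = parent_of[child]
        best_child[parent] = max(best_child.get(parent, 0.0), prob)
    parent_scores = {
        parent: prob * best_child.get(parent, 0.0)
        for parent, prob in parent_probs.items()
    }
    return normalize(parent_scores), normalize(child_scores)


def argmax(scores):
    """Return the label with the highest score."""
    return max(scores, key=scores.get)


# Hypothetical hierarchy and per-layer classifier outputs.
parent_of = {
    "physics": "science",
    "biology": "science",
    "football": "sports",
    "tennis": "sports",
}
parent_probs = {"science": 0.55, "sports": 0.45}
child_probs = {"physics": 0.30, "biology": 0.10, "football": 0.20, "tennis": 0.40}

# Independent per-layer argmax picks labels from two different paths.
naive = (argmax(parent_probs), argmax(child_probs))

new_parent_probs, new_child_probs = bidirectional_reweight(
    parent_probs, child_probs, parent_of
)
constrained = (argmax(new_parent_probs), argmax(new_child_probs))

print("independent:", naive)
print("reweighted: ", constrained)
```

With these made-up numbers, the independent predictions are `science` and `tennis`, which do not lie on one path, while the reweighted predictions are `sports` and `tennis`. In this two-layer toy setting the reweighted argmaxes agree whenever there are no ties. Deeper hierarchies and learned weights, as described in the abstract, would need more than this sketch covers.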
Pages: 81-88
Number of pages: 8