Supervised topic models with weighted words: multi-label document classification

被引:0
|
作者
Yue-peng Zou
Ji-hong Ouyang
Xi-ming Li
机构
[1] Jilin University,College of Computer Science and Technology
[2] Jilin University,MOE Key Laboratory of Symbolic Computation and Knowledge Engineering
关键词
Supervised topic model; Multi-label classification; Class frequency; Labeled latent Dirichlet allocation (L-LDA); Dependency-LDA; TP391;
D O I
暂无
中图分类号
学科分类号
摘要
Supervised topic modeling algorithms have been successfully applied to multi-label document classification tasks. Representative models include labeled latent Dirichlet allocation (L-LDA) and dependency-LDA. However, these models neglect the class frequency information of words (i.e., the number of classes where a word has occurred in the training data), which is significant for classification. To address this, we propose a method, namely the class frequency weight (CF-weight), to weight words by considering the class frequency knowledge. This CF-weight is based on the intuition that a word with higher (lower) class frequency will be less (more) discriminative. In this study, the CF-weight is used to improve L-LDA and dependency-LDA. A number of experiments have been conducted on real-world multi-label datasets. Experimental results demonstrate that CF-weight based algorithms are competitive with the existing supervised topic models.
引用
收藏
页码:513 / 523
页数:10
相关论文
共 50 条
  • [31] Robust Multi-Label Semi-Supervised Classification
    Li, Sheng
    Fu, Yun
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 27 - 36
  • [32] SUPERVISED LOW DIMENSIONAL EMBEDDING FOR MULTI-LABEL CLASSIFICATION
    Chen, Zi-Jie
    Hao, Zhi-Feng
    PROCEEDINGS OF 2014 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2014, : 193 - 199
  • [33] Combination of Neural Networks for Multi-label Document Classification
    Lenc, Ladislav
    Kral, Pavel
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 278 - 282
  • [34] Learning Section Weights for Multi-label Document Classification
    Fard, Maziar Moradi
    Bayod, Paula Sorolla
    Motarjem, Kiomars
    Nejadi, Mohammad Alian
    Akhondi, Saber
    Thorne, Camilo
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT II, NLDB 2024, 2024, 14763 : 359 - 366
  • [35] Triplet Transformer Network for Multi-Label Document Classification
    Melsbach, Johannes
    Stahlmann, Sven
    Hirschmeier, Stefan
    Schoder, Detlef
    PROCEEDINGS OF THE 2022 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2022, 2022,
  • [36] Semi-supervised imbalanced multi-label classification with label propagation
    Du, Guodong
    Zhang, Jia
    Zhang, Ning
    Wu, Hanrui
    Wu, Peiliang
    Li, Shaozi
    PATTERN RECOGNITION, 2024, 150
  • [37] Supervised Deep Dictionary Learning for Single Label and Multi-Label Classification
    Singhal, Vanika
    Majumdar, Angshul
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [38] Multi-Label Emotion Tagging for Online News by Supervised Topic Model
    Zhang, Ying
    Su, Lili
    Yang, Zhifan
    Zhao, Xue
    Yuan, Xiaojie
    WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015), 2015, 9313 : 67 - 79
  • [39] Minimum Classification Error Rate Training of Supervised Topic Mixture Model for Multi-label Text Categorization
    He, Zhiyang
    Lv, Ping
    Wu, Ji
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 39 - +
  • [40] Exploiting Label Dependencies for Multi-Label Document Classification Using Transformers
    Fallah, Haytame
    Bruno, Emmanuel
    Bellot, Patrice
    Murisasco, Elisabeth
    PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,