An Efficient Framework by Topic Model for Multi-label Text Classification

Cited by: 0
Authors
Sun, Wei [1]
Ran, Xiangying [1]
Luo, Xiangyang [1]
Wang, Chongjun [1]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Dept Comp Sci & Technol, Nanjing, Peoples R China
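Funding
National Natural Science Foundation of China
Keywords
multi-label text classification; topic model; label correlations
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract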
Most existing multi-label text classification (MLTC) approaches exploit label correlations only through label pairs or label chains. However, in real-world data, the features of instances also matter greatly for classification. In this paper, we propose a simple but efficient framework for MLTC called Hybrid Latent Dirichlet Allocation Multi-Label (HLDAML). Specifically, before any multi-label classifier is built, a topic model is used to extract from the training data the topics of text features (i.e., a concrete description of documents) and the topics of label sets (i.e., a summarization of documents). These hybrid topics can then be used in existing approaches to improve MLTC performance. Experiments on several benchmark datasets demonstrate that the proposed framework is general and effective when text features and label sets are taken into consideration simultaneously. We also construct a new multi-label dataset, called Parkinson, on diagnosing Parkinson's disease with Traditional Chinese Medicine.
Pages: 7
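Illustrative sketch

The abstract describes extracting topics from text features and from label sets with a topic model, then combining them. The following is a minimal sketch of that idea, not the authors' implementation: it assumes a plain collapsed Gibbs sampler for LDA, toy integer-encoded data, and simple concatenation as the way to form "hybrid" topics. The function name `lda_doc_topics`, the hyperparameters, and the toy data are all hypothetical. The abstract does not say how the hybrid topics are fed into existing classifiers, so that step is left out.

```python
import random


def lda_doc_topics(docs, vocab_size, n_topics, n_iter=200,
                   alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA (hypothetical helper).

    docs: list of documents, each a list of integer token ids.
    Returns one smoothed topic distribution per document.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * n_topics for _ in docs]                # counts per (doc, topic)
    topic_word = [[0] * vocab_size for _ in range(n_topics)]  # counts per (topic, token)
    topic_total = [0] * n_topics                              # tokens per topic
    assign = []                                               # topic of every token

    # Random initial topic assignment.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(n_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assign.append(zs)

    # Resample each token's topic from its conditional distribution.
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                z = assign[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1

    # Smoothed document-topic proportions.
    theta = []
    for d, doc in enumerate(docs):
        denom = len(doc) + n_topics * alpha
        theta.append([(doc_topic[d][k] + alpha) / denom
                      for k in range(n_topics)])
    return theta


# Toy training data (assumed): word ids per document and label ids per document.
texts = [[0, 1, 2, 0], [1, 0, 2], [3, 4, 5, 3], [4, 5, 3]]
label_sets = [[0, 1], [0], [2, 3], [2]]

# Topics of text features and topics of label sets, fitted separately.
text_topics = lda_doc_topics(texts, vocab_size=6, n_topics=2)
label_topics = lda_doc_topics(label_sets, vocab_size=4, n_topics=2, seed=1)

# Assumption: "hybrid topics" are formed by concatenating the two vectors.
hybrid_topics = [t + l for t, l in zip(text_topics, label_topics)]
```

Each row of `hybrid_topics` holds one training document's text-topic proportions followed by its label-topic proportions. Label sets are only known for training data, so a real pipeline would need some way to estimate the label-topic part for unseen documents; the abstract does not specify one.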