An Efficient Framework by Topic Model for Multi-label Text Classification

被引:0
|
作者
Sun, Wei [1 ]
Ran, Xiangying [1 ]
Luo, Xiangyang [1 ]
Wang, Chongjun [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Dept Comp Sci & Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-label text classification; topic model; label correlations;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most existing multi-label text classification (MLTC) approaches only exploit label correlations from label pairwises or label chains. However, in the real world, features of instances have much importance for classification. In this paper, we propose a simple but efficient framework for MLTC called Hybrid Latent Dirichlet Allocation Multi-Label (HLDAML). To be specific, the topics of text features (i.e., a concrete description of documents) and the topics of label sets (i.e., a summarization of documents) can be obtained from training data by topic model before building models for multi-label classification. After that, hybrid topics can be used in existing approaches to improve the performance of MLTC. Experiments on several benchmark datasets demonstrate that the proposed framework is general and effective when taking text features and label sets into consideration simultaneously. It is also worth mentioning that we construct a new multi-label dataset called Parkinson about diagnosing parkinson disease by Traditional Chinese Medicine.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] An efficient stacking model with label selection for multi-label classification
    Chen, Yan-Nan
    Weng, Wei
    Wu, Shun-Xiang
    Chen, Bai-Hua
    Fan, Yu-Ling
    Liu, Jing-Hua
    APPLIED INTELLIGENCE, 2021, 51 (01) : 308 - 325
  • [22] Minimum Classification Error Rate Training of Supervised Topic Mixture Model for Multi-label Text Categorization
    He, Zhiyang
    Lv, Ping
    Wu, Ji
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 39 - +
  • [23] Supervised topic models for multi-label classification
    Li, Ximing
    Ouyang, Jihong
    Zhou, Xiaotang
    NEUROCOMPUTING, 2015, 149 : 811 - 819
  • [24] Multi-label text classification model based on semantic embedding
    Yan Danfeng
    Ke Nan
    Gu Chao
    Cui Jianfei
    Ding Yiqi
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2019, 26 (01) : 95 - 104
  • [25] An Interactive Fusion Model for Hierarchical Multi-label Text Classification
    Zhao, Xiuhao
    Li, Zhao
    Zhang, Xianming
    Wang, Jibin
    Chen, Tong
    Ju, Zhengyu
    Wang, Canjun
    Zhang, Chao
    Zhan, Yiming
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT II, 2022, 13552 : 168 - 178
  • [26] Subset Labeled LDA: A Topic Model for Extreme Multi-label Classification
    Papanikolaou, Yannis
    Tsoumakas, Grigorios
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2018), 2018, 11031 : 152 - 162
  • [27] Multi-label Classification via Label-Topic Pairs
    Chen, Gang
    Peng, Yue
    Wang, Chongjun
    WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 32 - 44
  • [28] Multi-Label Text Classification model integrating Label Attention and Historical Attention
    Sun, Guoying
    Cheng, Yanan
    Dong, Fangzhou
    Wang, Luhua
    Zhao, Dong
    Zhang, Zhaoxin
    Tong, Xiaojun
    KNOWLEDGE-BASED SYSTEMS, 2024, 296
  • [29] LABEL-AWARE TEXT REPRESENTATION FOR MULTI-LABEL TEXT CLASSIFICATION
    Guo, Hao
    Li, Xiangyang
    Zhang, Lei
    Liu, Jia
    Chen, Wei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7728 - 7732
  • [30] Metalearning Applied to Multi-label Text Classification
    dos Santos, Vania Batista
    de Campos Merschmann, Luiz Henrique
    PROCEEDINGS OF 16TH BRAZILIAN SYMPOSIUM ON INFORMATION SYSTEMS ON DIGITAL TRANSFORMATION AND INNOVATION, SBSI 2020, 2020,