A Topic Modeling Based on Prompt Learning

被引:2
|
作者
Qiu, Mingjie [1 ,2 ]
Yang, Wenzhong [2 ,3 ]
Wei, Fuyuan [2 ,3 ]
Chen, Mingliang [1 ,2 ]
机构
[1] Xinjiang Univ, Sch Software, Urumqi 830091, Peoples R China
[2] Xinjiang Univ, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830017, Peoples R China
[3] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
topic modeling; prompt learning; prompt word;
D O I
10.3390/electronics13163212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the existing topic models are based on the Latent Dirichlet Allocation (LDA) or the variational autoencoder (VAE), but these methods have inherent flaws. The a priori assumptions of LDA on documents may not match the actual distribution of the data, and VAE suffers from information loss during the mapping and reconstruction process, which tends to affect the effectiveness of topic modeling. To this end, we propose a Prompt Topic Model (PTM) utilizing prompt learning for topic modeling, which circumvents the structural limitations of LDA and VAE, thereby overcoming the deficiencies of traditional topic models. Additionally, we develop a prompt word selection method that enhances PTM's efficiency in performing the topic modeling task. Experimental results demonstrate that the PTM surpasses traditional topic models on three public datasets. Ablation experiments further validate that our proposed prompt word selection method enhances the PTM's effectiveness in topic modeling.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] LDA Based Topic Modeling of Journal Abstracts
    Anupriya, P.
    Karpagavalli, S.
    ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [42] Semantic Text Alignment based on Topic Modeling
    Le, Huong T.
    Pham, Lam N.
    Nguyen, Duy D.
    Nguyen, Son V.
    Nguyen, An N.
    2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, : 67 - 72
  • [43] Vector-Quantization-Based Topic Modeling
    Gupta, Amulya
    Zhang, Zhu
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2021, 12 (03)
  • [44] Arabic Sentiment Analysis based on Topic Modeling
    Bekkali, Mohammed
    Lachkar, Abdelmonaime
    PROCEEDINGS OF THE SECOND CONFERENCE OF THE MOROCCAN CLASSIFICATION SOCIETY: NEW CHALLENGES IN DATA SCIENCES (SMC '2019), 2019, : 117 - 122
  • [45] Topic Modeling Based Text Summarization Approach
    Yu, Shusi
    Wang, Wei
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING APPLICATIONS (CSEA 2015), 2015, : 203 - 207
  • [46] Improving author verification based on topic modeling
    Potha, Nektaria
    Stamatatos, Efstathios
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2019, 70 (10) : 1074 - 1088
  • [47] Lexical cohesion based topic modeling for summarization
    Ercan, Gonenc
    Cicekli, Ilyas
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2008, 4919 : 582 - 592
  • [48] TOPIC MODELING OF NEWS BASED ON SPARK MLLIB
    Gui, Jing
    Wang, Qi
    2017 14TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2017, : 224 - 228
  • [49] A New Sentence-Based Interpretative Topic Modeling and Automatic Topic Labeling
    Kozbagarov, Olzhas
    Mussabayev, Rustam
    Mladenovic, Nenad
    SYMMETRY-BASEL, 2021, 13 (05):
  • [50] Learning Methods for Dynamic Topic Modeling in Automated Behavior Analysis
    Isupova, Olga
    Kuzin, Danil
    Mihaylova, Lyudmila
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (09) : 3980 - 3993