A Topic Modeling Based on Prompt Learning

Cited by: 2
Authors
Qiu, Mingjie [1,2]
Yang, Wenzhong [2,3]
Wei, Fuyuan [2,3]
Chen, Mingliang [1,2]
Affiliations
[1] Xinjiang Univ, Sch Software, Urumqi 830091, Peoples R China
[2] Xinjiang Univ, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830017, Peoples R China
[3] Xinjiang Univ, Sch Informat Sci & Engn, Urumqi 830017, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China
Keywords
topic modeling; prompt learning; prompt word;
DOI
10.3390/electronics13163212
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Most existing topic models are based on Latent Dirichlet Allocation (LDA) or the variational autoencoder (VAE), but both approaches have inherent flaws. LDA's prior assumptions about documents may not match the actual distribution of the data, and the VAE loses information during mapping and reconstruction, which tends to degrade topic modeling performance. To address this, we propose the Prompt Topic Model (PTM), which uses prompt learning for topic modeling and thereby circumvents the structural limitations of LDA and VAE. Additionally, we develop a prompt word selection method that enhances PTM's efficiency in performing the topic modeling task. Experimental results demonstrate that PTM surpasses traditional topic models on three public datasets. Ablation experiments further validate that the proposed prompt word selection method improves PTM's effectiveness in topic modeling.
Pages: 16