Conceptualization topic modeling

被引:0
|
作者
Yi-Kun Tang
Xian-Ling Mao
Heyan Huang
Xuewen Shi
Guihua Wen
机构
[1] Beijing Institute of Technology,School of Computer Science and Technology
[2] Minjiang University,Fujian Provincial Key Laboratory of Information Processing and Intelligent Control
[3] South China University of Technology,Department of Computer Science and Technology
来源
关键词
Conceptualization topic modeling; Hierarchical bayesian structure; Conceptualization latent dirichlet allocation; Conceptualization labeled latent dirichlet allocation;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, topic modeling has been widely used to discover the abstract topics in the multimedia field. Most of the existing topic models are based on the assumption of three-layer hierarchical Bayesian structure, i.e. each document is modeled as a probability distribution over topics, and each topic is a probability distribution over words. However, the assumption is not optimal. Intuitively, it’s more reasonable to assume that each topic is a probability distribution over concepts, and then each concept is a probability distribution over words, i.e. adding a latent concept layer between topic layer and word layer in traditional three-layer assumption. In this paper, we verify the proposed assumption by incorporating the new assumption in two representative topic models, and obtain two novel topic models. Extensive experiments were conducted among the proposed models and corresponding baselines, and the results show that the proposed models significantly outperform the baselines in terms of case study and perplexity, which means the new assumption is more reasonable than traditional one.
引用
收藏
页码:3455 / 3471
页数:16
相关论文
共 50 条
  • [1] Conceptualization topic modeling
    Tang, Yi-Kun
    Mao, Xian-Ling
    Huang, Heyan
    Shi, Xuewen
    Wen, Guihua
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 3455 - 3471
  • [2] ENVIRONMENTAL MODELING AND PRODUCT CONCEPTUALIZATION
    DAHLE, RD
    COMPUTER OPERATIONS, 1968, 2 (02): : 5 - &
  • [3] Tackling topic general words in topic modeling
    Xu, Yueshen
    Yin, Yuyu
    Yin, Jianwei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 62 : 124 - 133
  • [4] Four Keys to Topic Interpretability in Topic Modeling
    Mavrin, Andrey
    Filchenkov, Andrey
    Koltcov, Sergei
    ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE (AINL 2018), 2018, 930 : 117 - 129
  • [5] Efficient Correlated Topic Modeling with Topic Embedding
    He, Junxian
    Hu, Zhiting
    Berg-Kirkpatrick, Taylor
    Huang, Ying
    Xing, Eric P.
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 225 - 233
  • [6] Topic Modeling for Analyzing Topic Manipulation Skills
    Hwang, Seok-Ju
    Lee, Yoon-Kyoung
    Kim, Jong-Dae
    Park, Chan-Young
    Kim, Yu-Seop
    INFORMATION, 2021, 12 (09)
  • [7] The Evolution of Topic Modeling
    Churchill, Rob
    Singh, Lisa
    ACM COMPUTING SURVEYS, 2022, 54 (10S)
  • [8] Crime topic modeling
    Kuang D.
    Brantingham P.J.
    Bertozzi A.L.
    Crime Science, 6 (1)
  • [9] Topic modeling ensembles
    Shen, Zhiyong
    Luo, Ping
    Yang, Shengwen
    Shen, Xukun
    HP Laboratories Technical Report, 2010, (158):
  • [10] Federated Topic Modeling
    Jiang, Di
    Song, Yuanfeng
    Tong, Yongxin
    Wu, Xueyang
    Zhao, Weiwei
    Xu, Qian
    Yang, Qiang
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1071 - 1080