Hybrid Topic Cluster Models for Social Healthcare Data

被引:0
|
作者
Prasad, K. Rajendra [1 ]
Mohammed, Moulana [2 ]
Noorullah, R. M. [2 ]
机构
[1] Inst Aeronaut Engn, Dept CSE, Hyderabad, India
[2] Koneru Lakshmaiah Univ, Dept CSE, Guntur, Andhra Pradesh, India
关键词
Multi-viewpoint based metric; traditional topic models; hybrid topic models; topic visualization; health tendency;
D O I
10.14569/IJACSA.2019.0101168
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Social media and in particular, microblogs are becoming an important data source for disease surveillance, behavioral medicine, and public healthcare. Topic Models are widely used in microblog analytics for analyzing and integrating the textual data within a corpus. This paper uses health tweets as microblogs and attempts the health data clustering by topic models. The traditional topic models, such as Latent Semantic Indexing (LSI), Probabilistic Latent Schematic Indexing (PLSI), Latent Dirichlet Allocation (LDA), Non-negative Matrix Factorization (NMF), and integer Joint NMF(intJNMF) methods are used for health data clustering; however, they are intractable to assess the number of health topic clusters. Proper visualizations are essential to extract the information from and identifying trends of data, as they may include thousands of documents and millions of words. For visualization of topic clouds and health tendency in the document collection, we present hybrid topic models by integrating traditional topic models with VAT. Proposed hybrid topic models viz., Visual Non-negative Matrix Factorization (VNMF), Visual Latent Dirichlet Allocation (VLDA), Visual Probabilistic Latent Schematic Indexing (VPLSI) and Visual Latent Schematic Indexing (VLSI) are promising methods for accessing the health tendency and visualization of topic clusters from benchmarked and Twitter datasets. Evaluation and comparison of hybrid topic models are presented in the experimental section for demonstrating the efficiency with different distance measures, include, Euclidean distance, cosine distance, and multi-viewpoint cosine similarity.
引用
收藏
页码:490 / 506
页数:17
相关论文
共 50 条
  • [1] High performance social data computing with development of intelligent topic models for healthcare
    Narasimhulu, K.
    Abarna, K. T. Meena
    MICROPROCESSORS AND MICROSYSTEMS, 2022, 95
  • [2] Visual topic models for healthcare data clustering
    K. Rajendra Prasad
    Moulana Mohammed
    R. M. Noorullah
    Evolutionary Intelligence, 2021, 14 : 545 - 562
  • [3] Visual topic models for healthcare data clustering
    Prasad, K. Rajendra
    Mohammed, Moulana
    Noorullah, R. M.
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 545 - 562
  • [4] Correction to: Visual topic models for healthcare data clustering
    K. Rajendra Prasad
    Moulana Mohammed
    R. M. Noorullah
    Evolutionary Intelligence, 2021, 14 (2) : 563 - 565
  • [5] Combining topic models and social networks for chat data mining
    Tuulos, VH
    Tirri, H
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 206 - 213
  • [6] Topic Models for Unsupervised Cluster Matching
    Iwata, Tomoharu
    Hirao, Tsutomu
    Ueda, Naonori
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (04) : 786 - 795
  • [7] Aggregated topic models for increasing social media topic coherence
    Blair, Stuart J.
    Bi, Yaxin
    Mulvenna, Maurice D.
    APPLIED INTELLIGENCE, 2020, 50 (01) : 138 - 156
  • [8] Aggregated topic models for increasing social media topic coherence
    Stuart J. Blair
    Yaxin Bi
    Maurice D. Mulvenna
    Applied Intelligence, 2020, 50 : 138 - 156
  • [9] THE CLUSTER-CLUSTER CORRELATION IN HYBRID MODELS
    JING, YP
    MO, HJ
    BORNER, G
    FANG, LZ
    ASTROPHYSICAL JOURNAL, 1993, 411 (02): : 450 - 454
  • [10] A Cluster Guided Topic Model for Social Query Expansion
    Zhao, Wenyu
    Zhou, Dong
    DATA SCIENCE, PT II, 2017, 728 : 66 - 77