Hybrid Topic Cluster Models for Social Healthcare Data

被引:0
|
作者
Prasad, K. Rajendra [1 ]
Mohammed, Moulana [2 ]
Noorullah, R. M. [2 ]
机构
[1] Inst Aeronaut Engn, Dept CSE, Hyderabad, India
[2] Koneru Lakshmaiah Univ, Dept CSE, Guntur, Andhra Pradesh, India
关键词
Multi-viewpoint based metric; traditional topic models; hybrid topic models; topic visualization; health tendency;
D O I
10.14569/IJACSA.2019.0101168
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Social media and in particular, microblogs are becoming an important data source for disease surveillance, behavioral medicine, and public healthcare. Topic Models are widely used in microblog analytics for analyzing and integrating the textual data within a corpus. This paper uses health tweets as microblogs and attempts the health data clustering by topic models. The traditional topic models, such as Latent Semantic Indexing (LSI), Probabilistic Latent Schematic Indexing (PLSI), Latent Dirichlet Allocation (LDA), Non-negative Matrix Factorization (NMF), and integer Joint NMF(intJNMF) methods are used for health data clustering; however, they are intractable to assess the number of health topic clusters. Proper visualizations are essential to extract the information from and identifying trends of data, as they may include thousands of documents and millions of words. For visualization of topic clouds and health tendency in the document collection, we present hybrid topic models by integrating traditional topic models with VAT. Proposed hybrid topic models viz., Visual Non-negative Matrix Factorization (VNMF), Visual Latent Dirichlet Allocation (VLDA), Visual Probabilistic Latent Schematic Indexing (VPLSI) and Visual Latent Schematic Indexing (VLSI) are promising methods for accessing the health tendency and visualization of topic clusters from benchmarked and Twitter datasets. Evaluation and comparison of hybrid topic models are presented in the experimental section for demonstrating the efficiency with different distance measures, include, Euclidean distance, cosine distance, and multi-viewpoint cosine similarity.
引用
收藏
页码:490 / 506
页数:17
相关论文
共 50 条
  • [21] Application of dynamic topic models to toxicogenomics data
    Lee, Mikyung
    Liu, Zhichao
    Huang, Ruili
    Tong, Weida
    BMC BIOINFORMATICS, 2016, 17
  • [22] Application of dynamic topic models to toxicogenomics data
    Mikyung Lee
    Zhichao Liu
    Ruili Huang
    Weida Tong
    BMC Bioinformatics, 17
  • [23] Cluster energetics:: Models and data
    Ford, I
    Vehkamäki, H
    Knott, M
    NUCLEATION AND ATMOSPHERIC AEROSOLS 2000, 2000, 534 : 213 - 216
  • [24] Topical Cluster Discovery in Semistructured Healthcare Data
    Costa, Gianni
    Ortale, Riccardo
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 772 - 775
  • [25] Social-Network Analysis Using Topic Models
    Cha, Youngchul
    Cho, Junghoo
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 565 - 574
  • [26] Temporary Topic Models in Social Sciences: A Study on STM
    Kurnaz, Ahmet
    Unver, H. Akin
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [27] Topic-aware social influence propagation models
    Nicola Barbieri
    Francesco Bonchi
    Giuseppe Manco
    Knowledge and Information Systems, 2013, 37 : 555 - 584
  • [28] Dynamic Topic-Noise Models for Social Media
    Churchill, Rob
    Singh, Lisa
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 429 - 443
  • [29] Exploiting Temporal Topic Models in Social Media Retrieval
    Tran, Tuan A.
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 999 - 999
  • [30] Incorporating Popularity in Topic Models for Social Network Analysis
    Cha, Youngchul
    Bi, Bin
    Hsieh, Chu-Cheng
    Cho, Junghoo
    SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, 2013, : 223 - 232