Topic Models: A Tutorial with R

被引:7
|
作者
Richardson, G. Manning [1 ]
Bowers, Janet [2 ]
Woodill, A. John [3 ]
Barr, Joseph R. [2 ]
Gawron, Jean Mark [4 ]
Levine, Richard A. [2 ]
机构
[1] San Diego State Univ, Computat Sci Res Ctr, San Diego, CA 92182 USA
[2] San Diego State Univ, Dept Math & Stat, San Diego, CA 92182 USA
[3] San Diego State Univ, Dept Econ, San Diego, CA 92182 USA
[4] San Diego State Univ, Dept Linguist & Asian Middle Eastern Languages, San Diego, CA 92182 USA
关键词
Probabilistic topic models; latent semantic analysis; microblogging; twitter;
D O I
10.1142/S1793351X14500044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This tutorial presents topic models for organizing and comparing documents. The technique and corresponding discussion focuses on analysis of short text documents, particularly micro-blogs. However, the base topic model and R implementation are generally applicable to text analytics of document databases.
引用
收藏
页码:85 / 98
页数:14
相关论文
共 50 条
  • [1] A Tutorial on Probabilistic Topic Models for Text Data Retrieval and Analysis
    Zhai, ChengXiang
    Geigle, Chase
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1395 - 1397
  • [2] Topicmodels: An R Package for Fitting Topic Models
    Gruen, Bettina
    Hornik, Kurt
    JOURNAL OF STATISTICAL SOFTWARE, 2011, 40 (13): : 1 - 30
  • [3] stm: An R Package for Structural Topic Models
    Roberts, Margaret E.
    Stewart, Brandon M.
    Tingley, Dustin
    JOURNAL OF STATISTICAL SOFTWARE, 2019, 91 (02): : 1 - 40
  • [4] Tutorial: Developing and Deploying Healthcare Predictive Models in R
    Stiglic, Gregor
    2014 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2014, : 363 - 363
  • [5] TUTORIAL: TEACHING AN ADVANCED SIMULATION TOPIC
    Henderson, Shane G.
    Jacobson, Sheldon
    Robinson, Stewart
    2012 WINTER SIMULATION CONFERENCE (WSC), 2012,
  • [6] A Tutorial on RxODE: Simulating Differential Equation Pharmacometric Models in R
    Wang, W.
    Hallow, K. M.
    James, D. A.
    CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY, 2016, 5 (01): : 3 - 10
  • [7] Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R
    Levi Kumle
    Melissa L.-H. Võ
    Dejan Draschkow
    Behavior Research Methods, 2021, 53 : 2528 - 2543
  • [8] Selecting the Number and Labels of Topics in Topic Modeling: A Tutorial
    Weston, Sara J. J.
    Shryock, Ian
    Light, Ryan
    Fisher, Phillip A. A.
    ADVANCES IN METHODS AND PRACTICES IN PSYCHOLOGICAL SCIENCE, 2023, 6 (02)
  • [10] Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R
    Kumle, Leah
    Vo, Melissa L. -H.
    Draschkow, Dejan
    BEHAVIOR RESEARCH METHODS, 2021, 53 (06) : 2528 - 2543