Evaluating Thesaurus-Based Topic Models

被引:1
|
作者
Loukachevitch, Natalia [1 ]
Ivanov, Kirill [1 ]
机构
[1] Lomonosov Moscow State Univ, Moscow, Russia
基金
俄罗斯基础研究基金会;
关键词
Topic models; Thesaurus; Content-based analysis;
D O I
10.1007/978-3-319-91947-8_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study thesaurus-based topic models and evaluate them from the point of view of topic coherence. Thesaurus based topic models enhance the scores of related terms found in the same text, which means that the model encourages these terms to be on the same topics. We evaluate various variants of such models. First, we carry out a manual evaluation of the obtained topics. Second, we study the possibility to use the collected manual data for evaluating new variants of thesaurus-based models, propose a method and select the best its parameters in cross-validation. Third, we apply the created evaluation method to estimate the influence of word frequencies on adding thesaurus relations for generating coherent topic models.
引用
收藏
页码:364 / 376
页数:13
相关论文
共 50 条
  • [11] Omiotis: A Thesaurus-Based Measure of Text Relatedness
    Tsatsaronis, George
    Varlamis, Iraklis
    Vazirgiannis, Michalis
    Norvag, Kietil
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2009, 5782 : 742 - +
  • [12] Thesaurus-Based Methods for Assessment of Text Complexity in Russian
    Solovyev, Valery
    Ivanov, Vladimir
    Solnyshkina, Marina
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 152 - 166
  • [13] Building thesaurus-based knowledge graph based on schema layer
    Bo Qiao
    Kui Fang
    Yiming Chen
    Xinghui Zhu
    Cluster Computing, 2017, 20 : 81 - 91
  • [14] Thesaurus-based methods for mapping contents of publication sets
    Kevin W. Boyack
    Scientometrics, 2017, 111 : 1141 - 1155
  • [15] Building thesaurus-based knowledge graph based on schema layer
    Qiao, Bo
    Fang, Kui
    Chen, Yiming
    Zhu, Xinghui
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (01): : 81 - 91
  • [16] Assessing thesaurus-based annotations for semantic search applications
    Eckert, Kai
    Pfeffer, Magnus
    Stuckenschmidt, Heiner
    International Journal of Metadata, Semantics and Ontologies, 2008, 3 (01) : 53 - 67
  • [17] Thesaurus-based methods for mapping contents of publication sets
    Boyack, Kevin W.
    SCIENTOMETRICS, 2017, 111 (02) : 1141 - 1155
  • [18] Circumstances of trauma and accidents in children: a thesaurus-based survey
    Sejourne, Claire
    Philbois, Olivier
    Vercherin, Paul
    Patural, Hugues
    SANTE PUBLIQUE, 2016, 28 (05): : 581 - 590
  • [19] Using a Thesaurus-Based Approach for the Categorisation of Web Sites
    Pudaruth, Sameerchand
    Ankiah, Youven
    Sembhoo, Keshav
    2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 624 - 628
  • [20] A GRAPHICAL THESAURUS-BASED INFORMATION-RETRIEVAL SYSTEM
    MCMATH, CF
    TAMARU, RS
    RADA, R
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1989, 31 (02): : 121 - 147