Evaluating Thesaurus-Based Topic Models

被引:1
|
作者
Loukachevitch, Natalia [1 ]
Ivanov, Kirill [1 ]
机构
[1] Lomonosov Moscow State Univ, Moscow, Russia
基金
俄罗斯基础研究基金会;
关键词
Topic models; Thesaurus; Content-based analysis;
D O I
10.1007/978-3-319-91947-8_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study thesaurus-based topic models and evaluate them from the point of view of topic coherence. Thesaurus based topic models enhance the scores of related terms found in the same text, which means that the model encourages these terms to be on the same topics. We evaluate various variants of such models. First, we carry out a manual evaluation of the obtained topics. Second, we study the possibility to use the collected manual data for evaluating new variants of thesaurus-based models, propose a method and select the best its parameters in cross-validation. Third, we apply the created evaluation method to estimate the influence of word frequencies on adding thesaurus relations for generating coherent topic models.
引用
收藏
页码:364 / 376
页数:13
相关论文
共 50 条
  • [1] Thesaurus-based disambiguation of gene symbols
    Schijvenaars, BJA
    Mons, B
    Weeber, M
    Schuemie, MJ
    van Mulligen, EM
    Wain, HM
    Kors, JA
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [2] Thesaurus-based disambiguation of gene symbols
    Bob JA Schijvenaars
    Barend Mons
    Marc Weeber
    Martijn J Schuemie
    Erik M van Mulligen
    Hester M Wain
    Jan A Kors
    BMC Bioinformatics, 6
  • [3] THESAURUS-BASED AUTOMATIC BOOK INDEXING
    DILLON, M
    INFORMATION PROCESSING & MANAGEMENT, 1982, 18 (04) : 167 - 178
  • [4] Evaluating unsupervised thesaurus-based labeling of audiovisual content in an archive production environment
    de Boer, Victor
    Ordelman, Roeland J. F.
    Schuurman, Josefien
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2016, 17 (03) : 189 - 201
  • [5] Thesaurus-based ontology on image analysis
    Colantonio, Sara
    Gurevich, Igor
    Martinelli, Massimo
    Salvetti, Ovidio
    Trusova, Yulia
    SEMANTIC MULTIMEDIA, PROCEEDINGS, 2007, 4816 : 113 - +
  • [6] Thesaurus-based Retrieval of Case Law
    Klein, Michel C. A.
    van Steenbergen, Wouter
    Uijttenbroek, Elisabeth M.
    Lodder, Arno R.
    van Harmelen, Frank
    LEGAL KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 152 : 61 - +
  • [7] Evaluation of a thesaurus-based query expansion technique
    Pizzato, LAS
    de Lima, VLS
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROCEEDINGS, 2003, 2721 : 251 - 258
  • [8] Thesaurus-Based Search in Large Heterogeneous Collections
    Wielemaker, Jan
    Hildebrand, Michiel
    van Ossenbruggen, Jacco
    Schreiber, Guus
    SEMANTIC WEB - ISWC 2008, 2008, 5318 : 695 - +
  • [9] Associative and spatial relationships in thesaurus-based retrieval
    Alani, H
    Jones, C
    Tudhope, D
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2000, 1923 : 45 - 58
  • [10] Paper resources cataloguing - Thesaurus-based approach
    Tymovchak, Oksana
    Moroz, Svidana
    2007 PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON THE EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS, 2007, : 588 - 589