Evaluating Thesaurus-Based Topic Models

被引:1
|
作者
Loukachevitch, Natalia [1 ]
Ivanov, Kirill [1 ]
机构
[1] Lomonosov Moscow State Univ, Moscow, Russia
基金
俄罗斯基础研究基金会;
关键词
Topic models; Thesaurus; Content-based analysis;
D O I
10.1007/978-3-319-91947-8_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study thesaurus-based topic models and evaluate them from the point of view of topic coherence. Thesaurus based topic models enhance the scores of related terms found in the same text, which means that the model encourages these terms to be on the same topics. We evaluate various variants of such models. First, we carry out a manual evaluation of the obtained topics. Second, we study the possibility to use the collected manual data for evaluating new variants of thesaurus-based models, propose a method and select the best its parameters in cross-validation. Third, we apply the created evaluation method to estimate the influence of word frequencies on adding thesaurus relations for generating coherent topic models.
引用
收藏
页码:364 / 376
页数:13
相关论文
共 50 条
  • [21] Assessing thesaurus-based query expansion using the UMLS metathesaurus
    Hersh, W
    Price, S
    Donohoe, L
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 344 - 348
  • [22] RANKING DOCUMENTS IN THESAURUS-BASED BOOLEAN RETRIEVAL-SYSTEMS
    LEE, JH
    KIM, MH
    LEE, YJ
    INFORMATION PROCESSING & MANAGEMENT, 1994, 30 (01) : 79 - 91
  • [23] A Thesaurus-Based Sentiment Lexicon for Danish: The Danish Sentiment Lexicon
    Nimb, Sanni
    Olsen, Sussi
    Pedersen, Bolette S.
    Troelsgard, Thomas
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 2826 - 2832
  • [24] Thesaurus-based word embeddings for automated biomedical literature classification
    Dimitrios A. Koutsomitropoulos
    Andreas D. Andriopoulos
    Neural Computing and Applications, 2022, 34 : 937 - 950
  • [25] Exploiting a thesaurus-based semantic net for knowledge-based search
    Clark, P
    Thompson, J
    Holmback, H
    Duncan, L
    SEVENTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-2001) / TWELFTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-2000), 2000, : 988 - 995
  • [26] Thesaurus-based feedback to support mixed search and browsing environments
    Meij, Edgar
    de Rijke, Maarten
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, PROCEEDINGS, 2007, 4675 : 247 - +
  • [27] Combining Thesaurus Knowledge and Probabilistic Topic Models
    Loukachevitch, Natalia
    Nokel, Michael
    Ivanov, Kirill
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2017, 2018, 10716 : 59 - 71
  • [28] Adding Thesaurus Information into Probabilistic Topic Models
    Loukachevitch, Natalia
    Nokel, Michael
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 210 - 218
  • [29] THESAURUS-BASED INDEXING AND CLASSIFICATION SYSTEM DEVELOPED FOR INSPEC PRODUCTS AND SERVICES
    FIELD, BJ
    JOURNAL OF DOCUMENTATION, 1974, 30 (01) : 1 - 17
  • [30] Thesaurus-based word embeddings for automated biomedical literature classification
    Koutsomitropoulos, Dimitrios A.
    Andriopoulos, Andreas D.
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (02): : 937 - 950