Evaluating Thesaurus-Based Topic Models

Authors
Loukachevitch, Natalia [1]
Ivanov, Kirill [1]
Affiliations
[1] Lomonosov Moscow State Univ, Moscow, Russia
Funding
Russian Foundation for Basic Research;
Keywords
Topic models; Thesaurus; Content-based analysis;
DOI
10.1007/978-3-319-91947-8_38
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we study thesaurus-based topic models and evaluate them in terms of topic coherence. Thesaurus-based topic models boost the scores of related terms found in the same text, which encourages the model to place these terms in the same topics. We evaluate several variants of such models. First, we carry out a manual evaluation of the obtained topics. Second, we study whether the collected manual data can be used to evaluate new variants of thesaurus-based models, propose a method, and select its best parameters by cross-validation. Third, we apply the resulting evaluation method to estimate how word frequencies influence the addition of thesaurus relations for generating coherent topic models.
Pages: 364 - 376
Number of pages: 13