Comparison of various approaches to tagging for the inflectional Slovak language

Authors
Benko L. [1]
Munkova D. [1]
Pappová M. [1]
Munk M. [1,2]
Affiliations
[1] Department of Computer Science, Constantine the Philosopher University in Nitra, Nitra
[2] Science and Research Centre, University of Pardubice, Pardubice
Keywords
Automatic taggers; Low-resource language; Morphological annotation; Part-of-speech tagging; Slovak language
DOI
10.7717/PEERJ-CS.2026
Abstract
Morphological tagging provides essential insights into the grammar, structure, and mutual relationships of words within a sentence. Tagging text in a highly inflectional language is challenging because of word ambiguity. This research compares six automatic taggers for the inflectional Slovak language to identify the most accurate tagger for literary and non-literary texts. Our results indicate that it is useful to distinguish literary from non-literary texts and then to choose a tagger based on the text style. For literary texts, UDPipe2 outperformed the others in seven out of nine examined tagset positions. Conversely, for non-literary texts, the RNNTagger exhibited the highest performance in eight out of nine examined tagset positions. The RNNTagger is recommended for both types of text because it best captures the inflection of the Slovak language, although UDPipe2 achieves higher accuracy on literary texts. Despite dataset size limitations, this study highlights how the suitability of different taggers varies for inflectional languages such as Slovak. © Copyright 2024 Benko et al.
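The abstract reports results per tagset position, which suggests that each tagger's output was scored position by position against a reference annotation. The following is a minimal sketch of such a comparison, not the authors' evaluation code. It assumes a positional tagset in which each character of a tag encodes one category. The function name, the tagger names, and the toy tags are hypothetical and chosen only for illustration.

```python
def per_position_accuracy(gold_tags, predicted_tags, n_positions):
    """Accuracy at each tag position (hypothetical helper, illustrative only).

    A position is scored only for tokens whose gold tag is long enough
    to have that position. Positions never seen in the gold data yield None.
    """
    if len(gold_tags) != len(predicted_tags):
        raise ValueError("gold and predicted sequences must have equal length")

    correct = [0] * n_positions
    total = [0] * n_positions
    for gold, pred in zip(gold_tags, predicted_tags):
        for i in range(min(n_positions, len(gold))):
            total[i] += 1
            # A prediction that is too short counts as wrong at this position.
            if i < len(pred) and pred[i] == gold[i]:
                correct[i] += 1
    return [c / t if t else None for c, t in zip(correct, total)]


# Toy data, invented for this sketch (assumption: five-character tags).
gold = ["SSms1", "VKesc", "AAfs4"]
outputs = {
    "tagger_a": ["SSms1", "VKesc", "AAfs1"],  # wrong at the last position once
    "tagger_b": ["SSfs1", "VKesc", "AAfs4"],  # wrong at the third position once
}

scores = {
    name: per_position_accuracy(gold, tags, n_positions=5)
    for name, tags in outputs.items()
}
for name, accuracies in scores.items():
    print(name, accuracies)
```

Counting, for each tagger, the positions where it scores highest gives a summary of the kind the abstract uses ("seven out of nine examined tagset positions"). How ties and non-applicable positions were handled in the study is not stated here, so the choices above are assumptions.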
Pages: 1 / 31
Number of pages: 30
Source
PeerJ Computer Science, 2024, 10