Comparison of various approaches to tagging for the inflectional Slovak language

被引:0
|
作者
Benko L. [1 ]
Munkova D. [1 ]
Pappová M. [1 ]
Munk M. [1 ,2 ]
机构
[1] Department of Computer Science, Constantine the Philosopher University in Nitra, Nitra
[2] Science and Research Centre, University of Pardubice, Pardubice
关键词
Automatic taggers; Low-resource language; Morhological annotation; Part-of-speech tagging; Slovak language;
D O I
10.7717/PEERJ-CS.2026
中图分类号
学科分类号
摘要
Morphological tagging provides essential insights into grammar, structure, and the mutual relationships of words within the sentence. Tagging text in a highly inflectional language presents a challenging task due to word ambiguity. This research aims to compare six different automatic taggers for the inflectional Slovak language, seeking for the most accurate tagger for literary and non-literary texts. Our results indicate that it is useful to differentiate texts into literary and non-literary and subsequently, based on the text style to deploy a tagger. For literary texts, UDPipe2 outperformed others in seven out of nine examined tagset positions. Conversely, for non-literary texts, the RNNTagger exhibited the highest performance in eight out of nine examined tagset positions. The RNNTagger is recommended for both types of the text, the best captures the inflection of the Slovak language, but UDPipe2 demonstrates a higher accuracy for literary texts. Despite dataset size limitations, this study emphasizes the suitability of various taggers for the inflectional languages like Slovak. © Copyright 2024 Benko et al.
引用
收藏
页码:1 / 31
页数:30
相关论文
共 50 条
  • [41] A Language-Independent Feature Schema for Inflectional Morphology
    Sylak-Glassman, John
    Kirov, Christo
    Yarowsky, David
    Que, Roger
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, : 674 - 680
  • [42] Awareness of second language inflectional morphology: A case study on finnish as a second language
    Suni, Minna
    ACTA LINGUISTICA HUNGARICA, 2007, 54 (02) : 217 - 235
  • [43] Inflectional Review of Deep Learning on Natural Language Processing
    Fahad, S. K. Ahammad
    Yahya, Abdulsamad Ebrahim
    2018 INTERNATIONAL CONFERENCE ON SMART COMPUTING AND ELECTRONIC ENTERPRISE (ICSCEE), 2018,
  • [44] Collaborative tagging applications and approaches
    Li, Qingfeng
    Lu, Stephen C-Y
    IEEE MULTIMEDIA, 2008, 15 (03) : 14 - 21
  • [45] MORPHOLOGICAL RANDOM FORESTS FOR LANGUAGE MODELING OF INFLECTIONAL LANGUAGES
    Oparin, Ilya
    Glembek, Ondrej
    Burget, Lukas
    Cernocky, Jan
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 189 - +
  • [46] Inflectional morphology in a family with inherited specific language impairment
    Ullman, MT
    Gopnik, M
    APPLIED PSYCHOLINGUISTICS, 1999, 20 (01) : 51 - 117
  • [47] INFLECTIONAL AND DERIVATIONAL VERBAL FORM IN MALINSKA SLAVONIC LANGUAGE
    Breu, W.
    VOPROSY YAZYKOZNANIYA, 2006, (03): : 70 - 87
  • [48] Preferred document classification for a highly inflectional/derivational language
    Min, K
    Wilson, WH
    Moon, YJ
    AL 2002: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2002, 2557 : 12 - 23
  • [49] VARIOUS LANGUAGE-PHILOSOPHY APPROACHES TO THE CATEGORIZATION OF THE PROFESSIONAL OIL LANGUAGE IN ENGLISH AND RUSSIAN
    Morozova, Olga
    Yakhina, Albina
    WISDOM, 2022, 3 (02): : 178 - 192
  • [50] Comparison of neural architectures for machine translation of the Slovak language using the Fairseq toolkit
    Harahus, Maros
    Hladek, Daniel
    Juhar, Jozef
    Sokolova, Zuzana
    2023 IEEE 21ST WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, SAMI, 2023, : 185 - 189