Improving Basic Natural Language Processing Tools for the Ainu Language

被引:3
|
作者
Nowakowski, Karol [1 ]
Ptaszynski, Michal [1 ]
Masui, Fumito [1 ]
Momouchi, Yoshio [2 ]
机构
[1] Kitami Inst Technol, Dept Comp Sci, 165 Koen Cho, Kitami, Hokkaido 0908507, Japan
[2] Hokkai Gakuen Univ, Fac Engn, Dept Elect & Informat Engn, Chuo Ku, 1-1,Nishi 11 Chome,Minami 26 Jo, Sapporo, Hokkaido 0640926, Japan
关键词
Ainu language; endangered languages; normalization; word segmentation; part-of-speech tagging;
D O I
10.3390/info10110329
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Ainu is a critically endangered language spoken by the native inhabitants of northern Japan. This paper describes our research aimed at the development of technology for automatic processing of text in Ainu. In particular, we improved the existing tools for normalizing old transcriptions, word segmentation, and part-of-speech tagging. In the experiments we applied two Ainu language dictionaries from different domains (literary and colloquial) and created a new data set by combining them. The experiments revealed that expanding the lexicon had a positive impact on the overall performance of our tools, especially with test data unrelated to any of the training sets used.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Towards Better Text Processing Tools for the Ainu Language
    Nowakowski, Karol
    Ptaszynski, Michal
    Masui, Fumito
    HUMAN LANGUAGE TECHNOLOGY. CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, LTC 2017, 2020, 12598 : 131 - 145
  • [2] Improving Neurology Clinical Care With Natural Language Processing Tools
    Ge, Wendong
    Rice, Hunter J.
    Sheikh, Irfan S.
    Westover, M. Brandon
    Weathers, Allison L.
    Jones, Lyell K.
    Moura, Lidia
    NEUROLOGY, 2023, 101 (22) : 1010 - 1018
  • [3] Natural Language Processing Tools and Workflows for Improving Research Processes
    Khan, Noel
    Elizondo, David
    Deka, Lipika
    Molina-Cabello, Miguel A.
    APPLIED SCIENCES-BASEL, 2024, 14 (24):
  • [4] Visual tools for natural language processing
    Gaizauskas, R
    Rodgers, PJ
    Humphreys, K
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2001, 12 (04): : 375 - 412
  • [5] Improving Policing with Natural Language Processing
    Dixon, Anthony
    Birks, Daniel
    NLP4POSIMPACT 2021: THE 1ST WORKSHOP ON NLP FOR POSITIVE IMPACT, 2021, : 115 - 124
  • [6] Principles and Interactive Tools for Evaluating and Improving the Behavior of Natural Language Processing models
    Wu, Tongshuang
    EXTENDED ABSTRACTS OF THE 2021 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'21), 2021,
  • [7] Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)
    Toneva, Mariya
    Wehbe, Leila
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Building natural language processing tools for Runyakitara
    Katushemererwe, Fridah
    Caines, Andrew
    Buttery, Paula
    APPLIED LINGUISTICS REVIEW, 2021, 12 (04) : 585 - 609
  • [9] INTERNET TOOLS AND NATURAL-LANGUAGE PROCESSING
    BRETT, GH
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1995, 32 : 229 - 229
  • [10] Improving Student Surveys with Natural Language Processing
    Hood, Karoline M.
    Kuiper, Patrick K.
    2018 SECOND IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC), 2018, : 383 - 386