Enhancing HMM-based POS tagger for Mizo language

Authors
Nunsanga, Morrel V. L. [1]
Pakray, Partha [2]
Devi, Toijam Sonalika [1]
Singh, L. Lolit Kr [3]
Affiliations
[1] Mizoram Univ, Dept Informat Technol, Mizoram 796004, India
[2] NIT Silchar, Dept CSE, Silchar, Assam, India
[3] Mizoram Univ, Dept ECE, Mizoram, India
Keywords
Hybrid POS tagger; rule-based POS tagger; N-gram tagger; Mizo POS tagger; Hidden Markov Model
DOI
10.3233/JIFS-224220
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Part-of-speech (POS) tagging is the process of assigning each word its appropriate part of speech. Good tagger performance requires a substantial amount of well-organized data (corpora) and significant research on the target language. Mizo is an under-resourced language that needs more research attention in computational linguistics. The limited availability of corpora and relevant literature makes assigning POS labels to Mizo text more difficult. This paper explores two methods for improving the Hidden Markov Model (HMM)-based POS tagger for the Mizo language. The proposed taggers are compared with the baseline HMM tagger and N-gram taggers on a purpose-built Mizo corpus of 72,077 manually tagged tokens. The experimental results show that both proposed taggers improve on the HMM-based Mizo POS tagger, achieving 81.52% and 84.29% accuracy, respectively. A detailed performance analysis of the proposed hybrid tagger yields a weighted-average precision, recall, and F1-score of 83.09%, 77.88%, and 79.64%, respectively.
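The abstract reports weighted-average precision, recall, and F1-score. As a minimal sketch of how such support-weighted averages are commonly computed for a tagger, the Python below scores a toy tag sequence. The function name `weighted_prf`, the tag labels, and the sample data are illustrative assumptions, not taken from the paper, and the paper's exact evaluation procedure may differ.

```python
from collections import Counter

def weighted_prf(gold, pred):
    """Support-weighted average precision, recall and F1 over gold tags.

    Hypothetical helper for illustration; not the paper's evaluation code.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    support = Counter(gold)      # number of gold tokens per tag
    predicted = Counter(pred)    # number of predicted tokens per tag
    correct = Counter(g for g, p in zip(gold, pred) if g == p)
    total = len(gold)
    avg_p = avg_r = avg_f = 0.0
    for tag, n in support.items():
        # Per-tag scores; a tag that is never predicted gets precision 0.
        p = correct[tag] / predicted[tag] if predicted[tag] else 0.0
        r = correct[tag] / n
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        weight = n / total       # weight each tag by its share of gold tokens
        avg_p += weight * p
        avg_r += weight * r
        avg_f += weight * f
    return avg_p, avg_r, avg_f

# Toy data (assumed tag names, not the Mizo tagset).
gold = ["NN", "NN", "VB", "VB", "JJ", "NN"]
pred = ["NN", "VB", "VB", "VB", "NN", "NN"]
precision, recall, f1 = weighted_prf(gold, pred)
print(f"P={precision:.4f} R={recall:.4f} F1={f1:.4f}")
```

Because F1 is averaged per tag before weighting, the weighted F1 generally does not equal the harmonic mean of the weighted precision and weighted recall.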
Pages: 11725-11736
Number of pages: 12