Hybrid approach for Khmer unknown word POS guessing

被引:0
|
作者
Nou, Chenda [1 ]
Kameyama, Wataru [1 ]
机构
[1] Waseda Univ, Grad Sch Global Informat & Telecommun Studies, 1011 Nishi Tomida, Honjo, Saitama 3670035, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
New words are being created everyday and the lexicon is not large enough to cover all the words, unknown words become a serious problem in part-of-speech tagging. This paper presents a hybrid approach to handle the unknown word problem in Khmer part-of-speech tagging. The hybrid approach combined of rule-based model and trigram model makes use of both internal structure of the word and surrounding contextual information to predict the part-of-speech of unknown words. The proposed approach achieves 88.9% and 78.2% of accuracy on training and test data respectively.
引用
收藏
页码:215 / +
页数:2
相关论文
共 50 条
  • [41] The effect of testing condition on word guessing in elementary school children
    Mannamaa, Mairi
    Kikas, Eve
    Raidvee, Aire
    JOURNAL OF PSYCHOEDUCATIONAL ASSESSMENT, 2008, 26 (01) : 16 - 26
  • [42] Analyzing word embeddings and improving POS tagger of Tigrinya
    Tedla, Yemane
    Yamamoto, Kazuhide
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 115 - 118
  • [43] A word-based predictive text entry method for Khmer language
    Ouk, Phavy
    Thu, Ye Kyaw
    Matsumoto, Mitsuji
    Urano, Yoshiyori
    PROCEEDINGS OF THE 2008 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 214 - 219
  • [44] POS-based Word Alignment for Small Corpus
    Srivastava, Jyoti
    Sanyal, Sudip
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 37 - 40
  • [45] Linguistic Sources in Guessing Word Meaning in Reading Arabic Text
    Hanan, Nik Mustapha
    Hasanah, Hidayatul Zulkifli
    Farhan, Nik Mustapha
    GLOBAL JOURNAL AL-THAQAFAH, 2012, 2 (02) : 87 - 94
  • [46] Khmer Word Segmentation based on Bi-Directional Maximal Matching for Plaintext and Microsoft Word Document
    Bi, Narin
    Taing, Nguonly
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [47] The word-detection effect: Sophisticated guessing or perceptual enhancement?
    Prinzmetal, W
    Lyon, CE
    MEMORY & COGNITION, 1996, 24 (03) : 331 - 341
  • [48] Longest Matching and Rule-based Techniques for Khmer Word Segmentation
    Long, Pakrigna
    Boonjing, Veera
    2018 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2018) - CYBERNETICS IN THE NEXT DECADES, 2018, : 80 - 83
  • [49] The method for the unknown word classification
    Kong, Hyunjang
    Hwang, Myunggwon
    Kim, Pankoo
    Advances in Knowledge Acquisition and Management, 2006, 4303 : 207 - 215
  • [50] Word Equations with One Unknown
    Laine, Markku
    Plandowski, Wojciech
    DEVELOPMENTS IN LANGUAGE THEORY, PROCEEDINGS, 2009, 5583 : 348 - +