Hybrid approach for Khmer unknown word POS guessing

被引:0
|
作者
Nou, Chenda [1 ]
Kameyama, Wataru [1 ]
机构
[1] Waseda Univ, Grad Sch Global Informat & Telecommun Studies, 1011 Nishi Tomida, Honjo, Saitama 3670035, Japan
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
New words are being created everyday and the lexicon is not large enough to cover all the words, unknown words become a serious problem in part-of-speech tagging. This paper presents a hybrid approach to handle the unknown word problem in Khmer part-of-speech tagging. The hybrid approach combined of rule-based model and trigram model makes use of both internal structure of the word and surrounding contextual information to predict the part-of-speech of unknown words. The proposed approach achieves 88.9% and 78.2% of accuracy on training and test data respectively.
引用
收藏
页码:215 / +
页数:2
相关论文
共 50 条
  • [31] INQUIRY INTO AN UNKNOWN WORD
    PAGE, WD
    SCHOOL REVIEW, 1975, 83 (03): : 461 - 477
  • [32] Parameterized Hybrid Password Guessing Method
    Han W.
    Zhang J.
    Xu M.
    Wang C.
    Zhang H.
    He Z.
    Chen H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2022, 59 (12): : 2708 - 2722
  • [33] Contradict the Machine: A Hybrid Approach to Identifying Unknown Unknowns
    Vandenhof, Colin
    Law, Edith
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2238 - 2240
  • [34] Unknown Words Analysis in POS tagging of Sinhala Language
    Jayaweera, A. J. P. M. P.
    Dias, N. G. J.
    14TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER) 2014, 2014, : 270 - 270
  • [35] The lifespan development of cognate guessing skills in an unknown related language
    Vanhove, Jan
    Berthele, Raphael
    IRAL-INTERNATIONAL REVIEW OF APPLIED LINGUISTICS IN LANGUAGE TEACHING, 2015, 53 (01): : 1 - 38
  • [36] Guessing errors made by children with dyslexia in word and text reading
    De Rom, Margot
    Van Reybroeck, Marie
    FRONTIERS IN PSYCHOLOGY, 2024, 15
  • [37] A hybrid approach to automatic word-spacing in Korean
    Kang, M
    Choi, S
    Kwon, H
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2004, 3029 : 284 - 294
  • [38] A Hybrid Approach for Word Alignment with Statistical Modeling and Chunker
    Srivastava, Jyoti
    Sanyal, Sudip
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 570 - 581
  • [39] Word segmentation and POS tagging for Chinese keyphrase extraction
    Huang, XC
    Chen, J
    Yan, PL
    Luo, X
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2005, 3584 : 364 - 369
  • [40] CHILDRENS USE OF CONTEXT IN WORD RECOGNITION - A PSYCHOLINGUISTIC GUESSING GAME
    SCHWANTES, FM
    BOESL, SL
    RITZ, EG
    CHILD DEVELOPMENT, 1980, 51 (03) : 730 - 736