Hybrid approach for Khmer unknown word POS guessing

Authors
Nou, Chenda [1]
Kameyama, Wataru [1]
Affiliations
[1] Waseda Univ, Grad Sch Global Informat & Telecommun Studies, 1011 Nishi Tomida, Honjo, Saitama 3670035, Japan
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Because new words are created every day and no lexicon is large enough to cover them all, unknown words are a serious problem in part-of-speech tagging. This paper presents a hybrid approach to handling unknown words in Khmer part-of-speech tagging. The approach combines a rule-based model with a trigram model, using both the internal structure of a word and its surrounding context to predict the part of speech of unknown words. The proposed approach achieves accuracies of 88.9% on the training data and 78.2% on the test data.
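The abstract does not give the actual rules or say how the two models are combined, so the following is only a minimal sketch of one plausible arrangement: internal-structure rules narrow the set of candidate tags, and a smoothed tag trigram model picks among them using the two preceding tags. The toy tag sequences, the tag set, the add-one smoothing, the function names, and the single prefix rule (the Khmer prefix ការ treated as a noun marker) are all illustrative assumptions, not details taken from the paper.

```python
from collections import Counter

# Hypothetical toy training data: tag sequences only (not from the paper).
TAGGED = [
    ["PRON", "VERB", "NOUN"],
    ["PRON", "VERB", "NOUN"],
    ["NOUN", "VERB", "ADJ"],
    ["PRON", "VERB", "ADJ"],
]
TAGS = ["ADJ", "NOUN", "PRON", "VERB"]

# Hypothetical internal-structure rule: prefix -> allowed tags.
PREFIX_RULES = {"ការ": {"NOUN"}}


def train_trigrams(sequences):
    """Count tag trigrams and their two-tag contexts."""
    tri, ctx = Counter(), Counter()
    for seq in sequences:
        padded = ["<s>", "<s>"] + seq
        for i in range(2, len(padded)):
            tri[tuple(padded[i - 2:i + 1])] += 1
            ctx[tuple(padded[i - 2:i])] += 1
    return tri, ctx


def trigram_prob(tag, prev2, prev1, tri, ctx):
    """P(tag | prev2, prev1) with add-one smoothing (an assumed choice)."""
    return (tri[(prev2, prev1, tag)] + 1) / (ctx[(prev2, prev1)] + len(TAGS))


def rule_candidates(word):
    """Use word-internal structure to narrow the candidate tags."""
    for prefix, tags in PREFIX_RULES.items():
        if word.startswith(prefix) and len(word) > len(prefix):
            return set(tags)
    return set(TAGS)  # no rule fires: leave every tag open


def guess_tag(word, prev2, prev1, tri, ctx):
    """Pick the rule-permitted tag that the trigram context scores highest."""
    candidates = sorted(rule_candidates(word))  # sorted for deterministic ties
    return max(candidates, key=lambda t: trigram_prob(t, prev2, prev1, tri, ctx))


TRI, CTX = train_trigrams(TAGGED)
```

With these toy counts, `guess_tag("xyz", "NOUN", "VERB", TRI, CTX)` falls back to context alone and returns `"ADJ"`, while a word beginning with the assumed prefix is restricted to `"NOUN"` whatever the context. Letting rules filter and the trigram model rank is only one way to combine the two sources of evidence; a weighted vote or a back-off order would be equally consistent with the abstract.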
Pages: 215 / +
Page count: 2