Some improvements on maximum entropy based Chinese POS tagging

被引:0
|
作者
Center for Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China [1 ]
机构
来源
J. China Univ. Post Telecom. | 2006年 / SUPPL.卷 / 99-103期
关键词
Computer simulation - Entropy - Errors - Knowledge engineering - Mathematical models - Probability - Syntactics - Vocabulary control - Word processing;
D O I
暂无
中图分类号
学科分类号
摘要
This paper explores issues related to part-of-speech tagging for Chinese language using maximum entropy technique, in which we first introduced our feature selection strategy based on incremental experiments and error-driven analysis. Then making use of the knowledge from a syntactic dictionary, we created pseudo-events for external lexicon and restricted tags of words to a specific subset, which shrinked the search space greatly. Experiments on the simplified Chinese corpus of China Peking University show that significant improvements are obtained by our approach.
引用
收藏
相关论文
共 50 条
  • [21] Manual POS Tagging for Middle Chinese Corpus Processing
    Zhang, Yancheng
    Sun, Wan
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EDUCATION, LANGUAGE, ART AND INTERCULTURAL COMMUNICATION, 2014, 3 : 351 - 356
  • [22] Implementing Chinese new word discovery and POS tagging based on support vector machine
    School of Computer Science, Fudan University, Shanghai 200433, China
    不详
    J. Comput. Inf. Syst., 2009, 3 (1279-1285):
  • [23] An Effective Joint Model for Chinese Word Segmentation and POS Tagging
    Wang, Heng-Jun
    Si, Nian-Wen
    Chen, Cheng
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION PROCESSING (ICIIP'16), 2016,
  • [24] CCRFs: Cascaded Conditional Random Fields for Chinese POS Tagging
    Shen, Guoyang
    Qiu, Likun
    Hu, Changjian
    Zhao, Kai
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 437 - 444
  • [25] Chinese Morphological Analysis with Character-level POS Tagging
    Shen, Mo
    Liu, Hongxiao
    Kawahara, Daisuke
    Kurohashi, Sadao
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 253 - 258
  • [26] Discrete and Neural Models for Chinese POS Tagging: Comparison and Combination
    Zhang, Meishan
    Yu, Nan
    Fu, Guohong
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 451 - 460
  • [27] A Transliteration of CRF Based Manipuri POS Tagging
    Nongmeikapam, Kishorjit
    Bandyopadhyay, Sivaji
    2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, COMPUTING & SECURITY [ICCCS-2012], 2012, 1 : 582 - 589
  • [28] A BERT Based Approach for Arabic POS Tagging
    Saidi, Rakia
    Jarray, Fethi
    Mansour, Mahmud
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 311 - 321
  • [29] Chinese POS Tagging Method Based on Bi-GRU plus CRF Hybrid Model
    Guo, Jia-jun
    Wang, Shu-pei
    Yu, Cheng-hai
    Song, Jin-yu
    ADVANCES IN INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS, 2019, 23 : 453 - 460
  • [30] A maximum entropy Chinese character-based parser
    Luo, XQ
    PROCEEDINGS OF THE 2003 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2003, : 192 - 199