Some improvements on maximum entropy based Chinese POS tagging

被引:0
|
作者
Center for Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China [1 ]
机构
来源
J. China Univ. Post Telecom. | 2006年 / SUPPL.卷 / 99-103期
关键词
Computer simulation - Entropy - Errors - Knowledge engineering - Mathematical models - Probability - Syntactics - Vocabulary control - Word processing;
D O I
暂无
中图分类号
学科分类号
摘要
This paper explores issues related to part-of-speech tagging for Chinese language using maximum entropy technique, in which we first introduced our feature selection strategy based on incremental experiments and error-driven analysis. Then making use of the knowledge from a syntactic dictionary, we created pseudo-events for external lexicon and restricted tags of words to a specific subset, which shrinked the search space greatly. Experiments on the simplified Chinese corpus of China Peking University show that significant improvements are obtained by our approach.
引用
收藏
相关论文
共 50 条
  • [1] Chinese POS tagging based on maximum entropy model
    Zhao, R
    Wang, XL
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 601 - 605
  • [2] Applying class triggers in Chinese pos tagging based on maximum entropy model
    Zhao, Y
    Wang, XL
    Liu, BQ
    Guan, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1641 - 1645
  • [3] Chinese POS tagging employing maximum entropy and word clustering
    Ma, Jianjun
    Huang, Degen
    Li, Zezhong
    Journal of Information and Computational Science, 2010, 7 (12): : 2420 - 2428
  • [4] Chinese POS Tagging Using Restricted Maximum Entropy Model
    Zhang Hong
    Ren Fuji
    CHINESE JOURNAL OF ELECTRONICS, 2010, 19 (01): : 39 - 42
  • [5] An English POS Tagging Approach Based on Maximum Entropy
    Yi, Chen
    2015 INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION, BIG DATA AND SMART CITY (ICITBS), 2016, : 81 - 84
  • [6] Semi-supervised maximum entropy based POS tagging for large scale Chinese corpus
    Yuan, Caixia
    Wang, Xiaojie
    Zhai, Junjie
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: 50 YEARS' ACHIEVEMENTS, FUTURE DIRECTIONS AND SOCIAL IMPACTS, 2006, : 385 - 389
  • [7] Improving Persian POS Tagging Using the Maximum Entropy Model
    Kardan, Ahmad A.
    Imani, Maryam Bahojb
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [8] Chinese part of speech tagging based on maximum entropy method
    Lin, H
    Yuan, CF
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 1447 - 1450
  • [9] A Modified Markov-Based Maximum-Entropy Model for POS Tagging of Odia Text
    Pattnaik, Sagarika
    Nayak, Ajit Kumar
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [10] PoS Tagging for Classical Chinese Text
    Chiu, Tin-shing
    Lu, Qin
    Xu, Jian
    Xiong, Dan
    Lo, Fengju
    CHINESE LEXICAL SEMANTICS (CLSW 2015), 2015, 9332 : 448 - 456