Some improvements on maximum entropy based Chinese POS tagging

被引:0
|
作者
Center for Intelligence Science and Technology Research, Beijing University of Posts and Telecommunications, Beijing 100876, China [1 ]
机构
来源
J. China Univ. Post Telecom. | 2006年 / SUPPL.卷 / 99-103期
关键词
Computer simulation - Entropy - Errors - Knowledge engineering - Mathematical models - Probability - Syntactics - Vocabulary control - Word processing;
D O I
暂无
中图分类号
学科分类号
摘要
This paper explores issues related to part-of-speech tagging for Chinese language using maximum entropy technique, in which we first introduced our feature selection strategy based on incremental experiments and error-driven analysis. Then making use of the knowledge from a syntactic dictionary, we created pseudo-events for external lexicon and restricted tags of words to a specific subset, which shrinked the search space greatly. Experiments on the simplified Chinese corpus of China Peking University show that significant improvements are obtained by our approach.
引用
收藏
相关论文
共 50 条
  • [41] Effectiveness of POS Tagging in Graph Based Sentiment Analysis Model
    Bordoloi, Monali
    Agarwal, Deepak
    Biswas, Saroj Kumar
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [42] Unknown word processing in HMM-based POS tagging
    Zhang, Xiaofei
    Huang, Heyan
    Zhang, Daoyang
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 110 - 113
  • [43] Fusion of word clustering features for tibetan part of speech tagging based on maximum entropy model
    Ma N.
    Li Y.
    He X.
    International Journal of Simulation: Systems, Science and Technology, 2016, 17 (08): : 19.1 - 19.5
  • [44] A maximum entropy model based answer extraction for Chinese question answering
    Sun, Ang
    Jiang, Minghu
    Ma, Yanjun
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4223 : 1239 - 1248
  • [45] The Research on Chinese Coreference Resolution Based on Maximum Entropy Model and Rules
    Zhang, Yihao
    Guo, Jianyi
    Yu, Zhengtao
    Zhang, Zhikun
    Yao, Xianming
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 1 - 8
  • [46] Method of Chinese Named Entity Recognition Based on Maximum Entropy Model
    Ning Hui
    Yang Hua
    Tan Ya-zhou
    Wu Hao
    2009 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-7, CONFERENCE PROCEEDINGS, 2009, : 2472 - 2477
  • [47] LM Enhanced BiRNN-CRF for Joint Chinese Word Segmentation and POS Tagging
    Zhang, Jianhu
    Liu, Gongshen
    Zhou, Jie
    Zhou, Cheng
    Sun, Huanrong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 105 - 116
  • [48] A Unified Model for Joint Chinese Word Segmentation and POS Tagging with Heterogeneous Annotation Corpora
    Zhao, Jiayi
    Qiu, Xipeng
    Huang, Xuanjing
    2013 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2013), 2013, : 227 - 230
  • [49] Research on the Method and System of Word Segmentation and POS Tagging for Ancient Chinese Medicine Literature
    Fu, Xianjun
    Yuan, Ting
    Li, Xuebo
    Wang, Zhenguo
    Zhou, Yang
    Ju, Fangning
    Li, Jintong
    Chen, Xiaokang
    Sang Xiaoming
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 2493 - 2498
  • [50] On some problems of the maximum entropy ansatz
    Bandyopadhyay, K
    Bhattacharyya, K
    Bhattacharya, AK
    PRAMANA-JOURNAL OF PHYSICS, 2000, 54 (03): : 365 - 375