Corpus based part-of-speech tagging

Cited by: 6
Authors
Lv, Chengyao [1]
Liu, Huihua [1]
Dong, Yuanxing [1]
Chen, Yunliang [1, 2]
Affiliations
[1] China Univ Geosci, Sch Foreign Language, Wuhan 430074, Peoples R China
[2] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Natural language processing; POS tagging; Hidden Markov models; Support vector machine; Neural networks; Gene expression programming
DOI
10.1007/s10772-016-9356-2
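Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809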
Abstract
In natural language processing, a part-of-speech (POS) tagger is a crucial subsystem in a wide range of applications. It labels (or classifies) unannotated words of natural language with POS labels corresponding to categories such as noun, verb or adjective. Mainstream approaches are generally corpus-based: a POS tagger learns from a corpus of pre-annotated data how to correctly tag unlabeled data. This article presents a brief account of the state of the art in POS tagging. POS tagging approaches use a labeled corpus to train computational models. Several typical models from three kinds of tagging are introduced: rule-based tagging, statistical approaches and evolutionary algorithms. The advantages and pitfalls of each typical tagging approach are discussed and analyzed. Some rule-based and stochastic methods have achieved accuracies of 93-96 %, while some evolutionary algorithms reach about 96-97 %.
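The corpus-based idea in the abstract (learn from pre-annotated data, then tag unlabeled words) can be illustrated with a minimal sketch. This is a most-frequent-tag baseline, not one of the models surveyed in the article; the toy corpus, the tag names and the function names `train` and `tag` are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

# Toy pre-annotated corpus (hypothetical data, for illustration only).
TRAIN = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("the", "DET"), ("old", "ADJ"), ("dog", "NOUN"),
     ("chases", "VERB"), ("a", "DET"), ("cat", "NOUN")],
    [("dogs", "NOUN"), ("bark", "VERB")],
]


def train(corpus):
    """Learn each word's most frequent tag, plus a fallback for unseen words."""
    per_word = defaultdict(Counter)  # word -> counts of the tags seen with it
    overall = Counter()              # tag counts over the whole corpus
    for sentence in corpus:
        for word, pos in sentence:
            per_word[word.lower()][pos] += 1
            overall[pos] += 1
    lexicon = {w: counts.most_common(1)[0][0] for w, counts in per_word.items()}
    fallback = overall.most_common(1)[0][0]  # most frequent tag overall
    return lexicon, fallback


def tag(words, lexicon, fallback):
    """Tag each word with its learned tag, or the fallback if it was never seen."""
    return [(w, lexicon.get(w.lower(), fallback)) for w in words]


lexicon, fallback = train(TRAIN)
print(tag(["The", "cat", "barks"], lexicon, fallback))
print(tag(["bird"], lexicon, fallback))
```

On this toy corpus the first call tags "The", "cat" and "barks" as DET, NOUN and VERB, and the unseen word "bird" falls back to NOUN, the most frequent tag in the training data. A baseline like this ignores context entirely; the statistical models named in the keywords (for example hidden Markov models) also use the surrounding tags.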
Pages: 647-654
Number of pages: 8