Semi-supervised maximum entropy based POS tagging for large scale Chinese corpus

被引:0
|
作者
Yuan, Caixia [1 ]
Wang, Xiaojie [1 ]
Zhai, Junjie [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat Engn, Beijing 100876, Peoples R China
关键词
semi-supervised; maximum entropy; Chinese POS tagging;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents issues related to POS tagging for large-scale Chinese corpus using the maximum entropy technique, in which unlabeled data are introduced for compensating the sparseness and inconsistency of labeled data. We test our method on the corpus of Peking University China and show that as much as 27% error reduction is obtained by semi-supervised strategy.
引用
收藏
页码:385 / 389
页数:5
相关论文
共 50 条
  • [31] AraSenCorpus: A Semi-Supervised Approach for Sentiment Annotation of a Large Arabic Text Corpus
    Al-Laith, Ali
    Shahbaz, Muhammad
    Alaskar, Hind F.
    Rehmat, Asim
    APPLIED SCIENCES-BASEL, 2021, 11 (05):
  • [32] Active Learning for Semi-supervised Classification Based On Information Entropy
    Jie, Shen
    Xin, Fan
    Wen, Shen
    2009 INTERNATIONAL FORUM ON INFORMATION TECHNOLOGY AND APPLICATIONS, VOL 2, PROCEEDINGS, 2009, : 591 - 595
  • [33] Semi-supervised sequence tagging with bidirectional language models
    Peters, Matthew E.
    Ammar, Waleed
    Bhagavatula, Chandra
    Power, Russell
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1756 - 1765
  • [34] A survey of large-scale graph-based semi-supervised classification algorithms
    Song Y.
    Zhang J.
    Zhang C.
    International Journal of Cognitive Computing in Engineering, 2022, 3 : 188 - 198
  • [35] Semi-supervised multi-view maximum entropy discrimination with expectation Laplacian regularization
    Chao, Guoqing
    Sun, Shiliang
    INFORMATION FUSION, 2019, 45 : 296 - 306
  • [36] Semi-supervised Multitask Learning via Self-training and Maximum Entropy Discrimination
    Chao, Guoqing
    Sun, Shiliang
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 340 - 347
  • [37] A Modified Markov-Based Maximum-Entropy Model for POS Tagging of Odia Text
    Pattnaik, Sagarika
    Nayak, Ajit Kumar
    INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
  • [38] Large scale semi-supervised linear SVM with stochastic gradient descent
    Zhou, X. (zhouxin@mtlab.hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [39] Transductive Centroid Projection for Semi-supervised Large-Scale Recognition
    Liu, Yu
    Song, Guanglu
    Shao, Jing
    Jin, Xiao
    Wang, Xiaogang
    COMPUTER VISION - ECCV 2018, PT V, 2018, 11209 : 72 - 89
  • [40] Nonnegative Spectral Clustering for Large-Scale Semi-supervised Learning
    Hu, Weibo
    Chen, Chuan
    Ye, Fanghua
    Zheng, Zibin
    Ling, Guohui
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 287 - 291