Learning word segmentation rules for tag prediction

被引:0
|
作者
Kazakov, D [1 ]
Manandhar, S
Erjavec, T
机构
[1] Univ York, York YO10 5DD, N Yorkshire, England
[2] Jozef Stefan Inst, Dept Intelligent Syst, Ljubljana, Slovenia
来源
INDUCTIVE LOGIC PROGRAMMING | 1999年 / 1634卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our previous work we introduced a hybrid, GA&ILP-based approach for learning of stem-suffix segmentation rules from an unmarked list of words. Evaluation of the method was made difficult by the lack of word corpora annotated with their morphological segmentation. Here the hybrid approach is evaluated indirectly, on the task of tag prediction. A pair of stem-tag and suffix-tag lexicons is obtained by the application of that approach to an annotated lexicon of word-tag pairs. The two lexicons are then used to predict the tags of unseen words in two ways, (1) by using only the stem and suffix generated by the segmentation rules, and (2) for all matching combinations of stem and suffix present in the lexicons. The results show high correlation between the constituents generated by the segmentation rules, and the tags of the words in which they appear, thereby demonstrating the linguistic relevance of the segmentations produced by the hybrid approach.
引用
收藏
页码:152 / 161
页数:10
相关论文
共 50 条
  • [31] Assessing performance of prediction rules in machine learning
    Martin, Rory
    Yu, Kai
    PHARMACOGENOMICS, 2006, 7 (04) : 543 - 550
  • [32] Implicitly Bayesian Prediction Rules in Deep Learning
    Mlodozeniec, Bruno
    Krueger, David
    Turner, Richard
    SYMPOSIUM ON ADVANCES IN APPROXIMATE BAYESIAN INFERENCE, 2024, 253 : 79 - 110
  • [33] Learning Compact Hashing Codes for Efficient Tag Completion and Prediction
    Wang, Qifan
    Ruan, Lingyun
    Zhang, Zhiwei
    Si, Luo
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1789 - 1794
  • [34] Joint learning of Chinese word segmentation and named entity recognition
    Huang X.
    Qiao L.
    Yu W.
    Li J.
    Xue H.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2021, 43 (01): : 86 - 94
  • [35] Realization of Chinese Word Segmentation based on Deep Learning Method
    Wang, Xuefei
    Wang, Mingjiang
    Zhang, Qiquan
    GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
  • [36] Adversarial Multi-Criteria Learning for Chinese Word Segmentation
    Chen, Xinchi
    Shi, Zhan
    Qiu, Xipeng
    Huang, Xuanjing
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1193 - 1203
  • [37] Word Segmentation Method for Handwritten Documents based on Structured Learning
    Ryu, Jewoong
    Koo, Hyung Il
    Cho, Nam Ik
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (08) : 1161 - 1165
  • [38] Do Chinese Readers Follow the National Standard Rules for Word Segmentation during Reading?
    Liu, Ping-Ping
    Li, Wei-Jun
    Lin, Nan
    Li, Xing-Shan
    PLOS ONE, 2013, 8 (02):
  • [39] Cross-learning in analytic word recognition without segmentation
    Choisy C.
    Belaïd A.
    International Journal on Document Analysis and Recognition, 2002, 4 (4) : 281 - 289
  • [40] Chinese word segmentation with local and global context representation learning
    李岩
    Zhang Yinghua
    Huang Xiaoping
    Yin Xucheng
    Hao Hongwei
    High Technology Letters, 2015, 21 (01) : 71 - 77