Learning word segmentation rules for tag prediction

被引:0
|
作者
Kazakov, D [1 ]
Manandhar, S
Erjavec, T
机构
[1] Univ York, York YO10 5DD, N Yorkshire, England
[2] Jozef Stefan Inst, Dept Intelligent Syst, Ljubljana, Slovenia
来源
INDUCTIVE LOGIC PROGRAMMING | 1999年 / 1634卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our previous work we introduced a hybrid, GA&ILP-based approach for learning of stem-suffix segmentation rules from an unmarked list of words. Evaluation of the method was made difficult by the lack of word corpora annotated with their morphological segmentation. Here the hybrid approach is evaluated indirectly, on the task of tag prediction. A pair of stem-tag and suffix-tag lexicons is obtained by the application of that approach to an annotated lexicon of word-tag pairs. The two lexicons are then used to predict the tags of unseen words in two ways, (1) by using only the stem and suffix generated by the segmentation rules, and (2) for all matching combinations of stem and suffix present in the lexicons. The results show high correlation between the constituents generated by the segmentation rules, and the tags of the words in which they appear, thereby demonstrating the linguistic relevance of the segmentations produced by the hybrid approach.
引用
收藏
页码:152 / 161
页数:10
相关论文
共 50 条
  • [41] Chinese word segmentation with local and global context representation learning
    School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing
    100083, China
    不详
    100190, China
    High Technol Letters, 1 (71-77):
  • [42] COMPUTATIONAL LEARNING AND LANGUAGE ACQUISITION: A VIEW FROM WORD SEGMENTATION
    Gambell, Timothy
    Yang, Charles
    LINGUE E LINGUAGGIO, 2007, 6 (02) : 139 - 150
  • [43] An Unsupervised Learning and Statistical Approach for Vietnamese Word Recognition and Segmentation
    Trung, Hieu Le
    Vu Le Anh
    Trung, Kien Le
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT II, PROCEEDINGS, 2010, 5991 : 195 - +
  • [44] Pre-training with Meta Learning for Chinese Word Segmentation
    Ke, Zhen
    Shi, Liang
    Sun, Songtao
    Meng, Erli
    Wang, Bin
    Qiu, Xipeng
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5514 - 5523
  • [45] Predictive Feature Learning for Future Segmentation Prediction
    Lin, Zihang
    Sun, Jiangxin
    Hu, Jian-Fang
    Yu, Qizhi
    Lai, Jian-Huang
    Zheng, Wei-Shi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7345 - 7354
  • [46] Tag Recommendation by Word-Level Tag Sequence Modeling
    Shi, Xuewen
    Huang, Heyan
    Zhao, Shuyang
    Jian, Ping
    Tang, Yi-Kun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 420 - 424
  • [47] Incidental learning of abstract rules for non-dominant word orders
    Francis, Andrea P.
    Schmidt, Gwen L.
    Carr, Thomas H.
    Clegg, Benjamin A.
    PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 2009, 73 (01): : 60 - 74
  • [48] Incidental learning of abstract rules for non-dominant word orders
    Andrea P. Francis
    Gwen L. Schmidt
    Thomas H. Carr
    Benjamin A. Clegg
    Psychological Research, 2009, 73 : 60 - 74
  • [49] Machine Learning Methods for Word Prediction in Brasilian Portuguese
    Palazuelos-Cagigas, Sira E.
    Martin-Sanchez, Jose L.
    Macias-Guarasa, Javier
    Garcia-Garcia, Juan C.
    Cavalieri, Daniel. C.
    Bastos-Filho, Teodiano F.
    Sarcinelli-Filho, Mario
    EVERYDAY TECHNOLOGY FOR INDEPENDENCE AND CARE, 2011, 29 : 424 - 431
  • [50] Machine Learning and Clinical Prediction Rules: A Perfect Match?
    Chamberlain, James M.
    Chamberlain, Daniel B.
    Zorc, Joseph J.
    PEDIATRICS, 2020, 146 (03)