Learning word segmentation rules for tag prediction

被引:0
|
作者
Kazakov, D [1 ]
Manandhar, S
Erjavec, T
机构
[1] Univ York, York YO10 5DD, N Yorkshire, England
[2] Jozef Stefan Inst, Dept Intelligent Syst, Ljubljana, Slovenia
来源
INDUCTIVE LOGIC PROGRAMMING | 1999年 / 1634卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In our previous work we introduced a hybrid, GA&ILP-based approach for learning of stem-suffix segmentation rules from an unmarked list of words. Evaluation of the method was made difficult by the lack of word corpora annotated with their morphological segmentation. Here the hybrid approach is evaluated indirectly, on the task of tag prediction. A pair of stem-tag and suffix-tag lexicons is obtained by the application of that approach to an annotated lexicon of word-tag pairs. The two lexicons are then used to predict the tags of unseen words in two ways, (1) by using only the stem and suffix generated by the segmentation rules, and (2) for all matching combinations of stem and suffix present in the lexicons. The results show high correlation between the constituents generated by the segmentation rules, and the tags of the words in which they appear, thereby demonstrating the linguistic relevance of the segmentations produced by the hybrid approach.
引用
收藏
页码:152 / 161
页数:10
相关论文
共 50 条
  • [1] Recurrent Neural Word Segmentation with Tag Inference
    Zhou, Qianrong
    Ma, Long
    Zheng, Zhenyu
    Wang, Yue
    Wang, Xiaojie
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 734 - 743
  • [2] Unsupervised Learning of Word Segmentation Rules with Genetic Algorithms and Inductive Logic Programming
    Dimitar Kazakov
    Suresh Manandhar
    Machine Learning, 2001, 43 : 121 - 162
  • [3] Unsupervised learning of word segmentation rules with genetic algorithms and inductive logic programming
    Kazakov, D
    Manandhar, S
    MACHINE LEARNING, 2001, 43 (1-2) : 121 - 162
  • [4] Word segmentation as word learning: Integrating meaning learning with distributional cues in segmentation
    Frank, Michael C.
    Mansinghka, Vikash
    Gibson, Edward
    Tenenbaum, Joshua B.
    PROCEEDINGS OF THE 31ST ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT VOLS 1 AND 2, 2007, : 218 - +
  • [5] Learning to Tag Text from Rules and Examples
    Diligenti, Michelangelo
    Gori, Marco
    Maggini, Marco
    AI(STAR)IA 2011: ARTIFICIAL INTELLIGENCE AROUND MAN AND BEYOND, 2011, 6934 : 45 - 56
  • [6] Using Language Rules to Improve the Performance of Word Segmentation
    Sun, Rong
    Zhou, Wen
    Liu, Zongtian
    2013 6TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), VOLS 1-3, 2013, : 1665 - 1669
  • [7] Neural Word Segmentation Learning for Chinese
    Cai, Deng
    Zhao, Hai
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 409 - 420
  • [8] Word Segmentation Method Based on Inductive Learning and Segmentation Rule
    Wang, Zhongjian
    Araki, Kenji
    Tochinai, Koji
    PROCEEDINGS OF THE 2008 INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN, VOL 1, 2008, : 95 - +
  • [9] Statistical Speech Segmentation and Word Learning in Parallel
    Yurovsky, Daniel
    Yu, Chen
    Smith, Linda B.
    PROCEEDINGS OF THE 34TH ANNUAL BOSTON UNIVERSITY CONFERENCE ON LANGUAGE DEVELOPMENT, VOLS 1 AND 2, 2010, : 491 - 502
  • [10] A Chinese Word Segmentation Based on Machine Learning
    Wang Hongsheng
    Cui Mingming
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 610 - 613