Unsupervised morphological parsing of Bengali

Cited by: 15
Authors
Dasgupta, Sajib [1]
Ng, Vincent [1]
Affiliations
[1] Univ Texas, Human Language Technol Res Inst, Richardson, TX 75083 USA
Keywords
morphological parsing; word segmentation; data annotation; unsupervised learning; Asian language processing; Bengali
DOI
10.1007/s10579-007-9031-y
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Unsupervised morphological analysis is the task of segmenting words into prefixes, suffixes and stems without prior knowledge of language-specific morphotactics and morpho-phonological rules. This paper introduces a simple yet highly effective algorithm for unsupervised morphological learning for Bengali, a highly inflectional Indo-Aryan language. When evaluated on a set of 4,110 human-segmented Bengali words, our algorithm achieves an F-score of 83%, substantially outperforming Linguistica, one of the most widely used unsupervised morphological parsers, by about 23%.
Pages: 311-330
Number of pages: 20