Unsupervised morphological parsing of Bengali

被引:15
|
作者
Dasgupta, Sajib [1 ]
Ng, Vincent [1 ]
机构
[1] Univ Texas, Human Language Technol Res Inst, Richardson, TX 75083 USA
关键词
morphological parsing; word segmentation; data annotation; unsupervised learning; Asian language processing; Bengali;
D O I
10.1007/s10579-007-9031-y
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Unsupervised morphological analysis is the task of segmenting words into prefixes, suffixes and stems without prior knowledge of language-specific morphotactics and morpho-phonological rules. This paper introduces a simple, yet highly effective algorithm for unsupervised morphological learning for Bengali, an Indo-Aryan language that is highly inflectional in nature. When evaluated on a set of 4,110 human-segmented Bengali words, our algorithm achieves an F-score of 83%, substantially outperforming Linguistica, one of the most widely-used unsupervised morphological parsers, by about 23%.
引用
收藏
页码:311 / 330
页数:20
相关论文
共 50 条
  • [31] From Paraphrasing to Semantic Parsing: Unsupervised Semantic Parsing via Synchronous Semantic Decoding
    Wu, Shan
    Chen, Bo
    Xin, Chunlei
    Han, Xianpei
    Sun, Le
    Zhang, Weipeng
    Chen, Jiansong
    Yang, Fan
    Cai, Xunliang
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 5110 - 5121
  • [32] Unsupervised Morphological Relatedness
    Khorsi, Ahmed
    Alsheddi, Abeer
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (10) : 348 - 355
  • [33] Unsupervised Discourse Constituency Parsing Using Viterbi EM
    Nishida, Noriki
    Nakayama, Hideki
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 (08) : 215 - 230
  • [34] Unsupervised method for parsing coordinated base noun phrases
    Rus, Vasile
    Ravi, Sireesha
    Lintean, Mihai C.
    McCarthy, Philip M.
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 229 - +
  • [35] Split or Merge: Which is Better for Unsupervised RST Parsing?
    Kobayashi, Naoki
    Hirao, Tsutomu
    Nakamura, Kengo
    Kamigaito, Hidetaka
    Okumura, Manabu
    Nagata, Masaaki
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5797 - 5802
  • [36] Unsupervised Hierarchical Dynamic Parsing and Encoding for Action Recognition
    Su, Bing
    Zhou, Jiahuan
    Ding, Xiaoqing
    Wu, Ying
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (12) : 5784 - 5799
  • [37] Unsupervised Person Image Generation with Semantic Parsing Transformation
    Song, Sijie
    Zhang, Wei
    Liu, Jiaying
    Mei, Tao
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2352 - 2361
  • [38] Iterative Integration of Unsupervised Features for Chinese Dependency Parsing
    Luo, Te
    Zhang, Yujie
    Xu, Jinan
    Chen, Yufeng
    NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 531 - 540
  • [39] Nighttime Road Scene Parsing by Unsupervised Domain Adaptation
    Song, Can
    Wu, Jin
    Zhu, Lei
    Zhang, Mei
    Ling, Haibin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (04) : 3244 - 3255
  • [40] Is shallow parsing useful for unsupervised learning of semantic clusters?
    Reinberger, ML
    Daelemans, W
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, 2003, 2588 : 304 - 313