Parser combinators for Tigrinya and Oromo morphology

被引:0
|
作者
Littell, Patrick [1 ]
McCoy, Tom [2 ]
Han, Na-Rae [3 ]
Rijhwani, Shruti [4 ]
Sheikh, Zaid [4 ]
Mortensen, David [4 ]
Mitamura, Teruko [4 ]
Levin, Lori [4 ]
机构
[1] Natl Res Council Canada, Digital Technol, 1200 Montreal Rd, Ottawa, ON K1A 0R6, Canada
[2] Johns Hopkins Univ, Dept Cognit Sci, 3400 N Charles St, Baltimore, MD 21218 USA
[3] Univ Pittsburgh, Dept Linguist, 2816 Cathedral Learning, Pittsburgh, PA 15260 USA
[4] Carnegie Mellon Univ, Language Technol Inst, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
关键词
morphology; parsing; Tigrinya; Oromo;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present rule-based morphological parsers in the Tigrinya and Oromo languages, based on a parser-combinator rather than finite-state paradigm. This paradigm allows rapid development and ease of integration with other systems, although at the cost of non-optimal theoretical efficiency. These parsers produce multiple output representations simultaneously, including lemmatization, morphological segmentation, and an English word-for-word gloss, and we evaluate these representations as input for entity detection and linking and humanitarian need detection.
引用
收藏
页码:3867 / 3874
页数:8
相关论文
共 22 条
  • [21] Explicitly Recursive Grammar Combinators A Better Model for Shallow Parser DSLs
    Devriese, Dominique
    Piessens, Frank
    PRACTICAL ASPECTS OF DECLARATIVE LANGUAGES, 2011, 6539 : 84 - 98
  • [22] Nom, a byte oriented, streaming, zero copy, parser combinators library in Rust
    Couprie, Geoffoy
    2015 IEEE SECURITY AND PRIVACY WORKSHOPS (SPW), 2015, : 142 - 148