Identification of transliterated foreign words in Hebrew script

Authors
Goldberg, Yoav [1 ]
Elhadad, Michael [1 ]
Affiliations
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We present a loosely supervised method for context-free identification of transliterated foreign names and borrowed words in Hebrew text. The method is purely statistical and does not require any lexicons or linguistic analysis tools for the source language (Hebrew, in our case). It also does not require any manually annotated training data: we learn from noisy data acquired by over-generation. We report precision/recall results of 80/82 on a corpus of 4044 unique words, 368 of which are foreign.
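The abstract does not say which statistical model is used. As a rough illustration of what a purely statistical, context-free classifier of this kind could look like, the sketch below compares two character bigram models, one trained on native words and one on foreign-looking words. Everything in it is an assumption made for illustration, not the authors' implementation: the class `CharBigramModel`, the function `is_foreign`, the add-one smoothing, and the two toy word lists. In particular, the hand-written `FOREIGN_WORDS` list merely stands in for the noisy, over-generated training data the abstract mentions.

```python
import math
from collections import Counter


class CharBigramModel:
    """Character bigram model with add-one smoothing (illustrative only)."""

    def __init__(self, words):
        self.bigrams = Counter()   # counts of (left, right) character pairs
        self.contexts = Counter()  # counts of the left character alone
        self.alphabet = set()
        for word in words:
            padded = "^" + word + "$"  # mark word start and end
            self.alphabet.update(padded)
            for left, right in zip(padded, padded[1:]):
                self.bigrams[left + right] += 1
                self.contexts[left] += 1

    def log_prob(self, word, alphabet_size):
        """Smoothed log-probability of the word's character sequence."""
        padded = "^" + word + "$"
        total = 0.0
        for left, right in zip(padded, padded[1:]):
            numerator = self.bigrams[left + right] + 1
            denominator = self.contexts[left] + alphabet_size
            total += math.log(numerator / denominator)
        return total


def is_foreign(word, native_model, foreign_model):
    """Label a word foreign if the foreign model scores it higher.

    The decision uses only the word's own characters (no sentence context).
    """
    size = len(native_model.alphabet | foreign_model.alphabet)
    return foreign_model.log_prob(word, size) > native_model.log_prob(word, size)


# Toy data, assumed for illustration; a real system would need far more.
NATIVE_WORDS = ["שלום", "ספר", "ילד", "בית", "מלך"]
FOREIGN_WORDS = ["טלפון", "רדיו", "אינטרנט", "טלוויזיה"]

native_model = CharBigramModel(NATIVE_WORDS)
foreign_model = CharBigramModel(FOREIGN_WORDS)

for candidate in (FOREIGN_WORDS[0], NATIVE_WORDS[1]):
    print(candidate, is_foreign(candidate, native_model, foreign_model))
```

With word lists this small the decision only reflects which list a word's letter pairs resemble; the reported 80/82 figures come from the paper's own model and data, not from anything like this toy setup.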
Pages: 466-477
Page count: 12