Identification of transliterated foreign words in Hebrew script

被引:0
|
作者
Goldberg, Yoav [1 ]
Elhadad, Michael [1 ]
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a loosely-supervised method for context-free identification of transliterated foreign names and borrowed words in Hebrew text. The method is purely statistical and does not require the use of any lexicons or linguistic analysis tool for the source languages (Hebrew, in our case). It also does not require any manually annotated data for training we learn from noisy data acquired by over-generation. We report precision/recall results of 80/82 for a corpus of 4044 unique words, containing 368 foreign words.
引用
收藏
页码:466 / 477
页数:12
相关论文
共 50 条
  • [21] A Hybrid Approach to Design Automatic Spelling Corrector and Converter for Transliterated Bangla Words
    Debnath, Tanmoy
    Sajnin, Sumaiya
    Hamid, Md Montaser
    2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
  • [22] Bag-of-Visual Words for Word-Wise Video Script Identification: A Study
    Sharma, Nabin
    Mandal, Ranju
    Sharma, Rabi
    Pal, Umapada
    Blumenstein, Michael
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [23] Foreign Words, Foreigners' Words
    Lecercle, Jean-Jacques
    REVUE LISA-LISA E-JOURNAL, 2015, 13 (01):
  • [24] GLOSSES IN GREEK SCRIPT AND LANGUAGE IN MEDIEVAL HEBREW MANUSCRIPTS
    de Lange, Nicholas
    Tchernetska, Natalie
    SCRIPTORIUM, 2014, 68 (02): : 253 - 264
  • [25] EGYPTIAN PAPYRI IN HEBREW SCRIPT - FRENCH - SIRAT,C
    HOPKINS, S
    JOURNAL OF SEMITIC STUDIES, 1990, 35 (01) : 153 - 156
  • [26] Hebrew-script tombstones from Jam, Afghanistan
    Hunter, Erica C. D.
    JOURNAL OF JEWISH STUDIES, 2010, 61 (01): : 72 - 87
  • [27] Reverse-Transliteration of Hebrew script for Entity Disambiguation
    Christianson, Aaron
    Dadvar, Maral
    Eckert, Kai
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 335 - 338
  • [28] 'Foreign Words'
    Beil, UJ
    POETRY, 1998, 173 (01) : 88 - 89
  • [29] Bangla Interrogative Sentence Identification from Transliterated Bangla Sentences
    Hamid, Md Montaser
    Alam, Tanvir
    Ismail, Sabir
    Rabbi, Md Forhad
    2018 INTERNATIONAL CONFERENCE ON BANGLA SPEECH AND LANGUAGE PROCESSING (ICBSLP), 2018,
  • [30] Proper Noun Recognition in Cross-Language Record Linkage by Exploiting Transliterated Words
    Song, Yuting
    Kimura, Taisuke
    Batjargal, Biligsaikhan
    Maeda, Akira
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 83 - 86