A hybrid model for extracting transliteration equivalents from parallel corpora

Authors
Oh, Jong-Hoon
Choi, Key-Sun
Isahara, Hitoshi
Affiliations
[1] NICT, Computat Linguist Grp, Kyoto 6190289, Japan
[2] Korea Adv Inst Sci & Technol, EECS, Div Comp Sci, Taejon 305701, South Korea
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations. To date, however, little work has addressed a framework that can accommodate several such models at the same time. Moreover, little attention has been paid to validating acquired transliteration pairs against up-to-date corpora, such as web documents. To address these problems, we propose a hybrid model for transliteration pair acquisition, concentrating on a framework for combining several acquisition models. Experiments showed that our hybrid model was more effective than any individual transliteration pair acquisition model alone.
Pages: 119-126
Page count: 8