A hybrid model for extracting transliteration equivalents from parallel corpora

被引:0
|
作者
Oh, Jong-Hoon
Choi, Key-Sun
Isahara, Hitoshi
机构
[1] NICT, Computat Linguist Grp, Kyoto 6190289, Japan
[2] Korea Adv Inst Sci & Technol, EECS, Div Comp Sci, Taejon 305701, South Korea
来源
TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2006年 / 4188卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations. To date, however, there has been little literature regarding a framework that can accommodate several models at the same time. Moreover, there is little concern for validating acquired transliteration pairs using up-to-date corpora, such as web documents. To address these problems, we propose a hybrid model for transliteration pair acquisition. In this paper, we concentrate on a framework for combining several models for transliteration pair acquisition. Experiments showed that our hybrid model was more effective than each individual transliteration pair acquisition model alone.
引用
收藏
页码:119 / 126
页数:8
相关论文
共 50 条
  • [41] Extracting parallel fragments from comparable documents using a generative model
    Bakhshaei, Somayeh
    Safabakhsh, Reza
    Khadivi, Shahram
    COMPUTER SPEECH AND LANGUAGE, 2019, 53 : 25 - 42
  • [42] A Process for Extracting Knowledge Base for Chatbots from Text Corpora
    Krassmann, Aliane Loureiro
    Flach, Joao Marcos
    Cestari da Silva Grando, Anita Raquel
    Rockenbach Tarouco, Liane Margarida
    Bercht, Magda
    PROCEEDINGS OF 2019 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2019, : 322 - 329
  • [43] Extracting Semantic Frames from Specialized Corpora for Lexicographic Purposes
    Sanchez-Cardenas, Beatriz
    CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2024, (99): : 163 - 177
  • [44] Extracting Privileged Information from Untagged Corpora for Classifier Learning
    Yao, Yazhou
    Zhang, Jian
    Shen, Fumin
    Yang, Wankou
    Hua, Xian-Sheng
    Tang, Zhenmin
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1085 - 1091
  • [45] Automatic creation of WordNets from parallel corpora
    Oliver, Antoni
    Climent, Salvador
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1112 - 1116
  • [46] Novelty Extraction from Special and Parallel Corpora
    Dura, Elzbieta
    Gawronska, Barbara
    HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 291 - 302
  • [47] Building English - Punjabi Aligned Parallel Corpora of Nouns from Comparable Corpora
    Kaur, Dilshad
    Singh, Satwinder
    APPLIED COMPUTER SYSTEMS, 2023, 28 (02) : 245 - 251
  • [48] Acquisition of translation rules from parallel corpora
    Matsumoto, Y
    Kitamura, M
    RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING, 1997, 136 : 405 - 416
  • [49] Extracting paraphrases from a parallel corpus
    Barzilay, R
    McKeown, KR
    39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 50 - 57
  • [50] A Parallel Model for Jointly Extracting Entities and Relations
    Chen, Zuqin
    Zheng, Yujie
    Ge, Jike
    Yu, Wencheng
    Wang, Zining
    NEURAL PROCESSING LETTERS, 2024, 56 (04)