A hybrid model for extracting transliteration equivalents from parallel corpora

被引：0

作者：

Oh, Jong-Hoon

Choi, Key-Sun

Isahara, Hitoshi

机构：

[1] NICT, Computat Linguist Grp, Kyoto 6190289, Japan

[2] Korea Adv Inst Sci & Technol, EECS, Div Comp Sci, Taejon 305701, South Korea

来源：

TEXT, SPEECH AND DIALOGUE, PROCEEDINGS | 2006年 / 4188卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Several models for transliteration pair acquisition have been proposed to overcome the out-of-vocabulary problem caused by transliterations. To date, however, there has been little literature regarding a framework that can accommodate several models at the same time. Moreover, there is little concern for validating acquired transliteration pairs using up-to-date corpora, such as web documents. To address these problems, we propose a hybrid model for transliteration pair acquisition. In this paper, we concentrate on a framework for combining several models for transliteration pair acquisition. Experiments showed that our hybrid model was more effective than each individual transliteration pair acquisition model alone.

引用

页码：119 / 126

页数：8

共 50 条

[41] Extracting parallel fragments from comparable documents using a generative model
Bakhshaei, Somayeh
Safabakhsh, Reza
Khadivi, Shahram
COMPUTER SPEECH AND LANGUAGE, 2019, 53 : 25 - 42
[42] A Process for Extracting Knowledge Base for Chatbots from Text Corpora
Krassmann, Aliane Loureiro
Flach, Joao Marcos
Cestari da Silva Grando, Anita Raquel
Rockenbach Tarouco, Liane Margarida
Bercht, Magda
PROCEEDINGS OF 2019 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2019, : 322 - 329
[43] Extracting Semantic Frames from Specialized Corpora for Lexicographic Purposes
Sanchez-Cardenas, Beatriz
CIRCULO DE LINGUISTICA APLICADA A LA COMUNICACION, 2024, (99): : 163 - 177
[44] Extracting Privileged Information from Untagged Corpora for Classifier Learning
Yao, Yazhou
Zhang, Jian
Shen, Fumin
Yang, Wankou
Hua, Xian-Sheng
Tang, Zhenmin
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1085 - 1091
[45] Automatic creation of WordNets from parallel corpora
Oliver, Antoni
Climent, Salvador
LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1112 - 1116
[46] Novelty Extraction from Special and Parallel Corpora
Dura, Elzbieta
Gawronska, Barbara
HUMAN LANGUAGE TECHNOLOGY: CHALLENGES OF THE INFORMATION SOCIETY, 2009, 5603 : 291 - 302
[47] Building English - Punjabi Aligned Parallel Corpora of Nouns from Comparable Corpora
Kaur, Dilshad
Singh, Satwinder
APPLIED COMPUTER SYSTEMS, 2023, 28 (02) : 245 - 251
[48] Acquisition of translation rules from parallel corpora
Matsumoto, Y
Kitamura, M
RECENT ADVANCES IN NATURAL LANGUAGE PROCESSING, 1997, 136 : 405 - 416
[49] Extracting paraphrases from a parallel corpus
Barzilay, R
McKeown, KR
39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, : 50 - 57
[50] A Parallel Model for Jointly Extracting Entities and Relations
Chen, Zuqin
Zheng, Yujie
Ge, Jike
Yu, Wencheng
Wang, Zining
NEURAL PROCESSING LETTERS, 2024, 56 (04)

← 1 2 3 4 5 →