Construct chunk-level templates for improving rule-based machine translation

被引:0
|
作者
Yin, Dechun [1 ]
Zhang, Dakui [1 ]
机构
[1] School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
来源
关键词
Chinese corpus - Machine translation systems - Machine translations - Morphological analysis - Rule-based machine translations - Semiautomatic methods - Template-based - Translation templates;
D O I
10.12733/jcis6299
中图分类号
学科分类号
摘要
For improving the conventional rule-based machine translation by using translation templates, we propose a data-driven and semiautomatic method, which can semiautomatically extract translation templates at chunk level from the unannotated Chinese corpus. The method includes four steps: morphological analysis, chunk analysis, extract and refine. After extracting and preforming the preliminary templates, we manually add English translations for them and then get the ultimate templates, which are used in a template-based machine translation system. The experimental results show that the method is effective to improve the quality of machine translation, and that the template-based machine translation system outperforms the conventional rule-based machine translation system without templates. 1553-9105/Copyright © 2013 Binary Information Press.
引用
收藏
页码:5505 / 5512
相关论文
共 50 条
  • [41] A Granular Way to Construct a Rule-Based Fuzzy Hierarchical Model
    Niu, Pian
    Song, Ming-Li
    Liang, Chao
    FUZZY SYSTEM AND DATA MINING, 2016, 281 : 113 - 121
  • [42] PRE-PROCESSING TASKS FOR RULE-BASED ENGLISH-KOREAN MACHINE TRANSLATION SYSTEM
    Kim, Sung-Dong
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1, 2011, : 257 - 262
  • [43] Shallow-transfer rule-based machine translation for the Western group of South Slavic languages
    Peradin, Hrvoje
    Petkovsky, Filip
    Tyers, Francis M.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [44] Learning from Chunk-based Feedback in Neural Machine Translation
    Petrushkov, Pavel
    Khadivi, Shahram
    Matusov, Evgeny
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2018, : 326 - 331
  • [45] Improving the compromise between accuracy, interpretability and personalization of rule-based machine learning in medical problems
    Valente, Francisco
    Henriques, Jorge
    Paredes, Simao
    Rocha, Teresa
    de Carvalho, Paulo
    Morais, Joao
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 2132 - 2135
  • [46] A hybrid approach to interactive machine translation - Integrating rule-based, corpus-based, and example-based method
    Yamabana, K
    Kamei, S
    Muraki, K
    Doi, S
    Tamura, S
    Satoh, K
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 977 - 982
  • [47] Unsupervised Weighting of Transfer Rules in Rule-Based Machine Translation using Maximum-Entropy Approach
    Bayatli, Sevilay
    Kurnaz, Sefer
    Ali, Aboelhamd
    Washington, Jonathan North
    Tyers, Francis M.
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (02) : 309 - 322
  • [48] Improving the Rule based Machine Translation System using Sentence Simplification (English to Tamil)
    Kavirajan, B.
    Kumar, Anand M.
    Soman, K. P.
    Rajendran, S.
    Vaithehi, S.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 957 - 963
  • [49] IMPROVING LEARNING OF GENETIC RULE-BASED CLASSIFIER SYSTEMS
    MCAULAY, AD
    OH, JC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (01): : 152 - 159
  • [50] Automatic and human evaluation study of a rule-based and a statistical Catalan-Spanish machine translation systems
    Costa-jussa, Marta R.
    Farrus, Mireia
    Marino, Jose B.
    Fonollosa, Jose A. R.
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1706 - 1711