An English-Hindi statistical machine translation system

被引:0
|
作者
Udupa, R [1 ]
Faruquie, TA [1 ]
机构
[1] IBM Corp, India Res Lab, New Delhi 110016, India
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently statistical methods for natural language translation have become popular and found reasonable success. In this paper we describe an English-Hindi statistical machine translation system. Our machine translation system is based on IBM Models 1, 2, and 3. We present experimental results on an English-Hindi parallel corpus consisting of 150,000 sentence pairs. We propose two new algorithms for the transfer of fertility parameters from Model 2 to Model 3. Our algorithms have a worst case time complexity of O(m(3)) improving on the exponential time algorithm proposed in the classical paper on IBM Models. When the maximum fertility of a word is small, our algorithms are O(m(2)) and hence very efficient in practice.
引用
收藏
页码:254 / 262
页数:9
相关论文
共 50 条
  • [31] Data Issues in English-to-Hindi Machine Translation
    Bojar, Ondrej
    Stranak, Pavel
    Zeman, Daniel
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1771 - 1777
  • [32] English-Hindi Transliteration using Multiple Similarity Metrics
    Aswani, Niraj
    Gaizauskas, Robert
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 1786 - 1793
  • [33] Dealing with mixing of English verbs in Hindi for machine translation
    Sinha, RMK
    ICAI '05: PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 2005, : 773 - 778
  • [34] HindEnCorp - Hindi-English and Hindi-only Corpus for Machine Translation
    Bojar, Ondrej
    Diatka, Vojtech
    Rychly, Pavel
    Stranak, Pavel
    Suchomel, Vit
    Tamchyna, Ales
    Zeman, Daniel
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 3550 - 3555
  • [35] Hybrid Appraoch to English-Hindi Name Entity Transliteration
    Mathur, Shruti
    Saxena, Varun Prakash
    2014 IEEE STUDENTS' CONFERENCE ON ELECTRICAL, ELECTRONICS AND COMPUTER SCIENCE (SCEECS), 2014,
  • [36] A Model for English to Urdu and Hindi Machine Translation System using Translation Rules and Artificial Neural Network
    Khan, Shahnawaz
    Usman, Imran
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (01) : 125 - 131
  • [37] An Improvement in Statistical Machine Translation in Perspective of Hindi-English Cross-Lingual Information Retrieval
    Sharma, Vijay Kumar
    Mittal, Namita
    COMPUTACION Y SISTEMAS, 2018, 22 (04): : 1277 - 1285
  • [38] Hindi Visual Genome: A Dataset for Multi-Modal English to Hindi Machine Translation
    Parida, Shantipriya
    Bojar, Ondrej
    Dash, Satya Ranjan
    COMPUTACION Y SISTEMAS, 2019, 23 (04): : 1499 - 1505
  • [39] Interpreting unknown words in machine translation from Hindi to English
    Sinha, RMK
    Proceedings of the IASTED International Conference on Computational Intelligence, 2005, : 278 - 282
  • [40] Factored Statistical Machine Translation System for English to Tamil Language
    Anand, Kumar M.
    Dhanalakshmi
    Soman, K. P.
    Rajendran, S.
    PERTANIKA JOURNAL OF SOCIAL SCIENCE AND HUMANITIES, 2014, 22 (04): : 1045 - 1061