CUDA-based Parallel Implementation of IBM Word Alignment Algorithm for Statistical Machine Translation

Authors
Jing, Si-Yuan [1]
Yan, Gao-Rong [2]
Chen, Xing-Yuan [1]
Jin, Peng [1]
Guo, Zhao-Yi [1]
Affiliations
[1] Leshan Normal Univ, Sch Comp Sci, Leshan, Peoples R China
[2] Leshan Normal Univ, Sch Foreign Language, Leshan, Peoples R China
Keywords
Word Alignment; GPU; Parallel Computation; Expectation-Maximization Algorithm; CUDA
DOI
10.1109/PDCAT.2016.49
Chinese Library Classification
TP3 [Computing Technology, Computer Technology]
Discipline classification code
0812
Abstract
Word alignment is a basic task in natural language processing and usually serves as the starting point when building a modern statistical machine translation system. However, state-of-the-art parallel algorithms for word alignment are still time-consuming. In this work, we explore a parallel implementation of a word alignment algorithm on the Graphics Processing Unit (GPU), which is now widely available in high-performance computing. We use the Compute Unified Device Architecture (CUDA) programming model to re-implement a state-of-the-art word alignment algorithm, the IBM Expectation-Maximization (EM) algorithm. Experiments use a Tesla K40M card with 2880 cores, and the execution times of the proposed algorithm are compared with those of a sequential algorithm and a multi-threaded algorithm on an IBM X3850 server with two Intel Xeon E7 CPUs (2.0 GHz × 10 cores). The best experimental results show a 16.8-fold speedup over the multi-threaded algorithm and a 234.7-fold speedup over the sequential algorithm.
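For orientation, the kind of EM procedure the abstract refers to can be sketched with IBM Model 1, the simplest of the IBM alignment models. This is a minimal sequential sketch in Python, not the authors' CUDA implementation: the abstract does not say which IBM model or kernel design the paper uses, and the function name `ibm_model1`, the toy corpus, and the iteration count are all assumptions made for illustration.

```python
from collections import defaultdict


def ibm_model1(corpus, iterations=20):
    """Estimate t(target_word | source_word) with IBM Model 1 EM.

    corpus: list of (source_tokens, target_tokens) sentence pairs.
    Returns a dict keyed by (target_word, source_word).
    Illustrative sketch only; no NULL word, no smoothing.
    """
    target_vocab = {tw for _, tgt in corpus for tw in tgt}
    uniform = 1.0 / len(target_vocab)

    # Initialise t uniformly over every co-occurring word pair.
    t = {}
    for src, tgt in corpus:
        for tw in tgt:
            for sw in src:
                t[(tw, sw)] = uniform

    for _ in range(iterations):
        count = defaultdict(float)  # expected count of (tw, sw) links
        total = defaultdict(float)  # expected count of links from sw

        # E-step: each sentence pair contributes independently,
        # which is the part that lends itself to parallel execution.
        for src, tgt in corpus:
            for tw in tgt:
                z = sum(t[(tw, sw)] for sw in src)
                for sw in src:
                    c = t[(tw, sw)] / z
                    count[(tw, sw)] += c
                    total[sw] += c

        # M-step: renormalise expected counts per source word.
        for (tw, sw), c in count.items():
            t[(tw, sw)] = c / total[sw]

    return t


# Hypothetical toy corpus (source, target), lowercased.
corpus = [
    (["das", "haus"], ["the", "house"]),
    (["das", "buch"], ["the", "book"]),
    (["ein", "buch"], ["a", "book"]),
]
t = ibm_model1(corpus)
```

On this toy corpus the table `t` shifts probability mass toward the intuitive pairs, for example `t[("house", "haus")]` ends up larger than `t[("the", "haus")]`. The E-step loop over sentence pairs has no cross-pair dependencies apart from the accumulation into `count` and `total`, so a GPU version would presumably assign sentence pairs to threads and combine the counts with atomic or reduction operations; that mapping is an assumption here, not something stated in the abstract.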
Pages: 189-194
Number of pages: 6