CUDA-based Parallel Implementation of IBM Word Alignment Algorithm for Statistical Machine Translation

被引:0
|
作者
Jing, Si-Yuan [1 ]
Yan, Gao-Rong [2 ]
Chen, Xing-Yuan [1 ]
Jin, Peng [1 ]
Guo, Zhao-Yi [1 ]
机构
[1] Leshan Normal Univ, Sch Comp Sci, Leshan, Peoples R China
[2] Leshan Normal Univ, Sch Foreign Language, Leshan, Peoples R China
关键词
Word Alignment; GPU; Parallel Computation; Expectation-Maximization Algorithm; CUDA;
D O I
10.1109/PDCAT.2016.49
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Word alignment is a basic task in natural language processing and it usually serves as the starting point when building a modern statistical machine translation system. However, the state-of-art parallel algorithm for word alignment is still time-consuming. In this work, we explore a parallel implementation of word alignment algorithm on Graphics Processor Unit (GPU), which has been widely available in the field of high performance computing. We use the Compute Unified Device Architecture (CUDA) programming model to re-implement a state-of-the-art word alignment algorithm, called IBM Expectation-Maximization (EM) algorithm. A Tesla K40M card with 2880 cores is used for experiments and execution times obtained with the proposed algorithm are compared with a sequential algorithm and a multi-threads algorithm on an IBM X3850 server, which has two Intel Xeon E7 CPUs (2.0GHz * 10 cores). The best experimental results show a 16.8-fold speedup compared to the multi-threads algorithm and a 234.7-fold speedup compared to the sequential algorithm.
引用
收藏
页码:189 / 194
页数:6
相关论文
共 50 条
  • [41] CUDA-based Acceleration and Algorithm Refinement for Volume Image Registration
    Chen, Shifu
    Qin, Jing
    Xie, Yongming
    Pang, Wai-Man
    Heng, Pheng-Ann
    2009 INTERNATIONAL CONFERENCE ON FUTURE BIOMEDICAL INFORMATION ENGINEERING (FBIE 2009), 2009, : 544 - +
  • [42] Study on CUDA-based Heterogeneous Parallel for Advanced Assembly Neutronics Program
    Zheng Y.
    Lu W.
    Ma Y.
    Cui X.
    Guo F.
    Ma D.
    Tu X.
    Hedongli Gongcheng/Nuclear Power Engineering, 2021, 42 : 124 - 129
  • [43] A Parallel Many-core CUDA-based Graph Labeling Computation
    Quer, Stefano
    ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 597 - 605
  • [44] GPU-NB: A Fast CUDA-based Implementation of Naive Bayes
    Viegas, Felipe
    Andrade, Guilherme
    Almeida, Jussara
    Ferreira, Renato
    Goncalves, Marcos
    Ramos, Gabriel
    Rocha, Leonardo
    2013 25TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING (SBAC-PAD), 2013, : 168 - 175
  • [45] The study of CUDA-based ELA algorithm for de-interlacing
    Gu, Yuwan
    Shi, Guodong
    Zhao, Dean
    Sun, Yuqiang
    International Review on Computers and Software, 2012, 7 (06) : 3042 - 3046
  • [46] CUK-Band: A CUDA-Based Multiple Genomic Sequence Alignment on GPU
    Kong, Xianghua
    Shen, Cong
    Tang, Jijun
    ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT II, ICIC 2024, 2024, 14882 : 84 - 95
  • [47] Phrase-based alignment models for statistical machine translation
    Tomás, J
    Lloret, J
    Casacuberta, F
    PATTERN RECOGNITION AND IMAGE ANALYSIS, PT 2, PROCEEDINGS, 2005, 3523 : 605 - 613
  • [48] Refining Kazakh Word Alignment Using Simulation Modeling Methods for Statistical Machine Translation
    Kartbayev, Amandyk
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2015, 2015, 9362 : 421 - 427
  • [49] On the Word Alignment from Neural Machine Translation
    Li, Xintong
    Li, Guanlin
    Liu, Lemao
    Meng, Max
    Shi, Shuming
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1293 - 1303
  • [50] Structural Supervision for Word Alignment and Machine Translation
    Li, Lei
    Fan, Kai
    Li, Hongjia
    Yuan, Chun
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 4084 - 4094