An efficient implementation of Bailey and Borwein's algorithm for parallel random number generation on graphics processing units

被引:2
|
作者
Beliakov, Gleb [1 ]
Johnstone, Michael [2 ]
Creighton, Doug [2 ]
Wilkin, Tim [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Burwood 3125, Australia
[2] Deakin Univ, Ctr Intelligent Syst Res, Geelong, Vic 3217, Australia
关键词
GPU; Random number generation; Normal numbers;
D O I
10.1007/s00607-012-0234-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Pseudorandom number generators are required for many computational tasks, such as stochastic modelling and simulation. This paper investigates the serial and parallel implementation of a Linear Congruential Generator for Graphics Processing Units (GPU) based on the binary representation of the normal number . We adapted two methods of modular reduction which allowed us to perform most operations in 64-bit integer arithmetic, improving on the original implementation based on 106-bit double-double operations, which resulted in four-fold increase in efficiency. We found that our implementation is faster than existing methods in literature, and our generation rate is close to the limiting rate imposed by the efficiency of writing to a GPU's global memory.
引用
收藏
页码:309 / 326
页数:18
相关论文
共 50 条
  • [1] An efficient implementation of Bailey and Borwein’s algorithm for parallel random number generation on graphics processing units
    Gleb Beliakov
    Michael Johnstone
    Doug Creighton
    Tim Wilkin
    Computing, 2013, 95 : 309 - 326
  • [2] Efficient implementation of Bailey and Borwein pseudo-random number generator based on normal numbers
    Beliakov, G.
    Creighton, D.
    Johnstone, M.
    Wilkin, T.
    COMPUTER PHYSICS COMMUNICATIONS, 2013, 184 (08) : 1999 - 2004
  • [3] Random Number Generation with Entropy Sources in the Graphics Processing Units
    Yeom, Yongjin
    MATERIALS, MECHANICAL ENGINEERING AND MANUFACTURE, PTS 1-3, 2013, 268-270 : 1863 - 1868
  • [4] High-Performance Pseudo-Random Number Generation on Graphics Processing Units
    Nandapalan, Nimalan
    Brent, Richard P.
    Murray, Lawrence M.
    Rendell, Alistair P.
    PARALLEL PROCESSING AND APPLIED MATHEMATICS, PT I, 2012, 7203 : 609 - 618
  • [5] GPUDePiCt: A Parallel Implementation of a Clustering Algorithm for Computing Degenerate Primers on Graphics Processing Units
    Cickovski, Trevor
    Flor, Tiffany
    Irving-Sachs, Galen
    Novikov, Philip
    Parda, James
    Narasimhan, Giri
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (02) : 445 - 454
  • [6] Efficient parallel implementation of a density peaks clustering algorithm on graphics processing unit
    Ge, Ke-shi
    Su, Hua-you
    Li, Dong-sheng
    Lu, Xi-cheng
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (07) : 915 - 927
  • [7] Efficient parallel implementation of a density peaks clustering algorithm on graphics processing unit
    Ke-shi Ge
    Hua-you Su
    Dong-sheng Li
    Xi-cheng Lu
    Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 915 - 927
  • [8] Parallel Implementation of the Discrete Wavelet Transform on Graphics Processing Units
    Khemiri, Randa
    Sayadi, Fatma
    Saidani, Taoufik
    Chouchene, Marwa
    Bahri, Haythem
    Tourki, Rached
    2014 1ST INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP 2014), 2014, : 111 - 114
  • [9] PARALLEL IMPLEMENTATION OF THE N-FINDR ENDMEMBER EXTRACTION ALGORITHM ON COMMODITY GRAPHICS PROCESSING UNITS
    Sanchez, Sergio
    Martin, Gabriel
    Plaza, Antonio
    2010 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2010, : 955 - 958
  • [10] Optimized Parallel Implementation of Gillespie's First Reaction Method on Graphics Processing Units
    Dittamo, Cristian
    Cangelosi, Davide
    2009 INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, PROCEEDINGS, 2009, : 156 - 161