Scientific Computing Kernels on the Cell Processor

被引:0
|
作者
Samuel Williams
John Shalf
Leonid Oliker
Shoaib Kamil
Parry Husbands
Katherine Yelick
机构
[1] CRD/NERSC,Lawrence Berkeley National Laboratory
来源
International Journal of Parallel Programming | 2007年 / 35卷
关键词
Cell processor; GEMM; SpMV; sparse matrix; FFT; Stencil; three level memory;
D O I
暂无
中图分类号
学科分类号
摘要
In this work, we examine the potential of using the recently-released STI Cell processor as a building block for future high-end scientific computing systems. Our work contains several novel contributions. First, we introduce a performance model for Cell and apply it to several key numerical kernels: dense matrix multiply, sparse matrix vector multiply, stencil computations, and 1D/2D FFTs. Next, we validate our model by comparing results against published hardware data, as well as our own Cell blade implementations. Additionally, we compare Cell performance to benchmarks run on leading superscalar (AMD Opteron), VLIW (Intel Itanium2), and vector (Cray X1E) architectures. Our work also explores several different kernel implementations and demonstrates a simple and effective programming model for Cell’s unique architecture. Finally, we propose modest microarchitectural modifications that could significantly increase the efficiency of double-precision calculations. Overall results demonstrate the tremendous potential of the Cell architecture for scientific computations in terms of both raw performance and power efficiency.
引用
收藏
页码:263 / 298
页数:35
相关论文
共 50 条
  • [21] CRYSTALLOGRAPHIC COMPUTING ON AN ARRAY PROCESSOR
    FUREY, W
    WANG, BC
    SAX, M
    JOURNAL OF APPLIED CRYSTALLOGRAPHY, 1982, 15 (APR) : 160 - 166
  • [22] THE BURROUGHS SCIENTIFIC PROCESSOR (BSP)
    KUCK, DJ
    STOKES, RA
    IEEE TRANSACTIONS ON COMPUTERS, 1982, 31 (05) : 363 - 376
  • [23] SCIENTIFIC COMPUTING
    MCMILLAN, T
    POPULAR COMPUTING, 1985, 4 (06): : 67 - 67
  • [24] Scientific Computing
    Schneebeli, H. R.
    ELEMENTE DER MATHEMATIK, 2012, 67 (04) : 201 - 201
  • [25] SCIENTIFIC COMPUTING
    MOLAU, GE
    BYTE, 1985, 10 (05): : 347 - 348
  • [26] Processor Selection for OpenCL Kernels using KNN Algorithm
    Rahmawan, Hendra
    Kuspriyanto
    Mutijarsa, Kusprasapta
    2018 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY SYSTEMS AND INNOVATION (ICITSI), 2018, : 336 - 341
  • [27] On the single processor performance of simple lattice Boltzmann kernels
    Wellein, G.
    Zeiser, T.
    Hager, G.
    Donath, S.
    COMPUTERS & FLUIDS, 2006, 35 (8-9) : 910 - 919
  • [28] Computing kernels in directed bichromatic graphs
    vonKarger, B
    Berghammer, R
    INFORMATION PROCESSING LETTERS, 1997, 62 (01) : 5 - 11
  • [29] A New Method for Computing Reproducing Kernels
    凌捷
    李岳生
    Northeastern Mathematical Journal, 1998, (04) : 90 - 96
  • [30] Fast and small short vector SIMD matrix multiplication kernels for the synergistic processing element of the CELL processor
    Alvaro, Wesley
    Kurzak, Jakub
    Dongarra, Jack
    COMPUTATIONAL SCIENCE - ICCS 2008, PT 1, 2008, 5101 : 935 - 944