Explicit Fourth-Order Runge–Kutta Method on Intel Xeon Phi Coprocessor

被引:0
|
作者
Beata Bylina
Joanna Potiopa
机构
[1] Maria Curie-Skłodowska University,Department of Computer Science
关键词
Intel Xeon Phi; Fourth-order Runge–Kutta method; CSR format; Intel Math Kernel Library (Intel MKL); SpMV; OpenMP;
D O I
暂无
中图分类号
学科分类号
摘要
This paper concerns an Intel Xeon Phi implementation of the explicit fourth-order Runge–Kutta method (RK4) for very sparse matrices with very short rows. Such matrices arise during Markovian modeling of computer and telecommunication networks. In this work an implementation based on Intel Math Kernel Library (Intel MKL) routines and the authors’ own implementation, both using the CSR storage scheme and working on Intel Xeon Phi, were investigated. The implementation based on the Intel MKL library uses the high-performance BLAS and Sparse BLAS routines. In our application we focus on OpenMP style programming. We implement SpMV operation and vector addition using the basic optimizing techniques and the vectorization. We evaluate our approach in native and offload modes for various number of cores and thread allocation affinities. Both implementations (based on Intel MKL and made by the authors) were compared in respect of the time, the speedup and the performance. The numerical experiments on Intel Xeon Phi show that the performance of authors’ implementation is very promising and gives a gain of up to two times compared to the multithreaded implementation (based on Intel MKL) running on CPU (Intel Xeon processor) and even three times in comparison with the application which uses Intel MKL on Intel Xeon Phi.
引用
收藏
页码:1073 / 1090
页数:17
相关论文
共 50 条
  • [21] High Performance Stencil Computations for Intel® Xeon Phi™ Coprocessor
    Feng, Luxia
    Dong, Yushan
    Li, Chunjiang
    Jiang, Hao
    ADVANCED COMPUTER ARCHITECTURE, ACA 2016, 2016, 626 : 108 - 117
  • [22] Optimization of Molecular Dynamics Application for Intel Xeon Phi Coprocessor
    Harode, Amit
    Gupta, Apaar
    Mathew, Benny
    Rai, Nitin
    2014 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND APPLICATIONS (ICHPCA), 2014,
  • [23] Fourth-Order Runge–Kutta Schemes for Fluid Mechanics Applications
    M. H. Carpenter
    C. A. Kennedy
    Hester Bijl
    S. A. Viken
    Veer N. Vatsa
    Journal of Scientific Computing, 2005, 25 : 157 - 194
  • [24] ON OPTIMAL CHOICE OF FOURTH-ORDER RUNGE-KUTTA FORMULAS
    LUTHER, HA
    SIERRA, HG
    NUMERISCHE MATHEMATIK, 1970, 15 (04) : 354 - &
  • [25] Similarity (range and kNN) queries processing on an Intel Xeon Phi coprocessor
    Carlos M. Toledo
    Ricardo J. Barrientos
    Andrés I. Ávila
    Cluster Computing, 2016, 19 : 57 - 71
  • [26] Adaptation of MPDATA Heterogeneous Stencil Computation to Intel Xeon Phi Coprocessor
    Szustak, Lukasz
    Rojek, Krzysztof
    Olas, Tomasz
    Kuczynski, Lukasz
    Halbiniak, Kamil
    Gepner, Pawel
    SCIENTIFIC PROGRAMMING, 2015, 2015
  • [27] Similarity (range and kNN) queries processing on an Intel Xeon Phi coprocessor
    Toledo, Carlos M.
    Barrientos, Ricardo J.
    Avila, Andres I.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (01): : 57 - 71
  • [28] Energy Efficiency Evaluation of Workload Execution on Intel Xeon Phi Coprocessor
    Zhao, Qi
    Yang, Hailong
    Wei, Guang
    Luan, Zhongzhi
    Qian, Depei
    TRUSTWORTHY COMPUTING AND SERVICES, 2014, 426 : 268 - 275
  • [29] Evaluating the transport layer of the ALFA framework for the Intel® Xeon Phi™ Coprocessor
    Santogidis, Aram
    Hirstius, Andreas
    Lalis, Spyros
    21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
  • [30] Using Intel Xeon Phi Coprocessor to Accelerate Computations in MPDATA Algorithm
    Szustak, Lukasz
    Rojek, Krzysztof
    Gepner, Pawel
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT I, 2014, 8384 : 582 - 592