Acceleration of PDE-based FTLE Calculations on Intel Multi-core and Many-core Architectures

被引:0
|
作者
Wang, Fang [1 ]
Deng, Liang [2 ]
Zhao, Dan [2 ]
Li, Sikun [2 ]
机构
[1] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China
[2] China Aerodynam Res & Dev Ctr, Computat Aerodynam Inst, Mianyang, Peoples R China
关键词
finite-time Lyapunov exponent (FTLE); coherent structure; partial differential equation (PDE); Intel MIC; hardware performance metrics; LAGRANGIAN COHERENT STRUCTURES; TIME LYAPUNOV EXPONENTS; FLUID-FLOWS; VORTEX; IDENTIFICATION; DEFINITION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Finite-time Lyapunov exponent (FTLE) iswidely used to extract coherent structure of unsteady flow. However, the calculation of FTLE can be highly time-consuming, which greatly limits the application's performance efficiency. In this paper, we accelerate a double precision PDE-based FTLE application for two-and three-dimensionalanalytical flow field on Intel multicore and many-core architectures such as Intel Sandy Bridge and Intel Many Integrated Core (MIC) coprocessor. Through analysis of the calculation processes of FTLE and the characteristics of Intelmulti-core and many-core architectures, we employ three categories of optimization techniques, namely, thread parallelism for multi-/many-core scaling, data parallelism to exploit SIMD (single-instruction multiple-data) mechanism and improving onchip data reuse, to maximize the performance. Also, the hardware performance metrics through an open source performance analysis tool, in order to explain performance difference between Sandy Bridge and MIC, are discussed. The experiment results show that our MIC-enabled FTLE achieves about 1.8x speed-ups relative to a parallel computation on two Intel Sandy Bridge CPUs, and perfect parallel efficiency is also observed from the experiment results.
引用
收藏
页码:178 / 183
页数:6
相关论文
共 50 条
  • [31] Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures
    Deng Liang
    Zhao Dan
    Bai Hanli
    Wang Fang
    CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (03) : 540 - 548
  • [32] Optimizing Machine Learning Algorithms on Multi-core and Many-core Architectures using Thread and Data Mapping
    Serpa, Matheus S.
    Krause, Arthur M.
    Cruz, Eduardo H. M.
    Navaux, Philippe O. A.
    Pasin, Marcelo
    Felber, Pascal
    2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 329 - 333
  • [33] The SDAV Software Frameworks for Visualization and Analysis on Next-Generation Multi-Core and Many-Core Architectures
    Sewell, Christopher
    Meredith, Jeremy
    Moreland, Kenneth
    Peterka, Tom
    DeMarle, Dave
    Lo, Li-ta
    Ahrens, James
    Maynard, Robert
    Geveci, Berk
    2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 206 - 214
  • [34] Accelerating collision detection for large-scale crowd simulation on multi-core and many-core architectures
    Vigueras, Guillermo
    Orduna, Juan M.
    Lozano, Miguel
    Cecilia, Jose M.
    Garcia, Jose M.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2014, 28 (01): : 33 - 49
  • [35] Exploiting multi-core and many-core architectures for efficient simulation of biologically realistic models of Golgi cells
    Florimbi, Giordana
    Torti, Emanuele
    Masoli, Stefano
    D'Angelo, Egidio
    Danese, Giovanni
    Leporati, Francesco
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 126 (48-66) : 48 - 66
  • [36] Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures
    DENG Liang
    ZHAO Dan
    BAI Hanli
    WANG Fang
    Chinese Journal of Electronics, 2018, 27 (03) : 540 - 548
  • [37] Evaluating Multi-core and Many-core Architectures Through Accelerating an Alternating Direction Implicit CFD Solver
    Deng, Liang
    Fang, Jianbin
    Wang, Fang
    Bai, Hanli
    2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 1 - 10
  • [38] Accelerating network coding on many-core GPUs and multi-core CPUs
    Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
    不详
    J. Commun., 2009, 11 (902-909):
  • [39] On the parallelization of Hirschberg's algorithm for multi-core and many-core systems
    Joao, Mario, Jr.
    Sena, Alexandre C.
    Rebello, Vinod E. F.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (18):
  • [40] On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures
    Pennycook, S. J.
    Hammond, S. D.
    Mudalige, G. R.
    Wright, S. A.
    Jarvis, S. A.
    COMPUTER JOURNAL, 2012, 55 (02): : 138 - 153