Acceleration of PDE-based FTLE Calculations on Intel Multi-core and Many-core Architectures

被引：0

作者：

Wang, Fang ^{[1
]}

Deng, Liang ^{[2
]}

Zhao, Dan ^{[2
]}

Li, Sikun ^{[2
]}

机构：

[1] Natl Univ Def Technol, Sch Comp, Changsha, Hunan, Peoples R China

[2] China Aerodynam Res & Dev Ctr, Computat Aerodynam Inst, Mianyang, Peoples R China

来源：

PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015) | 2015年

关键词：

finite-time Lyapunov exponent (FTLE); coherent structure; partial differential equation (PDE); Intel MIC; hardware performance metrics; LAGRANGIAN COHERENT STRUCTURES; TIME LYAPUNOV EXPONENTS; FLUID-FLOWS; VORTEX; IDENTIFICATION; DEFINITION;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Finite-time Lyapunov exponent (FTLE) iswidely used to extract coherent structure of unsteady flow. However, the calculation of FTLE can be highly time-consuming, which greatly limits the application's performance efficiency. In this paper, we accelerate a double precision PDE-based FTLE application for two-and three-dimensionalanalytical flow field on Intel multicore and many-core architectures such as Intel Sandy Bridge and Intel Many Integrated Core (MIC) coprocessor. Through analysis of the calculation processes of FTLE and the characteristics of Intelmulti-core and many-core architectures, we employ three categories of optimization techniques, namely, thread parallelism for multi-/many-core scaling, data parallelism to exploit SIMD (single-instruction multiple-data) mechanism and improving onchip data reuse, to maximize the performance. Also, the hardware performance metrics through an open source performance analysis tool, in order to explain performance difference between Sandy Bridge and MIC, are discussed. The experiment results show that our MIC-enabled FTLE achieves about 1.8x speed-ups relative to a parallel computation on two Intel Sandy Bridge CPUs, and perfect parallel efficiency is also observed from the experiment results.

引用

页码：178 / 183

页数：6

共 50 条

[31] Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures
Deng Liang
Zhao Dan
Bai Hanli
Wang Fang
CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (03) : 540 - 548
[32] Optimizing Machine Learning Algorithms on Multi-core and Many-core Architectures using Thread and Data Mapping
Serpa, Matheus S.
Krause, Arthur M.
Cruz, Eduardo H. M.
Navaux, Philippe O. A.
Pasin, Marcelo
Felber, Pascal
2018 26TH EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2018), 2018, : 329 - 333
[33] The SDAV Software Frameworks for Visualization and Analysis on Next-Generation Multi-Core and Many-Core Architectures
Sewell, Christopher
Meredith, Jeremy
Moreland, Kenneth
Peterka, Tom
DeMarle, Dave
Lo, Li-ta
Ahrens, James
Maynard, Robert
Geveci, Berk
2012 SC COMPANION: HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SCC), 2012, : 206 - 214
[34] Accelerating collision detection for large-scale crowd simulation on multi-core and many-core architectures
Vigueras, Guillermo
Orduna, Juan M.
Lozano, Miguel
Cecilia, Jose M.
Garcia, Jose M.
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2014, 28 (01): : 33 - 49
[35] Exploiting multi-core and many-core architectures for efficient simulation of biologically realistic models of Golgi cells
Florimbi, Giordana
Torti, Emanuele
Masoli, Stefano
D'Angelo, Egidio
Danese, Giovanni
Leporati, Francesco
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2019, 126 (48-66) : 48 - 66
[36] Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures
DENG Liang
ZHAO Dan
BAI Hanli
WANG Fang
Chinese Journal of Electronics, 2018, 27 (03) : 540 - 548
[37] Evaluating Multi-core and Many-core Architectures Through Accelerating an Alternating Direction Implicit CFD Solver
Deng, Liang
Fang, Jianbin
Wang, Fang
Bai, Hanli
2016 15TH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING (ISPDC), 2016, : 1 - 10
[38] Accelerating network coding on many-core GPUs and multi-core CPUs
Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
不详
J. Commun., 2009, 11 (902-909):
[39] On the parallelization of Hirschberg's algorithm for multi-core and many-core systems
Joao, Mario, Jr.
Sena, Alexandre C.
Rebello, Vinod E. F.
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (18):
[40] On the Acceleration of Wavefront Applications using Distributed Many-Core Architectures
Pennycook, S. J.
Hammond, S. D.
Mudalige, G. R.
Wright, S. A.
Jarvis, S. A.
COMPUTER JOURNAL, 2012, 55 (02): : 138 - 153

← 1 2 3 4 5 →