Dynamic and Speculative Polyhedral Parallelization Using Compiler-Generated Skeletons

Cited by: 0

Authors:

Alexandra Jimborean

Philippe Clauss

Jean-François Dollinger

Vincent Loechner

Juan Manuel Martinez Caamaño

Affiliations:

[1] University of Uppsala, UPMARC

[2] University of Strasbourg, ICube, INRIA, CNRS

Source:

International Journal of Parallel Programming | 2014 / Volume 42

Keywords:

Algorithmic skeletons; Polytope model; Automatic parallelization; Speculative parallelization; Dynamic parallelization; Loop nests; Compilation

DOI:

Not available

Chinese Library Classification number:

Subject classification number:

Abstract:

We propose a framework for the speculative parallelization of scientific nested loop kernels, based on an original generation and use of algorithmic skeletons. The framework applies polyhedral transformations to the target code at run-time in order to expose parallelism and data locality. Parallel code generation is achieved at almost no cost by using binary algorithmic skeletons that are generated at compile-time and that embed the original code together with the operations needed to instantiate a polyhedral parallelizing transformation and to verify the speculations on dependences. The skeletons are patched at run-time to generate the executable code. The run-time process includes a transformation selection guided by online profiling phases on short samples, using an instrumented version of the code. During this phase, the accessed memory addresses are used to compute dependence distance vectors on the fly, and are also interpolated to build a predictor of the forthcoming accesses. Interpolating functions and distance vectors are then employed for dependence analysis to select a parallelizing transformation that, if the prediction is correct, does not induce any rollback during execution. To keep the rollback time overhead low, the code is executed in successive slices of the outermost original loop of the nest. Each slice can be a parallel version that instantiates a skeleton, a sequential original version, or an instrumented version. Moreover, slicing the execution makes it possible to transform the code differently as the observed execution phases change, by patching one of the pre-built skeletons differently. The framework has been implemented with extensions of the LLVM compiler and an x86-64 runtime system. Significant speed-ups are shown on a set of benchmarks that could not have been handled efficiently by a compiler.
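
The profiling step described in the abstract can be illustrated with a minimal sketch. It is not the framework's implementation: every function name, the trace format, and the sample kernel `A[i][j] = A[i-1][j] + 1` are assumptions made for illustration. The sketch interpolates an affine address predictor from a short sample, collects dependence distance vectors from the observed accesses, and reports which loop levels carry no observed dependence.

```python
# Hypothetical sketch: all names are illustrative, not the framework's API.

def fit_affine_2d(samples):
    """Interpolate addr = c0 + ci*i + cj*j from profiled {(i, j): addr} samples.

    Returns (c0, ci, cj), or None when the observed addresses are not affine.
    """
    i0, j0 = min(samples)                     # lexicographically first sampled iteration
    base = samples[(i0, j0)]
    step_i = samples.get((i0 + 1, j0))
    step_j = samples.get((i0, j0 + 1))
    if step_i is None or step_j is None:
        return None                           # sample too small to interpolate
    ci, cj = step_i - base, step_j - base
    c0 = base - ci * i0 - cj * j0
    # Keep the predictor only if it reproduces every profiled access.
    if all(c0 + ci * i + cj * j == addr for (i, j), addr in samples.items()):
        return (c0, ci, cj)
    return None


def distance_vectors(trace):
    """Collect dependence distance vectors from an execution-ordered trace.

    trace: iterable of (iteration, address, is_write) tuples.
    """
    seen = {}                                 # address -> [(iteration, is_write), ...]
    distances = set()
    for it, addr, is_write in trace:
        for prev_it, prev_write in seen.get(addr, []):
            # A dependence needs at least one write and two distinct iterations.
            if (is_write or prev_write) and prev_it != it:
                distances.add(tuple(b - a for a, b in zip(prev_it, it)))
        seen.setdefault(addr, []).append((it, is_write))
    return distances


def parallel_loops(distances, depth):
    """Return the loop levels that carry no observed dependence."""
    carried = set()
    for d in distances:
        for level, component in enumerate(d):
            if component != 0:
                carried.add(level)            # first non-zero component carries it
                break
    return [level for level in range(depth) if level not in carried]


def sample_trace(base=0x1000, row=8, elem=8):
    """Profiled sample of A[i][j] = A[i-1][j] + 1 for i in 1..3, j in 0..3."""
    trace = []
    for i in range(1, 4):
        for j in range(4):
            trace.append(((i, j), base + ((i - 1) * row + j) * elem, False))  # read
            trace.append(((i, j), base + (i * row + j) * elem, True))         # write
    return trace


if __name__ == "__main__":
    trace = sample_trace()
    writes = {it: addr for it, addr, is_write in trace if is_write}
    print(fit_affine_2d(writes))
    print(distance_vectors(trace))
    print(parallel_loops(distance_vectors(trace), depth=2))
```

A result obtained this way holds only for the profiled sample. As the abstract states, the selected transformation remains a speculation that must be verified during execution, with a rollback if the prediction turns out to be wrong.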


Pages: 529-545

Page count: 16

