Locality-Conscious Nested-Loops Parallelization

Cited by: 0
Authors
Parsa, Saeed [1]
Hamzei, Mohammad [1]
Affiliations
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
Keywords
Automatic nested loops parallelization; data locality; loop tiling; optimization
DOI
10.4218/etrij.13.0113.0266
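Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809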
Abstract
To speed up data-intensive programs, two complementary techniques, namely nested loops parallelization and data locality optimization, should be considered. Effective parallelization techniques distribute the computation and necessary data across different processors, whereas data locality places data on the same processor. Therefore, locality and parallelization may demand different loop transformations. As such, an integrated approach that combines these two can generate much better results than each individual approach. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation results in coarse grain parallelism through exploiting the largest possible groups of outer permutable loops in addition to data locality through dependence satisfaction at inner loops. These groups can be further tiled to improve data locality through exploiting data reuse in multiple dimensions.
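The abstract's idea of tiling a group of permutable loops can be illustrated with a generic example. The sketch below is not the paper's algorithm or its transformation framework. It is a standard tiled matrix multiplication, chosen here as an assumption because its `i` and `j` loops are permutable and carry no dependence, while the reduction over `k` stays in the inner loops. The names `matmul_naive`, `matmul_tiled`, and the `tile` parameter are hypothetical.

```python
from itertools import product


def matmul_naive(A, B, n):
    """Reference loop nest: C[i][j] += A[i][k] * B[k][j]."""
    C = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                C[i][j] += A[i][k] * B[k][j]
    return C


def matmul_tiled(A, B, n, tile=2):
    """Same computation with the i, j, and k loops tiled.

    Each (ii, jj) tile writes a disjoint block of C, so distinct
    tiles could be handed to different processors (coarse-grained
    parallelism). Here they run sequentially to keep the sketch small.
    """
    C = [[0] * n for _ in range(n)]
    # Outer tile loops over the permutable (i, j) pair.
    for ii, jj in product(range(0, n, tile), repeat=2):
        # The k dimension is tiled too, so blocks of A and B are reused
        # while they are still likely to be in cache.
        for kk in range(0, n, tile):
            for i in range(ii, min(ii + tile, n)):
                for j in range(jj, min(jj + tile, n)):
                    for k in range(kk, min(kk + tile, n)):
                        C[i][j] += A[i][k] * B[k][j]
    return C
```

Integer inputs make the tiled and untiled results exactly comparable. With floating-point data the reordered summation over `k` may differ in the last bits.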
Pages: 124-133
Page count: 10