Locality-Conscious Nested-Loops Parallelization

Citations: 0
Authors
Parsa, Saeed [1]
Hamzei, Mohammad [1]
Affiliations
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
Keywords
Automatic nested loops parallelization; data locality; loop tiling; optimization
DOI
10.4218/etrij.13.0113.0266
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract
To speed up data-intensive programs, two complementary techniques, namely nested loops parallelization and data locality optimization, should be considered. Effective parallelization techniques distribute the computation and necessary data across different processors, whereas data locality places data on the same processor. Therefore, locality and parallelization may demand different loop transformations. As such, an integrated approach that combines these two can generate much better results than each individual approach. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation results in coarse grain parallelism through exploiting the largest possible groups of outer permutable loops in addition to data locality through dependence satisfaction at inner loops. These groups can be further tiled to improve data locality through exploiting data reuse in multiple dimensions.
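As a rough illustration of the two ideas the abstract combines, the sketch below tiles a matrix-multiplication loop nest and hands the outer tile loops to a thread pool. It is not the transformation algorithm proposed in the paper. The function names (`matmul_naive`, `compute_tile`, `matmul_tiled`) and the `tile` and `workers` parameters are hypothetical, chosen only for this example. The outer loops over `(i0, j0)` are permutable and carry no dependence, since each tile writes a disjoint block of `c`, while the inner loops reuse the same small blocks of `a` and `b`.

```python
from concurrent.futures import ThreadPoolExecutor


def matmul_naive(a, b):
    """Reference triple loop: c[i][j] = sum over k of a[i][k] * b[k][j]."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            for k in range(m):
                c[i][j] += a[i][k] * b[k][j]
    return c


def compute_tile(a, b, c, i0, j0, tile):
    """Fill the block of c that starts at (i0, j0), in place.

    The k dimension is tiled as well, so each pass touches only a
    tile-sized block of a and of b (reuse in more than one dimension).
    """
    n, m, p = len(a), len(b), len(b[0])
    for k0 in range(0, m, tile):
        for i in range(i0, min(i0 + tile, n)):
            for j in range(j0, min(j0 + tile, p)):
                for k in range(k0, min(k0 + tile, m)):
                    c[i][j] += a[i][k] * b[k][j]


def matmul_tiled(a, b, tile=2, workers=4):
    """Tiled multiply; independent (i0, j0) tiles are submitted to a thread pool."""
    n, p = len(a), len(b[0])
    c = [[0] * p for _ in range(n)]
    # Each (i0, j0) pair owns a disjoint block of c, so tiles can run in any order.
    tiles = [(i0, j0) for i0 in range(0, n, tile) for j0 in range(0, p, tile)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() drains the iterator so any worker exception is re-raised here.
        list(pool.map(lambda t: compute_tile(a, b, c, t[0], t[1], tile), tiles))
    return c
```

In CPython the global interpreter lock prevents these threads from giving a real speedup, so the pool only shows which loops are safe to distribute. A compiled implementation would typically place the parallel construct on the same outer tile loops.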
Pages: 124-133
Page count: 10
Related papers
50 in total
  • [21] A new approach to parallelization of serial nested loops using genetic algorithms
    Parsa, S
    Lotfi, S
    JOURNAL OF SUPERCOMPUTING, 2006, 36 (01): : 83 - 94
  • [22] Synchronization-Free Automatic Parallelization for Arbitrarily Nested Affine Loops
    Klimek, Tomasz
    Palkowski, Marek
    Bielecki, Wlodzimierz
    2016 28TH IEEE INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW), 2016, : 43 - 48
  • [23] An Automatic Parallelization Framework for Complex Nested Loops Based on LLVM Pass
    Ma C.-Y.
    Lv B.-X.
    Ye X.-J.
    Zhang Y.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (07):
  • [24] Nested Parallelism on GPU: Exploring Parallelization Templates for Irregular Loops and Recursive Computations
    Li, Da
    Wu, Hancheng
    Becchi, Michela
    2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2015, : 979 - 988
  • [25] Transformations for improving data access locality in non-perfectly nested loops
    Kulkarni, D
    1998 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 1998, : 314 - 321
  • [26] Parallelization of While Loops in Nested Loop Programs for Shared-Memory Multiprocessor Systems
    Geuns, Stefan J.
    Bekooij, Marco J. G.
    Bijlsma, Tjerk
    Corporaal, Henk
    2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 697 - 702
  • [27] Nested Parallelization with OpenMP
    Dieter an Mey
    Samuel Sarholz
    Christian Terboven
    International Journal of Parallel Programming, 2007, 35 : 459 - 476
  • [28] Automatic Loops Parallelization
    Goncalves, Cristiano Oliveira
    Spolon, Roberta
    Lobato, Renata Spolon
    Manacero, Aleardo, Jr.
    Lobato, Daniel Correa
    PROCEEDINGS OF THE 2014 9TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI 2014), 2014,
  • [29] Nested parallelization with OpenMP
    Mey, Dieter an
    Sarholz, Samuel
    Terboven, Christian
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2007, 35 (05) : 459 - 476
  • [30] ON OPTIMAL PARALLELIZATION OF ARBITRARY LOOPS
    SCHWIEGELSHOHN, U
    GASPERONI, F
    EBCIOGLU, K
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1991, 11 (02) : 130 - 134