Locality-Conscious Nested-Loops Parallelization

Authors
Parsa, Saeed [1]
Hamzei, Mohammad [1]
Affiliations
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
Keywords
Automatic nested loops parallelization; data locality; loop tiling; optimization
DOI
10.4218/etrij.13.0113.0266
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract
To speed up data-intensive programs, two complementary techniques, namely nested loops parallelization and data locality optimization, should be considered. Effective parallelization techniques distribute the computation and necessary data across different processors, whereas data locality places data on the same processor. Therefore, locality and parallelization may demand different loop transformations. As such, an integrated approach that combines these two can generate much better results than either approach alone. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation results in coarse-grained parallelism by exploiting the largest possible groups of outer permutable loops, in addition to data locality through dependence satisfaction at inner loops. These groups can be further tiled to improve data locality by exploiting data reuse in multiple dimensions.
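The tiling idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' transformation algorithm; it is a generic tiled matrix multiplication, and the function names `matmul_naive` and `matmul_tiled` and the `tile` parameter are hypothetical names chosen for illustration. It shows a permutable loop nest split into outer tile loops (candidates for coarse-grained distribution) and inner loops that reuse a small block of data.

```python
def matmul_naive(a, b, n):
    """Reference n x n matrix product with the original loop order."""
    c = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_tiled(a, b, n, tile=2):
    """Same product, with all three loops tiled (illustrative sketch only)."""
    c = [[0] * n for _ in range(n)]
    # Outer tile loops: each (ii, jj) pair writes a disjoint block of c,
    # so these two loops carry no dependence and could be distributed
    # across processors. The kk loop carries the reduction on c.
    for ii in range(0, n, tile):
        for jj in range(0, n, tile):
            for kk in range(0, n, tile):
                # Inner loops touch only tile-sized blocks of a, b, and c,
                # which is where the data reuse (locality) comes from.
                for i in range(ii, min(ii + tile, n)):
                    for j in range(jj, min(jj + tile, n)):
                        for k in range(kk, min(kk + tile, n)):
                            c[i][j] += a[i][k] * b[k][j]
    return c
```

In a compiled language the tile size would typically be chosen so that the blocks touched by the inner loops fit in cache; in this pure-Python sketch the tiling only demonstrates the loop structure, not a speedup.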
Pages: 124-133
Number of pages: 10