Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

被引:11
|
作者
Muddukrishna, Ananya [1 ]
Jonsson, Peter A. [2 ]
Brorsson, Mats [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Sch Informat & Commun Technol, S-16440 Kista, Sweden
[2] SICS Swedish ICT AB, S-16429 Kista, Sweden
关键词
Compendex;
D O I
10.1155/2015/981759
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance degradation due to nonuniform data access latencies has worsened on NUMA systems and can now be felt onchip in manycore processors. Distributing data across NUMA nodes and manycore processor caches is necessary to reduce the impact of nonuniform latencies. However, techniques for distributing data are error-prone and fragile and require low-level architectural knowledge. Existing task scheduling policies favor quick load-balancing at the expense of locality and ignore NUMA node/manycore cache access latencies while scheduling. Locality-aware scheduling, in conjunction with or as a replacement for existing scheduling, is necessary to minimize NUMA effects and sustain performance. We present a data distribution and locality-aware scheduling technique for task-based OpenMP programs executing on NUMA systems and manycore processors. Our technique relieves the programmer from thinking of NUMA system/manycore processor architecture details by delegating data distribution to the runtime system and uses task data dependence information to guide the scheduling of OpenMP tasks to reduce data stall times. We demonstrate our technique on a four-socket AMD Opteron machine with eight NUMA nodes and on the TILEPro64 processor and identify that data distribution and locality-aware task scheduling improve performance up to 69% for scientific benchmarks compared to default policies and yet provide an architecture-oblivious approach for programmers.
引用
收藏
页数:16
相关论文
共 39 条
  • [21] ISP-Friendly Data Scheduling by Advanced Locality-Aware Network Coding for P2P Distribution Cloud
    Li, Yanjun
    Zhang, Guoqing
    Zhang, Guoqiang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2014, 2014
  • [22] Scope-aware data cache analysis for OpenMP programs on multi-core processors
    Du, He
    Zhang, Wei
    Guan, Nan
    Yi, Wang
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 98 : 443 - 452
  • [23] Prediction-Based and Locality-Aware Task Scheduling for Parallelizing Video Transcoding Over Heterogeneous MapReduce Cluster
    Zhao, Hui
    Zheng, Qinghua
    Zhang, Weizhan
    Wang, Jing
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (04) : 1009 - 1020
  • [24] An Enhanced Data-Locality-Aware Task Scheduling Algorithm for Hadoop Applications
    Choi, Dongjoo
    Jeon, Myunghoon
    Kim, Namgi
    Lee, Byoung-Dai
    IEEE SYSTEMS JOURNAL, 2018, 12 (04): : 3346 - 3357
  • [25] Balancing Job Performance with System Performance via Locality-Aware Scheduling on Torus-Connected Systems
    Yang, Xu
    Zhou, Zhou
    Tang, Wei
    Zheng, Xingwu
    Wang, Jia
    Lan, Zhiling
    2014 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2014, : 140 - 148
  • [26] Locality and Network-Aware Reduce Task Scheduling for Data-Intensive Applications
    Arslan, Engin
    Shekhar, Mrigank
    Kosar, Tevfik
    2014 5TH INTERNATIONAL WORKSHOP ON DATA-INTENSIVE COMPUTING IN THE CLOUDS (DATACLOUD), 2014, : 17 - 24
  • [27] A Data Distribution Aware Task Scheduling Strategy for MapReduce System
    Guo, Leitao
    Sun, Hongwei
    Luo, Zhiguo
    CLOUD COMPUTING, PROCEEDINGS, 2009, 5931 : 694 - 699
  • [28] NUMA-aware Scheduling and Memory Allocation for data-flow task-parallel Applications
    Drebes, Andi
    Pop, Antoniu
    Heydemann, Karine
    Drach, Nathalie
    Cohen, Albert
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 391 - 392
  • [29] Taming data locality for task scheduling under memory constraint in runtime systems
    Gonthier, Maxime
    Marchal, Loris
    Thibault, Samuel
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 143 : 305 - 321
  • [30] A highly efficient data locality aware task scheduler for cloud-based systems
    Ru, Jia
    Yang, Yun
    Grundy, John
    Keung, Jacky
    Hao, Li
    2019 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (IEEE CLOUD 2019), 2019, : 496 - 498