Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

被引:11
|
作者
Muddukrishna, Ananya [1 ]
Jonsson, Peter A. [2 ]
Brorsson, Mats [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Sch Informat & Commun Technol, S-16440 Kista, Sweden
[2] SICS Swedish ICT AB, S-16429 Kista, Sweden
关键词
Compendex;
D O I
10.1155/2015/981759
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance degradation due to nonuniform data access latencies has worsened on NUMA systems and can now be felt onchip in manycore processors. Distributing data across NUMA nodes and manycore processor caches is necessary to reduce the impact of nonuniform latencies. However, techniques for distributing data are error-prone and fragile and require low-level architectural knowledge. Existing task scheduling policies favor quick load-balancing at the expense of locality and ignore NUMA node/manycore cache access latencies while scheduling. Locality-aware scheduling, in conjunction with or as a replacement for existing scheduling, is necessary to minimize NUMA effects and sustain performance. We present a data distribution and locality-aware scheduling technique for task-based OpenMP programs executing on NUMA systems and manycore processors. Our technique relieves the programmer from thinking of NUMA system/manycore processor architecture details by delegating data distribution to the runtime system and uses task data dependence information to guide the scheduling of OpenMP tasks to reduce data stall times. We demonstrate our technique on a four-socket AMD Opteron machine with eight NUMA nodes and on the TILEPro64 processor and identify that data distribution and locality-aware task scheduling improve performance up to 69% for scientific benchmarks compared to default policies and yet provide an architecture-oblivious approach for programmers.
引用
收藏
页数:16
相关论文
共 39 条
  • [31] Kernel mechanisms with dynamic task-aware scheduling to reduce resource contention in NUMA multi-core systems
    Chiang, Mei-Ling
    Yang, Chieh-Jui
    Tu, Shu-Wei
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 121 : 72 - 87
  • [32] Data-aware task scheduling on heterogeneous hybrid memory multiprocessor systems
    Chen, Junjie
    Li, Kenli
    Tang, Zhuo
    Liu, Chubo
    Wang, Yan
    Li, Keqin
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (17): : 4443 - 4459
  • [33] Adaptive NUMA-aware data placement and task scheduling for analytical workloads in main-memory column-stores
    Psaroudakis, Iraklis
    Scheuer, Tobias
    May, Norman
    Sellami, Abdelkader
    Ailamaki, Anastasia
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 10 (02): : 37 - 48
  • [34] An Energy and Data Locality Aware Bi-level Multiobjective Task Scheduling Model Based on MapReduce for Cloud Computing
    Wang, Xiaoli
    Wang, Yuping
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 648 - 655
  • [35] Energy-Aware Data Allocation and Task Scheduling on Heterogeneous Multiprocessor Systems With Time Constraints
    Wang, Yan
    Li, Kenli
    Chen, Hao
    He, Ligang
    Li, Keqin
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (02) : 134 - 148
  • [36] Data-aware task scheduling for all-to-all comparison problems in heterogeneous distributed systems
    Zhang, Yi-Fan
    Tian, Yu-Chu
    Fidge, Colin
    Kelly, Wayne
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2016, 93-94 : 87 - 101
  • [37] A Cluster-Based Data-Centric Model for Network-Aware Task Scheduling in Distributed Systems
    Fiore, Ugo
    Palmieri, Francesco
    Castiglione, Aniello
    De Santis, Alfredo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (05) : 755 - 775
  • [38] A Cluster-Based Data-Centric Model for Network-Aware Task Scheduling in Distributed Systems
    Ugo Fiore
    Francesco Palmieri
    Aniello Castiglione
    Alfredo De Santis
    International Journal of Parallel Programming, 2014, 42 : 755 - 775
  • [39] Topology Aware Task Allocation and Scheduling for Real-Time Data Fusion applications in Networked Embedded Sensor Systems
    Zhao, Baokang
    Wang, Meng
    Shao, Zili
    Cao, Jiannong
    Chan, Keith C. C.
    Su, Jinshu
    RTCSA 2008: 14TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS - PROCEEDINGS, 2008, : 293 - +