Locality-Aware Task Scheduling and Data Distribution for OpenMP Programs on NUMA Systems and Manycore Processors

被引:11
|
作者
Muddukrishna, Ananya [1 ]
Jonsson, Peter A. [2 ]
Brorsson, Mats [1 ,2 ]
机构
[1] KTH Royal Inst Technol, Sch Informat & Commun Technol, S-16440 Kista, Sweden
[2] SICS Swedish ICT AB, S-16429 Kista, Sweden
关键词
Compendex;
D O I
10.1155/2015/981759
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Performance degradation due to nonuniform data access latencies has worsened on NUMA systems and can now be felt onchip in manycore processors. Distributing data across NUMA nodes and manycore processor caches is necessary to reduce the impact of nonuniform latencies. However, techniques for distributing data are error-prone and fragile and require low-level architectural knowledge. Existing task scheduling policies favor quick load-balancing at the expense of locality and ignore NUMA node/manycore cache access latencies while scheduling. Locality-aware scheduling, in conjunction with or as a replacement for existing scheduling, is necessary to minimize NUMA effects and sustain performance. We present a data distribution and locality-aware scheduling technique for task-based OpenMP programs executing on NUMA systems and manycore processors. Our technique relieves the programmer from thinking of NUMA system/manycore processor architecture details by delegating data distribution to the runtime system and uses task data dependence information to guide the scheduling of OpenMP tasks to reduce data stall times. We demonstrate our technique on a four-socket AMD Opteron machine with eight NUMA nodes and on the TILEPro64 processor and identify that data distribution and locality-aware task scheduling improve performance up to 69% for scientific benchmarks compared to default policies and yet provide an architecture-oblivious approach for programmers.
引用
收藏
页数:16
相关论文
共 39 条
  • [1] Locality-aware predictive scheduling of network processors
    Wolf, T
    Franklin, MA
    ISPASS: 2001 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2001, : 152 - 159
  • [2] Locality-aware task scheduling for homogeneous parallel computing systems
    Muhammad Khurram Bhatti
    Isil Oz
    Sarah Amin
    Maria Mushtaq
    Umer Farooq
    Konstantin Popov
    Mats Brorsson
    Computing, 2018, 100 : 557 - 595
  • [3] Locality-aware task scheduling for homogeneous parallel computing systems
    Bhatti, Muhammad Khurram
    Oz, Isil
    Amin, Sarah
    Mushtaq, Maria
    Farooq, Umer
    Popov, Konstantin
    Brorsson, Mats
    COMPUTING, 2018, 100 (06) : 557 - 595
  • [4] Locality-Aware Dynamic Task Graph Scheduling
    Maglalang, Jordyn
    Krishnamoorthy, Sriram
    Agrawal, Kunal
    2017 46TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP), 2017, : 70 - 80
  • [5] Tiresias: Optimizing NUMA Performance with CXL Memory and Locality-Aware Process Scheduling
    Tang, Wenda
    Ai, Tianxiang
    Wu, Jie
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024, : 6 - 11
  • [6] OpenMP task scheduling strategies for multicore NUMA systems
    Olivier, Stephen L.
    Porterfield, Allan K.
    Wheeler, Kyle B.
    Spiegel, Michael
    Prins, Jan F.
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2012, 26 (02): : 110 - 124
  • [7] Locality-Aware Scheduling of Independent Tasks for Runtime Systems
    Gonthier, Maxime
    Marchal, Loris
    Thibault, Samuel
    EURO-PAR 2021: PARALLEL PROCESSING WORKSHOPS, 2022, 13098 : 5 - 16
  • [8] Data-Driven Locality-Aware Batch Scheduling
    Gonthier, Maxime
    Larsson, Elisabeth
    Marchal, Loris
    Nettelblad, Carl
    Thibault, Samuel
    2024 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW 2024, 2024, : 202 - 211
  • [9] Taming Big Data SVM with Locality-Aware Scheduling
    Ye, Mao
    Wang, Jun
    Yin, Jiangling
    Han, Dezhi
    2016 FOURTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD 2016), 2016, : 37 - 44
  • [10] Locality-aware and load-balanced static task scheduling for MapReduce
    Selvitopi, Oguz
    Demirci, Gunduz Vehbi
    Turk, Ata
    Aykanat, Cevdet
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 90 : 49 - 61