MANY-TASK COMPUTING ON MANY-CORE ARCHITECTURES

被引:0
|
作者
Valero-Lara, Pedro [1 ,2 ]
Nookala, Poornima [3 ]
Pelayo, Fernando L. [4 ]
Jansson, Johan [2 ,5 ]
Dimitropoulos, Serapheim [3 ]
Raicu, Ioan [3 ]
机构
[1] Univ Manchester, Manchester M13 9PL, Lancs, England
[2] BCAM, Bilbao, Spain
[3] IIT, Chicago, IL 60616 USA
[4] UCLM, Albacete, Spain
[5] KTH Royal Inst Technol, Stockholm, Sweden
来源
关键词
Parallel Computing; Multi-Task Computing; Many-Core; GPU; Intel Xeon Phi; CUDA; OpenMP;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Many-Task Computing (MTC) is a common scenario for multiple parallel systems, such as cluster, grids, cloud and supercomputers, but it is not so popular in shared memory parallel processors. In this sense and given the spectacular growth in performance and in number of cores integrated in many-core architectures, the study of MTC on such architectures is becoming more and more relevant. In this paper, authors present what are those programming mechanisms to take advantages of such massively parallel features for the particular target of MTC. Also, the hardware features of the two dominant many-core platforms (NVIDIA's GPUs and Intel Xeon Phi) are also analyzed for our specific framework. Given the important differences in terms of hardware and software in our two many-core platforms, we have considered different strategies based on CUDA (for GPUs) and OpenMP (for Intel Xeon Phi). We carried out several test cases based on an appropriate and widely studied problem for benchmarking as matrix multiplication. Essentially, this study consisted of comparing the time consumed for computing in parallel several tasks one by one (the whole computational resources are used just to compute one task at a time) with the time consumed for computing in parallel the same set of tasks simultaneously (the whole computational resources are used for computing the set of tasks at very same time). Finally, we compared both software-hardware scenarios to identify the most relevant computer features in each of our many-core architectures.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [31] Benchmarking Molecular Dynamics with OpenCL on Many-Core Architectures
    Halver, Rene
    Homberg, Wilhelm
    Sutmann, Godehard
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2017), PT II, 2018, 10778 : 244 - 253
  • [32] Exploiting memory allocations in clusterised many-core architectures
    Garibotti, Rafael
    Ost, Luciano
    Butko, Anastasiia
    Reis, Ricardo
    Gamatie, Abdoulaye
    Sassatelli, Gilles
    IET COMPUTERS AND DIGITAL TECHNIQUES, 2019, 13 (04): : 302 - 311
  • [33] TOOLS AND ENVIRONMENTS FOR MULTICORE AND MANY-CORE ARCHITECTURES INTRODUCTION
    Feng, Wu-Chun
    Balaji, Pavan
    COMPUTER, 2009, 42 (12) : 26 - 27
  • [34] Adapting Particle Filter Algorithms to Many-Core Architectures
    Chitchian, Mehdi
    van Amesfoort, Alexander S.
    Simonetto, Andrea
    Keviczky, Tamas
    Sips, Henk J.
    IEEE 27TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2013), 2013, : 427 - 438
  • [35] Hybrid Coarrays: a PGAS Feature for Many-Core Architectures
    Cardellini, Valeria
    Fanfarillo, Alessandro
    Filippone, Salvatore
    Rouson, Damian
    PARALLEL COMPUTING: ON THE ROAD TO EXASCALE, 2016, 27 : 175 - 184
  • [36] Power Efficient Photonic Networks for Many-Core Architectures
    Neel, Brian
    Morris, Randy
    Ditomaso, Dominic
    Kodi, Avinash
    2012 INTERNATIONAL GREEN COMPUTING CONFERENCE (IGCC), 2012,
  • [37] Vectorizing unstructured mesh computations for many-core architectures
    Reguly, Istvan Z.
    Laszlo, Endre
    Mudalige, Gihan R.
    Giles, Mike B.
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (02): : 557 - 577
  • [38] Scalable Parallel Flash Firmware for Many-core Architectures
    Zhang, Jie
    Kwon, Miryeong
    Swift, Michael
    Jung, Myoungsoo
    PROCEEDINGS OF THE 18TH USENIX CONFERENCE ON FILE AND STORAGE TECHNOLOGIES, 2020, : 121 - 136
  • [39] Distributed Peak Power Management for Many-core Architectures
    Sartori, John
    Kumar, Rakesh
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 1556 - 1559
  • [40] Architectural Support for Cilk Computations on Many-core Architectures
    Long, Guoping
    Fan, Dongrui
    Zhang, Junchao
    ACM SIGPLAN NOTICES, 2009, 44 (04) : 285 - 286