Performance Characterization and Evaluation of HPC Algorithms on Dissimilar Multicore Architectures

被引:0
|
作者
Krishnan, S. P. T. [1 ]
Veeravalli, Bharadwaj [2 ]
机构
[1] Agcy Sci Technol & Res, Inst Infocomm Res, Singapore 138632, Singapore
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117583, Singapore
关键词
RNA SECONDARY STRUCTURE; PARALLEL GENETIC ALGORITHM; STRUCTURE PREDICTION; PSEUDOKNOTS; IMPLEMENTATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we share our experiences in using two important yet different High Performance Computing (HPC) architectures for evaluating two HPC algorithms. The first architecture is an Intel x64 ISA based homogenous multicore with Uniform Memory Access (UMA) type shared-memory based Symmetric Multi-Processing system. The second architecture is an IBM Power ISA based heterogenous multicore with Non-Uniform Memory Access (NUMA) based distributed-memory Asymmetric Multi-Processing system. The two HPC algorithms are for predicting biological molecular structures, specifically the RNA secondary structures. The first algorithm that we created is a parallelized version of a popular serial RNA secondary structure prediction algorithm called PKNOTS. The second algorithm is a new parallel-by-design algorithm that we have developed called MARSs. Using real Ribo-Nucleic Acid (RNA) sequences, we conducted large-scale experiments involving hundreds of sequences using the above two algorithms. Based on thousands of data points that we collected as an outcome of our experiments, we report on the observed performance metrics for both the algorithms on the two architectures. Through our experiments, we infer that architectures with specialized co-processors for number-crunching along with high-speed memory bus and dedicated bus controllers generally perform better than general-purpose multi-processor architectures. In addition, we observed that algorithms that are intrinsically parallelized by design are able to scale & perform better by taking advantage of the underlying parallel architecture. We further share best practices on handling scalability aspects with regards to workload size. We believe our results are applicable to other HPC applications on similar HPC architectures.
引用
收藏
页码:1288 / 1295
页数:8
相关论文
共 50 条
  • [41] Evaluation of medical image algorithms on multicore processors
    2018, Slovene Society Informatika (42):
  • [42] Evaluation of Medical Image Algorithms on Multicore Processors
    Demirovic, Damir
    Sabanovic, Zekerijah
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2018, 42 (02): : 167 - 173
  • [43] PERFORMANCE EVALUATION OF PARALLEL GENETIC AND PARTICLE SWARM OPTIMIZATION ALGORITHMS WITHIN THE MULTICORE ARCHITECTURE
    Radhamani, A. S.
    Baburaj, E.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (04)
  • [44] Evaluation of Deep Learning Frameworks over Different HPC Architectures
    Shams, Shayan
    Platania, Richard
    Lee, Kisung
    Park, Seung-Jong
    2017 IEEE 37TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2017), 2017, : 1389 - 1396
  • [45] Analysis of dynamically scheduled tile algorithms for dense linear algebra on multicore architectures
    Haidar, Azzam
    Ltaief, Hatem
    YarKhan, Asim
    Dongarra, Jack
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2012, 24 (03): : 305 - 321
  • [46] Solving the Generalized Symmetric Eigenvalue Problem using Tile Algorithms on Multicore Architectures
    Ltaief, Hatem
    Luszczek, Piotr
    Haidar, Azzam
    Dongarra, Jack
    APPLICATIONS, TOOLS AND TECHNIQUES ON THE ROAD TO EXASCALE COMPUTING, 2012, 22 : 397 - 404
  • [47] Scheduling two-sided transformations using tile algorithms on multicore architectures
    Ltaief, Hatem
    Kurzak, Jakub
    Dongarra, Jack
    Badia, Rosa M.
    SCIENTIFIC PROGRAMMING, 2010, 18 (01) : 35 - 50
  • [48] Performance and Energy Efficient Asymmetrically Reliable Caches for Multicore Architectures
    Arslan, Sanem
    Topcuoglu, Haluk Rahmi
    Kandemir, Mahmut Taylan
    Tosun, Oguz
    2015 IEEE 29TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, 2015, : 1025 - 1032
  • [49] A Robust Methodology for Performance Analysis on Hybrid Embedded Multicore Architectures
    Saussard, Romain
    Bouzid, Boubker
    Vasiliu, Marius
    Reynaud, Roger
    2016 IEEE 10TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC), 2016, : 77 - 84
  • [50] Empirical Evaluation of the Parallel Distribution Sweeping Framework on Multicore Architectures
    Ajwani, Deepak
    Sitchinava, Nodari
    ALGORITHMS - ESA 2013, 2013, 8125 : 25 - 36