A methodology for detailed performance modeling of reduction computations on SMP machines

Cited by: 3
Authors
Jin, RM [1]
Agrawal, G [1]
Affiliations
[1] Ohio State Univ, Dept Comp & Informat Sci, Columbus, OH 43210 USA
Funding
National Science Foundation (USA);
Keywords
parallel processing; shared memory; memory hierarchy; data mining;
DOI
10.1016/j.peva.2004.10.017
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
In this paper, we revisit the problem of performance prediction on SMP machines, motivated by the need to select a parallelization strategy for random write reductions. Such reductions frequently arise in data mining algorithms. In our previous work, we developed a number of techniques for parallelizing this class of reductions, and showed that each of three techniques (full replication, optimized full locking, and cache-sensitive locking) can outperform the others depending upon problem, dataset, and machine parameters. Therefore, an important question is: "Can we predict the performance of these techniques for a given problem, dataset, and machine?" This paper addresses this question by developing an analytical performance model that captures a two-level cache, coherence cache misses, TLB misses, locking overheads, and contention for memory. The analytical model is combined with results from micro-benchmarking to predict performance on real machines. We have validated our model on two different SMP machines. Our results show that the model effectively captures the impact of the memory hierarchy (two-level cache and TLB) as well as the factors that limit parallelism (contention for locks, memory contention, and coherence cache misses). The difference between predicted and measured performance is within 20% in almost all cases. Moreover, the model is quite accurate in predicting the relative performance of the three parallelization techniques. © 2004 Elsevier B.V. All rights reserved.
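To make the terminology in the abstract concrete, the following is a minimal sketch of a random write reduction parallelized in two ways: full replication (each thread updates a private copy, merged at the end) and a plain one-lock-per-element locking scheme. It is an illustration only, not the authors' implementation: the function names `reduce_full_replication` and `reduce_full_locking` are hypothetical, the locking variant is the basic scheme rather than the paper's optimized or cache-sensitive versions, and Python threads are used to show the structure, not to reproduce any performance behavior.

```python
import threading


def reduce_full_replication(items, num_bins, num_threads):
    """Each thread accumulates into its own private copy of the reduction
    object; the copies are summed element-wise after all threads finish."""
    copies = [[0] * num_bins for _ in range(num_threads)]

    def work(tid):
        local = copies[tid]
        # Thread tid handles every num_threads-th item, starting at tid.
        for idx, val in items[tid::num_threads]:
            local[idx] += val  # no lock needed: the copy is private

    threads = [threading.Thread(target=work, args=(t,)) for t in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # Merge step: element-wise sum across the per-thread copies.
    return [sum(column) for column in zip(*copies)]


def reduce_full_locking(items, num_bins, num_threads):
    """All threads update one shared reduction object; every element has
    its own lock, acquired around each update (basic scheme, assumed here)."""
    result = [0] * num_bins
    locks = [threading.Lock() for _ in range(num_bins)]

    def work(tid):
        for idx, val in items[tid::num_threads]:
            with locks[idx]:
                result[idx] += val

    threads = [threading.Thread(target=work, args=(t,)) for t in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return result
```

Here `items` is assumed to be a list of `(index, value)` pairs, where the index written by each item is not known in advance. Replication trades extra memory and a merge step for lock-free updates, while locking keeps a single copy at the cost of lock overhead and contention, which is the kind of trade-off the abstract says depends on problem, dataset, and machine parameters.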
Pages: 73-105
Number of pages: 33
Related papers
50 items in total
  • [31] Performance and Energy Modeling for Live Migration of Virtual Machines
    Liu, Haikun
    Xu, Cheng-Zhong
    Jin, Hai
    Gong, Jiayu
    Liao, Xiaofei
    HPDC 11: PROCEEDINGS OF THE 20TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2011, : 171 - 181
  • [32] Performance and energy modeling for live migration of virtual machines
    Liu, Haikun
    Jin, Hai
    Xu, Cheng-Zhong
    Liao, Xiaofei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2013, 16 (02): : 249 - 264
  • [33] Performance and energy modeling for live migration of virtual machines
    Haikun Liu
    Hai Jin
    Cheng-Zhong Xu
    Xiaofei Liao
    Cluster Computing, 2013, 16 : 249 - 264
  • [34] Hierarchical methodology for modeling hydrogen storage systems. Part II: Detailed models
    Hardy, Bruce J.
    Anton, Donald L.
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2009, 34 (07) : 2992 - 3004
  • [35] Development of a diesel surrogate for improved autoignition prediction: Methodology and detailed chemical kinetic modeling
    Kukkadapu, Goutham
    Whitesides, Russell
    Wang, Mengyuan
    Wagnon, Scott W.
    Mehl, Marco
    Westbrook, Charles K.
    McCormick, Robert
    Sung, Chih-Jen
    Pitz, William J.
    APPLICATIONS IN ENERGY AND COMBUSTION SCIENCE, 2023, 16
  • [36] Partial data reuse for windowing computations: Performance modeling for FPGA implementations
    Park, Joonseok
    Diniz, Pedro C.
    RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2007, 4419 : 97 - +
  • [37] A Genetic Algorithms Approach to Modeling the Performance of Memory-bound Computations
    Tikir, Mustafa M.
    Carrington, Laura
    Strohmaier, Erich
    Snavely, Allan
    2007 ACM/IEEE SC07 CONFERENCE, 2010, : 11 - +
  • [38] Constitutive modeling of SMA SMP multifunctional high performance smart adaptive shape memory composite
    Jarali, Chetan S.
    Raja, S.
    Upadhya, A. R.
    SMART MATERIALS AND STRUCTURES, 2010, 19 (10)
  • [39] Energy-Based Model Reduction Methodology for Automated Modeling
    Louca, Loucas S.
    Stein, Jeffrey L.
    Hulbert, Gregory M.
    JOURNAL OF DYNAMIC SYSTEMS MEASUREMENT AND CONTROL-TRANSACTIONS OF THE ASME, 2010, 132 (06):
  • [40] EMPIRICAL MODELING OF THE SELECTIVE NONCATALYTIC REDUCTION OF NO - COMPARISON WITH LARGE-SCALE EXPERIMENTS AND DETAILED KINETIC MODELING
    OSTBERG, M
    DAMJOHANSEN, K
    CHEMICAL ENGINEERING SCIENCE, 1994, 49 (12) : 1897 - 1904