Bounds modelling and compiler optimizations for superscalar performance tuning

被引:4
|
作者
Bose, P
Kim, S
O'Connell, FP
Ciarfella, WA
机构
[1] IBM Corp, TJ Watson Res Ctr, Yorktown Heights, NY 10598 USA
[2] IBM Corp, High End Proc Dev, Austin, TX USA
关键词
loop performance; super scalar processors; bounds analysis; compiler optimization; performance tuning;
D O I
10.1016/S1383-7621(98)00053-8
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the floating point microarchitecture support in RISC superscalar processors. We briefly review the fundamental performance trade-offs in the design of such microarchitecutres. We propose a simple, yet effective bounds model to deduce the "best-case" loop performance limits for these processors. We compare these bounds to simulated and real performance measurements. From this study, we identify several loop tuning opportunities. In particular. we illustrate the use of this analysis in suggesting loop unrolling and scheduling heuristics. We report our experimental results in the context of a set of application-based loop test cases. These an designed to stress various resource limits in the core (infinite cache) microarchitecture, (C) 1999 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:1111 / 1137
页数:27
相关论文
共 50 条
  • [1] Studying compiler optimizations on superscalar processors through interval analysis
    Eyerman, Stijn
    Eeckhout, Lieven
    Smith, James E.
    HIGH PERFORMANCE EMBEDDED ARCHITECTURES AND COMPILERS, 2008, 4917 : 114 - +
  • [2] Fast and effective orchestration of compiler optimizations for automatic performance tuning
    Pan, Zhelong
    Eigenmann, Rudolf
    CGO 2006: 4TH INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, 2006, : 319 - +
  • [3] An Automatic Tool for Tuning Compiler Optimizations
    Plotnikov, Dmitry
    Melnik, Dmitry
    Vardanyan, Mamikon
    Buchatskiy, Ruben
    Zhuykov, Roman
    2013 COMPUTER SCIENCE AND INFORMATION TECHNOLOGIES (CSIT), 2013,
  • [4] Tuning compiler optimizations for simultaneous multithreading
    Lo, JL
    Eggers, SJ
    Levy, HM
    Parekh, SS
    Tullsen, DM
    THIRTIETH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS, 1997, : 114 - 124
  • [5] Tuning Compiler Optimizations for Simultaneous Multithreading
    Jack L. Lo
    Susan J. Eggers
    Henry M. Levy
    Sujay S. Parekh
    Dean M. Tullsen
    International Journal of Parallel Programming, 1999, 27 : 477 - 503
  • [6] Tuning compiler optimizations for simultaneous multithreading
    Lo, JL
    Eggers, SJ
    Levy, HM
    Parekh, SS
    Tulsen, DM
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 1999, 27 (06) : 477 - 503
  • [7] Automatic Tuning of Compiler Optimizations and Analysis of their Impact
    Plotnikov, Dmitry
    Melnik, Dmitry
    Vardanyan, Mamikon
    Buchatskiy, Ruben
    Zhuykov, Roman
    Lee, Je-Hyung
    2013 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2013, 18 : 1312 - 1321
  • [8] Performance instrumentation and compiler optimizations for MPI/OpenMP applications
    Hernandez, Oscar
    Song, Fengguang
    Chapman, Barbara
    Dongarra, Jack
    Mohr, Bernd
    Moore, Shirley
    Wolf, Felix
    OPENMP SHARED MEMORY PARALLEL PROGRAMMING, PROCEEDINGS, 2008, 4315 : 267 - +
  • [9] Compiler Optimizations for OpenMP
    Doerfert, Johannes
    Finkel, Hal
    EVOLVING OPENMP FOR EVOLVING ARCHITECTURES, 2018, 11128 : 113 - 127
  • [10] COMPILER OPTIMIZATIONS FOR THE WAM
    TURK, AK
    LECTURE NOTES IN COMPUTER SCIENCE, 1986, 225 : 657 - 662