An empirical study of data speculation use on the Intel Itanium 2 processor

Cited by: 1
Authors
Mock, M [1]
Villamarín, R [1]
Baiocchi, J [1]
Affiliation
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
DOI
10.1109/INTERACT.2005.2
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The Intel Itanium architecture uses a dedicated 32-entry hardware table, the Advanced Load Address Table (ALAT), to support data speculation via an instruction set interface. This study presents an empirical evaluation of the use of the ALAT and data speculative instructions for several optimizing compilers. We determined which speculative instructions the compilers generated and how often, and used the Itanium's hardware performance counters to evaluate their run-time behavior. We also performed a limit study by modifying one compiler to always generate data speculation when possible. We found that this aggressive approach significantly increased the amount of data speculation and often resulted in performance improvements of as much as 10% in one case. Since it worsened performance for only one application, and then only for some inputs, we conclude that more aggressive data speculation heuristics than those employed by current compilers are desirable and may further improve performance gains from data speculation.
Pages: 22-33
Number of pages: 12