An empirical study of data speculation use on the Intel Itanium 2 processor

被引:1
|
作者
Mock, M [1 ]
Villamarín, R [1 ]
Baiocchi, J [1 ]
机构
[1] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
关键词
D O I
10.1109/INTERACT.2005.2
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Intel Itanium architecture uses a dedicated 32-entrv hardware table, the Advanced Load Address Table (ALAT) to support data speculation via an instruction set interface. This study presents an empirical evaluation of the use of the ALAT and data speculative instructions for several optimizing compilers. We determined what and how often compilers generated the different speculative instructions, and used the Itanium's hardware performance counters to evaluate their run-time behavior We also performed a limit Study by modifying one compiler to always generate data speculation when possible. We found that this aggressive approach significantly increased the amount of data speculation and often resulted in performance improvements, of as much as 10% in one case. Since it worsened performance only for one application and then only for some inputs, we conclude that more aggressive data speculation heuristics than those employed by current compilers are desirable and may further improve performance gains from data speculation.
引用
收藏
页码:22 / 33
页数:12
相关论文
共 50 条
  • [21] INSPECTOR: Data Provenance using Intel Processor Trace (PT)
    Thalheim, Joerg
    Bhatotia, Pramod
    Fetzer, Christof
    PROCEEDINGS 2016 IEEE 36TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2016, 2016, : 25 - 34
  • [22] Performance analysis and tracing of technical and Java']Java applications on the itanium 2 processor
    Hassanein, W
    Astfalk, G
    Eigenmann, R
    ISPASS: 2003 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE, 2003, : 91 - 100
  • [23] INSPECTOR: Data Provenance Using Intel Processor Trace (PT)
    1600, Institute of Electrical and Electronics Engineers Inc., United States (2016-August):
  • [24] A 65 nm 2-Billion Transistor Quad-Core Itanium Processor
    Stackhouse, Blaine
    Bhimji, Sal
    Bostak, Chris
    Bradley, Dave
    Cherkauer, Brian
    Desai, Jayen
    Francom, Erin
    Gowan, Mike
    Gronowski, Paul
    Krueger, Dan
    Morganti, Charles
    Troyer, Steve
    IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2009, 44 (01) : 18 - 31
  • [25] Processor Controlled Test Development : A Case Study with an Intel i7 Processor Board
    Er, Nur Baki
    Aydogan, Ali
    Kesim, Hayrettin
    2015 IEEE AUTOTESTCON, 2015, : 47 - 51
  • [26] On the power of bitslice implementation on intel core2 processor
    Matsui, Mitsuru
    Nakajima, Junko
    CRYPTOGRAPHIC HARDWARE AND EMBEDDED SYSTEMS - CHES 2007, PROCEEDINGS, 2007, 4727 : 121 - +
  • [27] Exploring Better Speculation and Data Locality in Sparse Matrix-Vector Multiplication on Intel Xeon
    Zhao, Haoran
    Xia, Tian
    Li, Chenyang
    Zhao, Wenzhe
    Zheng, Nanning
    Ren, Pengju
    2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 601 - 609
  • [28] INTERTEMPORAL COMPETITIVE EQUILIBRIUM - EMPIRICAL-STUDY OF SPECULATION
    MILLER, RM
    PLOTT, CR
    SMITH, VL
    QUARTERLY JOURNAL OF ECONOMICS, 1977, 91 (04): : 599 - 624
  • [29] A Comparative Study of Application Performance and Scalability on the Intel Knights Landing Processor
    Rosales, Carlos
    Cazes, John
    Milfeld, Kent
    Gomez-Iglesias, Antonio
    Koesterke, Lars
    Huang, Lei
    Vienne, Jerome
    HIGH PERFORMANCE COMPUTING, ISC HIGH PERFORMANCE 2016 INTERNATIONAL WORKSHOPS, 2016, 9945 : 307 - 318
  • [30] Itanium 2 processor 6M: Higher frequency and larger L3 cache
    Rusu, S
    Muljono, H
    Cherkauer, B
    IEEE MICRO, 2004, 24 (02) : 10 - 18