SMT Malleability in IBM POWER5 and POWER6 Processors

被引:6
|
作者
Morari, Alessandro [1 ]
Boneti, Carlos [2 ]
Cazorla, Francisco J. [3 ]
Gioiosa, Roberto [3 ]
Cher, Chen-Yong [4 ]
Buyuktosunoglu, Alper [4 ]
Bose, Pradip [4 ]
Valero, Mateo [5 ]
机构
[1] Pacific NW Natl Lab, Richland, WA 99352 USA
[2] Schlumberger Brazil Res & Geoengn Ctr BRGC, Houston, TX 77056 USA
[3] Barcelona Supercomp Ctr, Barcelona 08034, Spain
[4] Thomas J Watson Res Ctr, New York, NY USA
[5] Tech Univ Catalonia, Barcelona 08034, Spain
关键词
Malleability; simultaneous multithreading; hardware-thread priorities; IBM POWER5; IBM POWER6; PERFORMANCE; CHIP; QOS;
D O I
10.1109/TC.2012.34
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While several hardware mechanisms have been proposed to control the interaction between hardware threads in an SMT processor, few have addressed the issue of software-controllable SMT performance. The IBM POWER5 and POWER6 are the first high-performance processors implementing a software-controllable hardware-thread prioritization mechanism that controls the rate at which each hardware-thread decodes instructions. This paper shows the potential of this basic mechanism to improve several target metrics for various applications on POWER5 and POWER6 processors. Our results show that although the software interface is exactly the same, the software-controlled priority mechanism has a different effect on POWER5 and POWER6. For instance, hardware threads in POWER6 are less sensitive to priorities than in POWER5 due to the in order design. We study the SMT thread malleability to enable user-level optimizations that leverage software-controlled thread priorities. We also show how to achieve various system objectives such as parallel application load balancing, in order to reduce execution time. Finally, we characterize user-level transparent execution on POWER5 and POWER6, and identify the workload mix that best benefits from it.
引用
收藏
页码:813 / 826
页数:14
相关论文
共 50 条
  • [21] POWER5 system microarchitecture
    Sinharoy, Balaram
    Kalla, Ronald N.
    Tendler, Joel M.
    Eickemeyer, Richard J.
    Joyner, Jody B.
    IBM Journal of Research and Development, 1600, 49 (4-5): : 505 - 521
  • [23] Power and Thermal Characterization of POWER6 System
    Jimenez, Victor
    Cazorla, Francisco J.
    Gioiosa, Roberto
    Valero, Mateo
    Boneti, Carlos
    Kursun, Eren
    Cher, Chen-Yong
    Isci, Canturk
    Buyuktosunoglu, Alper
    Bose, Pradip
    PACT 2010: PROCEEDINGS OF THE NINETEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2010, : 7 - 18
  • [24] POWER6 引燃改变
    刘子倩
    尚舞, 2021, (01) : 19 - 21+18
  • [26] POWER5 and packaging - Preface
    Tendler, JM
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2005, 49 (4-5) : 503 - 504
  • [27] POWER5 system microarchitecture
    Sinharoy, B
    Kalla, RN
    Tendler, JM
    Eickemeyer, RJ
    Joyner, JB
    IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2005, 49 (4-5) : 505 - 521
  • [29] Power6 decimal divide
    Schwarz, Eric M.
    Carlough, Steven R.
    2007 IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, 2007, : 128 - 133
  • [30] Power6威力再现
    李楠
    互联网周刊, 2007, (11) : 68 - 69