The case for lifetime reliability-aware microprocessors

被引:0
|
作者
Srinivasan, J [1 ]
Adve, SV [1 ]
Bose, P [1 ]
Rivers, JA [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 60680 USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Ensuring long processor lifetimes by limiting failures due to wear-out related hard errors is a critical requirement for all microprocessor manufacturers. We observe that continuous device scaling and increasing temperatures are making lifetime reliability targets even harder to meet. However current methodologies for qualifying lifetime reliability are overly conservative since they assume worst-case operating conditions. This paper makes the case that the continued use of such methodologies will significantly and unnecessarily constrain performance. Instead, lifetime reliability awareness at the microarchitectural design stage can mitigate this problem, by designing processors that dynamically adapt in response to the observed usage to meet a reliability target. We make two specific contributions. First, we describe an architecture-level model and its implementation, called RAMP that can dynamically track lifetime reliability, responding to changes in application behavior RAMP is based on state-of-the-art device models for different wearout mechanisms. Second, we propose dynamic reliability management (DRM) - a technique where the processor can respond to changing application behavior to maintain its lifetime reliability target. In contrast to current worst-case behavior based reliability qualification methodologies, DRM allows processors to be qualified for reliability at lower (but more likely) operating points than the worst case. Using RAMP, we show that this can save cost and/or improve performance, that dynamic voltage scaling is an effective response technique for DRM, and that dynamic thermal management neither subsumes nor is subsumed by DRM.
引用
收藏
页码:276 / 287
页数:12
相关论文
共 50 条
  • [1] A Case for Lifetime Reliability-Aware Neuromorphic Computing
    Song, Shihao
    Das, Anup
    2020 IEEE 63RD INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2020, : 596 - 598
  • [2] Lifetime Reliability-Aware Digital Synthesis
    Duan, Shengyu
    Zwolinski, Mark
    Halak, Basel
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2018, 26 (11) : 2205 - 2216
  • [3] Interconnect lifetime prediction for reliability-aware systems
    Lu, Zhijian
    Huang, Wei
    Stan, Mircea R.
    Skadron, Kevin
    Lach, John
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2007, 15 (02) : 159 - 172
  • [4] Lifetime Reliability-Aware Checkpointing Mechanism: Modelling and Analysis
    bin Bandan, Mohamad Imran
    Bhattacharjee, Subhasis
    Shafik, Rishad A.
    Pradhan, Dhiraj K.
    Mathew, Jimson
    2013 INTERNATIONAL SYMPOSIUM ON ELECTRONIC SYSTEM DESIGN (ISED), 2013, : 128 - 132
  • [5] Lifetime Reliability-Aware Task Allocation and Scheduling for MPSoC Platforms
    Huang, Lin
    Yuan, Feng
    Xu, Qiang
    DATE: 2009 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2009, : 51 - 56
  • [6] Reliability-Aware Runahead
    Naithani, Ajeya
    Eeckhout, Lieven
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 786 - 799
  • [7] Interconnect lifetime prediction under dynamic stress for reliability-aware design
    Lu, ZJ
    Huang, W
    Lach, J
    Stan, M
    Skadron, K
    ICCAD-2004: INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, IEEE/ACM DIGEST OF TECHNICAL PAPERS, 2004, : 327 - 334
  • [8] Energy Efficient Lifetime Reliability-Aware Checkpointing for Real-Time System
    Bin Bandant, Mohamad Imran
    Bhattacharjeel, Subhasis
    Pradhanl, Dhiraj K.
    Matthews, Jimson
    JOURNAL OF LOW POWER ELECTRONICS, 2014, 10 (03) : 401 - 416
  • [9] Reliability-aware system synthesis
    Glass, Michael
    Lukasiewycz, Martin
    Streichert, Thilo
    Haubelt, Christian
    Teich, Juergen
    2007 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, VOLS 1-3, 2007, : 409 - 414
  • [10] On the reliability-aware geographic routing
    Taha, ZQ
    Liu, M
    2005 Wireless Telecommunications Symposium, 2005, : 74 - 78