Modeling and Analysis of Fault-tolerant Distributed Memories for Networks-on-Chip

被引:0
|
作者
BanaiyanMofrad, Abbas [1 ]
Dutt, Nikil [1 ]
Girao, Gustavo [2 ]
机构
[1] Univ Calif Irvine, Ctr Embedded Comp Syst, Irvine, CA 92697 USA
[2] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in technology scaling increasingly make Network-on-Chips (NoCs) more susceptible to failures that cause various reliability challenges. With increasing area occupied by different on-chip memories, strategies for maintaining fault-tolerance of distributed on-chip memories become a major design challenge. We propose a system-level design methodology for scalable fault-tolerance of distributed on-chip memories in NoCs. We introduce a novel reliability clustering model for fault-tolerance analysis and shared redundancy management of onchip memory blocks. We perform extensive design space exploration applying the proposed reliability clustering on a block-redundancy fault-tolerant scheme to evaluate the tradeoffs between reliability, performance, and overheads. Evaluations on a 64-core chip multiprocessor (CMP) with an 8x8 mesh NoC show that distinct strategies of our case study may yield up to 20% improvements in performance gains and 25% improvement in energy savings across different benchmarks, and uncover interesting design configurations.
引用
收藏
页码:1605 / 1608
页数:4
相关论文
共 50 条
  • [31] A Runtime Fault-Tolerant Routing Scheme for Partially Connected 3D Networks-on-Chip
    Coelho, Alexandre
    Charif, Amir
    Zergainoh, Nacer-Eddine
    Velazco, Raoul
    2018 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), 2018,
  • [32] Fault-Tolerant Networks-on-Chip Routing With Coarse and Fine-Grained Look-Ahead
    Liu, Junxiu
    Harkin, Jim
    Li, Yuhua
    Maguire, Liam P.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2016, 35 (02) : 260 - 273
  • [33] Fault-tolerant Communication in Invasive Networks on Chip
    Heisswolf, Jan
    Weichslgartner, Andreas
    Zaib, Aurang
    Friederich, Stephanie
    Masing, Leonard
    Stein, Carsten
    Duden, Marco
    Kloepfer, Roman
    Teich, Juergen
    Wild, Thomas
    Herkersdorf, Andreas
    Becker, Juergen
    2015 NASA/ESA CONFERENCE ON ADAPTIVE HARDWARE AND SYSTEMS (AHS), 2015,
  • [34] Reconfigurable fault tolerant routing for networks-on-chip with logical hierarchy
    Schley, Gert
    Ahmed, Ibrahim
    Afzal, Muhammad
    Radetzki, Martin
    COMPUTERS & ELECTRICAL ENGINEERING, 2016, 51 : 195 - 206
  • [35] Mapping a Fault-Tolerant Distributed Algorithm to Systems on Chip
    Fuchs, Gottfried
    Fuegger, Matthias
    Schmid, Ulrich
    Steininger, Andreas
    11TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN - ARCHITECTURES, METHODS AND TOOLS : DSD 2008, PROCEEDINGS, 2008, : 242 - 249
  • [36] A New Scalable Fault Tolerant Routing Algorithm for Networks-on-Chip
    Kia, Hamed Sajjadi
    Ababei, Cristinel
    Srinivasan, Sudarshan
    Jabeen, Shaista
    2015 IEEE 58TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2015,
  • [37] Minimal-Path Fault-Tolerant Approach Using Connection-Retaining Structure in Networks-on-Chip
    Ebrahimi, Masoumeh
    Daneshtalab, Masoud
    Plosila, Juha
    Tenhunen, Hannu
    2013 SEVENTH IEEE/ACM INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP (NOCS 2013), 2013,
  • [38] DISTRIBUTED RECOVERY IN FAULT-TOLERANT MULTIPROCESSOR NETWORKS
    YANNEY, RM
    HAYES, JP
    IEEE TRANSACTIONS ON COMPUTERS, 1986, 35 (10) : 871 - 879
  • [39] ON RELIABILITY MODELING OF FAULT-TOLERANT DISTRIBUTED SYSTEMS
    THAMBIDURAI, P
    PARK, YK
    TRIVEDI, KS
    9TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 1989, : 136 - 142
  • [40] ON THE FAULT-TOLERANT ROUTING IN DISTRIBUTED LOOP NETWORKS
    Liu Huanping Yang Yixian (Po Box 126
    Journal of Electronics(China), 2000, (01) : 84 - 89