TOLERATING FAULTS IN HYPERCUBES USING SUBCUBE PARTITIONING

被引:27
|
作者
BRUCK, J
CYPHER, R
SOROKER, D
机构
[1] IBM Almaden Research Center, San Jose
[2] Shell Development Company
关键词
FAULT-TOLERANCE; HYPERCUBES; PARALLEL COMPUTING; RECONFIGURATION; SUBCUBES;
D O I
10.1109/12.142686
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We examine the issue of running algorithms on a hypercube which has both node and edge faults, and we assume a worst case distribution of the faults. We prove that for any constant c, an n-dimensional hypercube (n-cube) with n(c) faulty components contains a fault-free subgraph that can implement a large class of hypercube algorithms with only a constant factor slowdown. In addition, our approach yields practical implementations for small numbers of faults. For example, we show that any regular algorithm can be implemented on an n-cube that has at most n - 1 faults with slowdowns of at most 2 for computation and at most 4 for communication. To the best of our knowledge this is the first result showing that an n-cube can tolerate more than O(n) arbitrarily placed faults with a constant factor slowdown.
引用
收藏
页码:599 / 605
页数:7
相关论文
共 50 条
  • [41] Long paths in hypercubes with a quadratic number of faults
    Dvorak, Tomas
    Koubek, Vaclav
    INFORMATION SCIENCES, 2009, 179 (21) : 3763 - 3771
  • [42] EDGE-BIPANCYCLICITY OF HYPERCUBES WITH CONDITIONAL FAULTS
    Sun, Chao-Ming
    JOURNAL OF INTERCONNECTION NETWORKS, 2011, 12 (04) : 337 - 343
  • [43] A dynamic replica selection algorithm for tolerating timing faults
    Krishnamurthy, S
    Sanders, WH
    Cukier, M
    INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2001, : 107 - 116
  • [44] Wafer-scale diagnosis tolerating comparator faults
    Istituto di Elaborazione, Informazione Del CNR, VIA Santa Maria, 46, 1-56126 Pisa, Italy
    IEE Proc Comput Digital Tech, 4 (211-215):
  • [45] Tolerating transient faults through an instruction reissue mechanism
    Sato, T
    Arita, I
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, 2001, : 240 - 247
  • [46] An adaptive algorithm for tolerating value faults and crash failures
    Ren, YS
    Cukier, M
    Sanders, WH
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, 12 (02) : 173 - 192
  • [47] Tolerating Transient Communication Faults with Online Traffic Scheduling
    Marques, Luis
    Vasconcelos, Veronica
    Pedreiras, Paulo
    Almeida, Luis
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2012, : 396 - 402
  • [48] Mutual Visibility for Robots with Lights Tolerating Light Faults
    Sharma, Gokarna
    2018 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2018), 2018, : 829 - 836
  • [49] Design methodologies for tolerating cell and interconnect faults in FPGAs
    Hanchek, F
    Dutt, S
    INTERNATIONAL CONFERENCE ON COMPUTER DESIGN - VLSI IN COMPUTERS AND PROCESSORS, PROCEEDINGS, 1996, : 326 - 331
  • [50] Gathering of Mobile Robots Tolerating Multiple Crash Faults
    Bouzid, Zohir
    Das, Shantanu
    Tixeuil, Sebastien
    2013 IEEE 33RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2013, : 337 - 346