A low-cost fault-tolerant structure for the hypercube

被引:5
|
作者
Wang, DJ [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[2] Montclair State Univ, Dept Comp Sci, Montclair, NJ 07043 USA
来源
JOURNAL OF SUPERCOMPUTING | 2001年 / 20卷 / 03期
关键词
diagnosability; fault tolerance; hypercubes; interconnection networks; redundant systems;
D O I
10.1023/A:1011636631661
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new, low-cost fault-tolerant structure for the hypercube that employs spare processors and extra links. The target of the proposed structure is to fully tolerate the first faulty node, no matter where it occurs, and "almost fully" tolerate the second, meaning that the underlying hypercube topology can be resumed if the second faulty node occurs at most locations-expectantly 92% of locations. The unique features of our structure are that (1) it utilizes the unused extra link-ports in the processor nodes of the hypercube to obtain the proposed topology, so that minimum extra hardware is needed in constructing the fault-tolerant structure and (2) the structure's node-degrees are low as desired-the primary and spare nodes all have node-degrees of n + 2 for an n-dimensional hypercube. The number of spare nodes is one fourth of primary nodes. The reconfiguration algorithm in the presence of faults is elegant and efficient. The proposed structure also effectively enhances the diagnosability of the hypercube system. It is shown that the diagnosability of the structure is increased to n + 2, whereas an ordinary n-dimensional hypercube has diagnosability n.
引用
收藏
页码:203 / 216
页数:14
相关论文
共 50 条
  • [21] Fault-Tolerant LU Factorization Is Low Cost
    Coti, Camille
    Petrucci, Laure
    Gonzalez, Daniel Alberto Torres
    EURO-PAR 2021: PARALLEL PROCESSING, 2021, 12820 : 536 - 549
  • [22] A FAULT-TOLERANT COMMUNICATION SCHEME FOR HYPERCUBE COMPUTERS
    LEE, TC
    HAYES, JP
    IEEE TRANSACTIONS ON COMPUTERS, 1992, 41 (10) : 1242 - 1256
  • [23] A RECONFIGURABLE MODULAR FAULT-TOLERANT HYPERCUBE ARCHITECTURE
    YANG, CS
    ZU, LP
    WU, YN
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (10) : 1018 - 1032
  • [24] ADAPTIVE FAULT-TOLERANT ROUTING IN HYPERCUBE MULTICOMPUTERS
    CHEN, MS
    SHIN, KG
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (12) : 1406 - 1416
  • [25] Fault-tolerant fixed routing in hypercube generalizations
    Lankinen, A
    Nieminen, J
    Peltola, M
    Ruotsalainen, P
    INDIAN JOURNAL OF PURE & APPLIED MATHEMATICS, 2002, 33 (07): : 1053 - 1076
  • [26] ADAPTIVE FAULT-TOLERANT MULTICAST IN HYPERCUBE MULTICOMPUTERS
    LAN, YR
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 23 (01) : 80 - 93
  • [27] PERFORMANCE OF FAULT-TOLERANT DIAGNOSTICS IN THE HYPERCUBE SYSTEMS
    GHAFOOR, A
    SOLE, P
    IEEE TRANSACTIONS ON COMPUTERS, 1989, 38 (08) : 1164 - 1172
  • [28] A fault-tolerant routing strategy in hypercube multicomputers
    Chiu, GM
    Wu, SP
    IEEE TRANSACTIONS ON COMPUTERS, 1996, 45 (02) : 143 - 155
  • [29] FAULT-TOLERANT MATRIX OPERATIONS ON HYPERCUBE MULTIPROCESSORS
    ELSTER, AC
    UYAR, MU
    REEVES, AP
    PROCEEDINGS OF THE 1989 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, VOL 3: ALGORITHMS AND APPLICATIONS, 1989, : 169 - 176
  • [30] Fault-tolerant routing algorithms for hypercube networks
    Kaneko, K
    Ito, H
    IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, : 218 - 224