An effective fault-tolerant routing methodology for direct networks

被引:0
|
作者
Gómez, ME [1 ]
Flich, J [1 ]
López, P [1 ]
Robles, A [1 ]
Duato, J [1 ]
Nordbotten, NA [1 ]
Lysne, O [1 ]
Skeie, T [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn, E-46071 Valencia, Spain
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Current massively parallel computing systems are being built with thousands of nodes, which significantly affects the probability of failure. In [14], we proposed a methodology to design fault-tolerant routing algorithms for direct interconnection networks. The methodology uses a simple mechanism: for some source-destination pairs, packets are first forwarded to an intermediate node, and later, from this node to the destination node. Minimal adaptive routing is used along both subpaths. For those cases where the methodology cannot find a suitable intermediate node, it combines the use of intermediate nodes with two additional mechanisms: disabling adaptive routing and using misrouting on a per-packet basis. While the combination of these three mechanisms tolerates a large number of faults, each one requires adding some hardware support in the network and also introduces some overhead. In this paper, we will perform an in-depth detailed analysis of the impact of these mechanisms on network behaviour. We will analyze the impact of the three mechanisms separately and combined. The ultimate goal of this paper is to obtain a suitable combination of mechanisms that is able to meet the trade-off between fault-tolerance degree, routing complexity, and performance.
引用
收藏
页码:222 / 231
页数:10
相关论文
共 50 条
  • [21] A Fault-Tolerant Routing Scheme in Dynamic Networks
    冯秀山
    韩承德
    Journal of Computer Science and Technology, 2001, (04) : 371 - 380
  • [22] Fault-tolerant routing methodology for hypercube and cube-connected cycles interconnection networks
    Habibian, Hossein
    Patooghy, Ahmad
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (10): : 4560 - 4579
  • [23] Fault-tolerant routing methodology for hypercube and cube-connected cycles interconnection networks
    Hossein Habibian
    Ahmad Patooghy
    The Journal of Supercomputing, 2017, 73 : 4560 - 4579
  • [24] Fault-tolerant routing in circulant networks and cycle prefix networks
    Sheng-Chyang Liaw
    Gerald J. Chang
    Feng Cao
    D. Frank Hsu
    Annals of Combinatorics, 1998, 2 (2) : 165 - 172
  • [25] FAULT-TOLERANT ROUTING IN THE STAR AND PANCAKE INTERCONNECTION NETWORKS
    GARGANO, L
    VACCARO, U
    VOZELLA, A
    INFORMATION PROCESSING LETTERS, 1993, 45 (06) : 315 - 320
  • [26] A fault-tolerant routing protocol in wireless sensor networks
    Chao, Hsi-Lu
    Chang, Chen-Lung
    International Journal of Sensor Networks, 2008, 3 (01) : 66 - 73
  • [27] FAULT-TOLERANT WORMHOLE ROUTING ALGORITHMS FOR MESH NETWORKS
    BOPPANA, RV
    CHALASANI, S
    IEEE TRANSACTIONS ON COMPUTERS, 1995, 44 (07) : 848 - 864
  • [28] Fault-tolerant wormhole routing algorithm for mesh networks
    Sui, PH
    Wang, SD
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2000, 147 (01): : 9 - 14
  • [29] Fault-Tolerant Routing With Load Balancing in LeTQ Networks
    Fan, Weibei
    Xiao, Fu
    Fan, Jianxi
    Han, Zhijie
    Sun, Lijuan
    Wang, Ruchuan
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (01) : 68 - 82
  • [30] Adaptive fault-tolerant wormhole routing for torus networks
    Shih, JD
    1998 INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 558 - 565