An effective fault-tolerant routing methodology for direct networks

被引:0
|
作者
Gómez, ME [1 ]
Flich, J [1 ]
López, P [1 ]
Robles, A [1 ]
Duato, J [1 ]
Nordbotten, NA [1 ]
Lysne, O [1 ]
Skeie, T [1 ]
机构
[1] Univ Politecn Valencia, Dept Comp Engn, E-46071 Valencia, Spain
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Current massively parallel computing systems are being built with thousands of nodes, which significantly affects the probability of failure. In [14], we proposed a methodology to design fault-tolerant routing algorithms for direct interconnection networks. The methodology uses a simple mechanism: for some source-destination pairs, packets are first forwarded to an intermediate node, and later, from this node to the destination node. Minimal adaptive routing is used along both subpaths. For those cases where the methodology cannot find a suitable intermediate node, it combines the use of intermediate nodes with two additional mechanisms: disabling adaptive routing and using misrouting on a per-packet basis. While the combination of these three mechanisms tolerates a large number of faults, each one requires adding some hardware support in the network and also introduces some overhead. In this paper, we will perform an in-depth detailed analysis of the impact of these mechanisms on network behaviour. We will analyze the impact of the three mechanisms separately and combined. The ultimate goal of this paper is to obtain a suitable combination of mechanisms that is able to meet the trade-off between fault-tolerance degree, routing complexity, and performance.
引用
收藏
页码:222 / 231
页数:10
相关论文
共 50 条
  • [31] Fault-tolerant Greedy Forest Routing for Complex Networks
    Houthooft, Rein
    Sahhaf, Sahel
    Tavernier, Wouter
    De Turck, Filip
    Colle, Didier
    Pickavet, Mario
    2014 6TH INTERNATIONAL WORKSHOP ON RELIABLE NETWORKS DESIGN AND MODELING (RNDM), 2014, : 1 - 8
  • [32] Fault-tolerant routing algorithms for hypercube interconnection networks
    Kaneko, Keiichi
    Ito, Hideo
    IEICE Transactions on Information and Systems, 2001, E84-D (01) : 121 - 128
  • [33] A New Fault-Tolerant Routing Methodology for KNS Topologies.
    Penaranda, Roberto
    Gran, Ernst Gunnar
    Skeie, Tor
    Engracia Gomez, Maria
    Lopez, Pedro
    2016 2ND IEEE INTERNATIONAL WORKSHOP ON HIGH-PERFORMANCE INTERCONNECTION NETWORKS IN THE EXASCALE AND BIG-DATA ERA (HIPINEB), 2016, : 1 - 8
  • [34] DISTRIBUTED AND FAULT-TOLERANT ROUTING IN LEO SATELLITE NETWORKS
    Lu, Yong
    Zhao, Youjian
    Sun, Fuchun
    Yang, Zhian
    FIFTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING (ICACTE 2012), 2012, : 789 - 798
  • [35] Fault-tolerant routing algorithms for hypercube interconnection networks
    Kaneko, K
    Ito, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (01): : 121 - 128
  • [36] FAULT-TOLERANT ROUTING IN DEBRUIJN COMMUNICATION-NETWORKS
    ESFAHANIAN, AH
    HAKIMI, SL
    IEEE TRANSACTIONS ON COMPUTERS, 1985, 34 (09) : 777 - 788
  • [37] Fault-tolerant routing in dual-cube networks
    Jiang, Z
    Wu, J
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 389 - 392
  • [38] Optimal fault-tolerant routing algorithm and fault-tolerant diameter in directed double-loop networks
    Chen, Yebin
    Li, Ying
    Chen, Tao
    THEORETICAL COMPUTER SCIENCE, 2013, 468 : 50 - 58
  • [39] Shortest path routing and fault-tolerant routing on de Bruijn networks
    Mao, JW
    Yang, CB
    NETWORKS, 2000, 35 (03) : 207 - 215
  • [40] Effective Solution for Scalability and Productivity Improvement in Fault-Tolerant Routing
    Lemeshko, Oleksandr
    Arous, Kinan
    Tariki, Nadia
    2015 SECOND INTERNATIONAL SCIENTIFIC-PRACTICAL CONFERENCE PROBLEMS OF INFOCOMMUNICATIONS SCIENCE AND TECHNOLOGY (PIC S&T 2015), 2015, : 76 - 78