BOUNDS ON ALGORITHM-BASED FAULT TOLERANCE IN MULTIPLE PROCESSOR SYSTEMS.

被引:38
|
作者
Banerjee, Prithviraj [1 ]
Abraham, Jacob A. [1 ]
机构
[1] Univ of Illinois, Urbana, IL, USA, Univ of Illinois, Urbana, IL, USA
关键词
COMPUTER PROGRAMMING - Algorithms - MATHEMATICAL PROGRAMMING; LINEAR - MATHEMATICAL TECHNIQUES - Graph Theory;
D O I
10.1109/TC.1986.1676762
中图分类号
学科分类号
摘要
The authors present a graph-theoretic model for determining upper and lower bounds on the number of checks needed for achieving concurrent fault detection and location. The objective is to estimate the overhead in time and the number of processors required for such a scheme. Faults in processors, errors in the data, and checks on the data to detect and locate errors are represented as a tripartite graph. Bounds on the time and processor overhead are obtained by considering a series of subproblems. First, using some crude concepts for t-fault detection and t-fault location, bounds on the maximum size of the error patterns that can arise from such fault patterns are obtained. Using these results, bounds are derived on the number of checks required for error detection and location. Some numerical results are derived from a linear programming formulation.
引用
收藏
页码:296 / 306
相关论文
共 50 条
  • [1] BOUNDS ON ALGORITHM-BASED FAULT TOLERANCE IN MULTIPLE PROCESSOR SYSTEMS
    BANERJEE, P
    ABRAHAM, JA
    IEEE TRANSACTIONS ON COMPUTERS, 1986, 35 (04) : 296 - 306
  • [2] IMPROVED BOUNDS FOR ALGORITHM-BASED FAULT-TOLERANCE
    ROSENKRANTZ, DJ
    RAVI, SS
    IEEE TRANSACTIONS ON COMPUTERS, 1993, 42 (05) : 630 - 635
  • [3] FAULT-TOLERANCE CONSIDERATIONS IN LARGE, MULTIPLE-PROCESSOR SYSTEMS.
    Kuhl, Jon G.
    Reddy, Sudhakar M.
    Computer, 1986, 19 (04) : 56 - 67
  • [4] Algorithm-based fault tolerance: a review
    Vijay, M
    Mittal, R
    MICROPROCESSORS AND MICROSYSTEMS, 1997, 21 (03) : 151 - 161
  • [5] ALGORITHM-BASED FAULT TOLERANCE FOR MATRIX OPERATIONS
    HUANG, KH
    ABRAHAM, JA
    IEEE TRANSACTIONS ON COMPUTERS, 1984, 33 (06) : 518 - 528
  • [6] AN ANALYSIS OF ALGORITHM-BASED FAULT TOLERANCE TECHNIQUES
    LUK, FT
    PARK, H
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1988, 5 (02) : 172 - 184
  • [7] ALGORITHM-BASED FAULT TOLERANCE ON A HYPERCUBE MULTIPROCESSOR
    BANERJEE, P
    RAHMEH, JT
    STUNKEL, C
    NAIR, VS
    ROY, K
    BALASUBRAMANIAN, V
    ABRAHAM, JA
    IEEE TRANSACTIONS ON COMPUTERS, 1990, 39 (09) : 1132 - 1145
  • [8] Algorithm-based fault tolerance for dense matrix factorizations, multiple failures and accuracy
    Bouteiller, Aurelien
    Herault, Thomas
    Bosilca, George
    Du, Peng
    Dongarra, Jack
    ACM Transactions on Parallel Computing, 2015, 1 (02)
  • [9] Combinatorial analysis of check set construction for algorithm-based fault tolerance systems
    Wang, DQ
    Zhao, LC
    JOURNAL OF ELECTRONIC TESTING-THEORY AND APPLICATIONS, 1998, 12 (03): : 255 - 260
  • [10] Online Algorithm-Based Fault Tolerance for Cholesky Decomposition on Heterogeneous Systems with GPUs
    Chen, Jieyang
    Liang, Xin
    Chen, Zizhong
    2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 993 - 1002