THE ENGINEERING OF FAULT-TOLERANT DISTRIBUTED COMPUTING SYSTEMS

Cited by: 0
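Author
BABAOGLU, O [1]
Affiliation
[1] CORNELL UNIV, DEPT COMP SCI, ITHACA, NY 14853
Keywords
DOI
Not available
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract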
We view the design of fault-tolerant computing systems as an engineering endeavor. As such, this activity requires understanding the theoretical limitations and the scope of the feasible designs. We survey the impact that various environment characteristics and design choices have on the resultant system properties. We propose a single metric, the system reliability, as an appropriate measure for exploring tradeoffs across a potentially large design space.
Pages: 262-273
Number of pages: 12
Related papers
50 in total
  • [21] Communication pattern based checkpointing coordination for fault-tolerant distributed computing systems
    Park, T
    Yeom, HY
    TWELFTH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN-12), PROCEEDINGS, 1998, : 559 - 562
  • [22] A hybrid and adaptive model for fault-tolerant distributed computing
    Gorender, S
    Macêdo, R
    Raynal, M
    2005 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2005, : 412 - 421
  • [23] Active fault-tolerant system for open distributed computing
    Lanka, Rodrigo
    Oda, Kentaro
    Yoshida, Takaichi
    AUTONOMIC AND TRUSTED COMPUTING, PROCEEDINGS, 2006, 4158 : 581 - 590
  • [24] Fundamentals of fault-tolerant distributed computing in asynchronous environments
    Gärtner, FC
    ACM COMPUTING SURVEYS, 1999, 31 (01) : 1 - 26
  • [25] Fault-tolerant distributed mass storage for LHC computing
    Wiebalck, A
    Breuer, PT
    Lindenstruth, V
    Steinbeck, TM
    CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, : 266 - 273
  • [26] Spatial Data Locality in Scalable and Fault-tolerant Distributed Spatial Computing Systems
    Werner, Martin
    BIGSPATIAL 2018: PROCEEDINGS OF THE 7TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2018), 2018, : 47 - 56
  • [27] An adaptive programming model for fault-tolerant distributed computing
    Gorender, Sergio
    Macêdo, Raimundo José de Araújo
    Raynal, Michel
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2007, 4 (01) : 18 - 31
  • [28] A dynamic fault-tolerant model for open distributed computing
    Lanka, Rodrigo
    Oda, Kentaro
    Najima, Horoki
    Yoshida, Takaichi
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 25 - +
  • [29] GRAPH MODEL FOR FAULT-TOLERANT COMPUTING SYSTEMS
    HAYES, JP
    IEEE TRANSACTIONS ON COMPUTERS, 1976, 25 (09) : 875 - 884
  • [30] Design of Fault-Tolerant Neuromorphic Computing Systems
    Liu, Mengyun
    Xia, Lixue
    Wang, Yu
    Chakrabarty, Krishnendu
    2018 23RD IEEE EUROPEAN TEST SYMPOSIUM (ETS), 2018,