Design and implementation of a consistent time service for fault-tolerant distributed systems

Cited by: 2
Authors
Zhao, W [1]
Moser, LE [1]
Melliar-Smith, PM [1]
Affiliation
[1] Eternal Syst Inc, Santa Barbara, CA 93111 USA
DOI
10.1109/DSN.2003.1209945
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Clock-related operations are one of the many sources of replica non-determinism and of replica inconsistency in fault-tolerant distributed systems. In passive replication, if the primary server crashes, the next clock value returned by the new primary server might have actually rolled back in time, which can lead to undesirable consequences for the replicated application. The same problem can happen for active replication where the result of the first replica to respond is taken as the next clock value. In this paper, we describe the design and implementation of a consistent time service for fault-tolerant distributed systems. The consistent time service introduces a group clock that is consistent across the replicas and that ensures the determinism of the replicas with respect to clock-related operations. The group clock is monotonically increasing, is transparent to the application and is fault-tolerant. The consistent time service guarantees the consistency of the group clock even when faults occur, when new replicas are added into the group and when failed replicas recover.
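The rollback problem and the idea of a group clock can be illustrated with a minimal sketch. This is not the protocol from the paper: it assumes that every surviving replica learns each agreed clock value immediately (a real system would need ordered group communication for that), and all names here (`Replica`, `read_group_clock`, `demo`) are hypothetical. Each replica keeps an offset between its local clock and the last agreed group value, so a backup whose local clock lags still continues from the group value after it takes over.

```python
from dataclasses import dataclass


@dataclass
class Replica:
    """Hypothetical replica with its own, possibly skewed, local clock."""
    name: str
    local_ms: int        # local physical clock reading, in milliseconds
    offset_ms: int = 0   # group clock minus local clock, as last agreed

    def propose(self) -> int:
        # Value this replica would report for the next clock read.
        return self.local_ms + self.offset_ms

    def adopt(self, group_ms: int) -> None:
        # Re-derive the offset so this replica agrees with the group value.
        self.offset_ms = group_ms - self.local_ms

    def tick(self, elapsed_ms: int) -> None:
        # Simulate the passage of real time on the local clock.
        self.local_ms += elapsed_ms


def read_group_clock(responder: Replica, replicas: list) -> int:
    """Take the responder's proposal as the group value; all replicas adopt it.

    Assumption: adoption is instantaneous and reliable in this sketch.
    """
    group_ms = responder.propose()
    for replica in replicas:
        replica.adopt(group_ms)
    return group_ms


def demo():
    primary = Replica("primary", local_ms=100_000)
    backup = Replica("backup", local_ms=95_000)  # local clock lags by 5 s
    group = [primary, backup]

    first = read_group_clock(primary, group)

    for replica in group:
        replica.tick(200)  # 200 ms of real time passes

    # What the backup would return from its raw local clock after the
    # primary crashes: earlier than `first`, i.e. time rolls back.
    naive = backup.local_ms

    # With the offset learned before the crash, the backup continues
    # from the group value instead.
    second = read_group_clock(backup, [backup])
    return first, naive, second


if __name__ == "__main__":
    first, naive, second = demo()
    print(first, naive, second)
```

In this sketch the raw local reading of the new primary falls below the value already returned, while the offset-corrected reading does not, which is the monotonicity property the abstract attributes to the group clock.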
Pages: 341-350
Page count: 10