Design and implementation of a consistent time service for fault-tolerant distributed systems

被引:2
|
作者
Zhao, W [1 ]
Moser, LE [1 ]
Melliar-Smith, PM [1 ]
机构
[1] Eternal Syst Inc, Santa Barbara, CA 93111 USA
关键词
D O I
10.1109/DSN.2003.1209945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clock-related operations are one of the many sources of replica non-determinism and of replica inconsistency in fault-tolerant distributed systems. In passive replication, if the primary server crashes, the next clock value returned by the new primary server might have actually rolled back in time, which can lead to undesirable consequences for the replicated application. The same problem can happen for active replication where the result of the first replica to respond is taken as the next clock value. In this paper, we describe the design and implementation of a consistent time service for fault-tolerant distributed systems. The consistent time service introduces a group clock that is consistent across the replicas and that ensures the determinism of the replicas with respect to clock-related operations. The group clock is monotonically increasing, is transparent to the application and is fault-tolerant. The consistent time service guarantees the consistency of the group clock even when faults occur, when new replicas are added into the group and when failed replicas recover.
引用
收藏
页码:341 / 350
页数:10
相关论文
共 50 条
  • [1] Design and implementation of a consistent time service for fault-tolerant distributed systems
    Zhao, W
    Moser, LE
    Melliar-Smith, PM
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2004, 19 (05): : 315 - 323
  • [2] ON THE IMPLEMENTATION AND USE OF ADA ON FAULT-TOLERANT DISTRIBUTED SYSTEMS
    KNIGHT, JC
    URQUHART, JIA
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (05) : 553 - 563
  • [3] Extension to Matlab/Simulink for design and implementation of distributed fault-tolerant control systems
    Domen, V
    Rok, O
    2004 1st International Conference on Electrical and Electronics Engineering (ICEEE), 2004, : 339 - 343
  • [4] DESIGN OF FAULT-TOLERANT DISTRIBUTED CONTROL-SYSTEMS
    PIURI, V
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1994, 43 (02) : 257 - 264
  • [5] ON THE IMPLEMENTATION AND USE OF ADA ON FAULT-TOLERANT DISTRIBUTED SYSTEMS.
    Knight, John C.
    Urquhart, John I.A.
    IEEE Transactions on Software Engineering, 1987, SE-13 (05) : 553 - 563
  • [6] Design and implementation of a CORBA fault-tolerant object group service
    Morgan, G
    Shrivastava, S
    Ezhilchelvan, P
    Little, M
    DISTRIBUTED APPLICATIONS AND INTEROPERABLE SYSTEMS II, 1999, 15 : 361 - 374
  • [7] Design and implementation of fault-tolerant transactional agents for manipulating distributed objects
    Tanaka, Y
    Enokido, T
    Takizawa, M
    WORDS 2005: 10TH IEEE INTERNATIONAL WORKSHOP ON OBJECT-ORIENTED REAL-TIME DEPENDABLE, PROCEEDINGS, 2005, : 329 - 336
  • [8] UNDERSTANDING FAULT-TOLERANT DISTRIBUTED SYSTEMS
    CRISTIAN, F
    COMMUNICATIONS OF THE ACM, 1991, 34 (02) : 56 - 78
  • [9] Design of fault-tolerant scheduling algorithm for real-time tasks in distributed systems
    Han, Zongfen
    Qin, Xiao
    Pang, Liping
    Li, Shengli
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 27 (06): : 12 - 14
  • [10] Fault-tolerant Distributed Systems in Hardware
    Schmid, Stefan
    BULLETIN OF THE EUROPEAN ASSOCIATION FOR THEORETICAL COMPUTER SCIENCE, 2015, 2015 (116): : 111 - 153