Cost analysis of optimistic recovery model for forked checkpointing

被引:0
|
作者
Hong, J [1 ]
Kim, S [1 ]
Cho, Y [1 ]
机构
[1] Seoul Natl Univ, Sch Comp Sci & Engn, Seoul 151742, South Korea
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2003年 / E86D卷 / 09期
关键词
checkpointing and recovery; forked checkpointing; checkpoint overhead; expected execution time;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Forked checkpointing scheme is proposed to achieve low checkpoint overhead. When a process wants to take a checkpoint in the forked checkpointing scheme, it creates a child process and continues its normal computation. Two recovery models can be used for forked checkpointing when the parent process fails before the child process establishes the checkpoint. One is the pessimistic recovery model where the recovery process rolls back to the previous checkpoint state. The other is the optimistic recovery model where a recovery process waits for the checkpoint to be established by the child process. In this paper, we present the recovery models for forked checkpointing by deriving the expected execution time of a process with and without checkpointing and also show that the expected recovery time of the optimistic recovery model is smaller than that of the pessimistic recovery model.
引用
收藏
页码:1534 / 1541
页数:8
相关论文
共 50 条
  • [31] A generalized forward recovery checkpointing scheme
    Huang, K
    Wu, J
    Fernandez, EB
    PARALLEL AND DISTRIBUTED PROCESSING, 1998, 1388 : 623 - 643
  • [32] RELIABILITY ANALYSIS OF CHECKPOINTING MODEL WITH MULTIPLE VERIFICATION MECHANISM
    Lee, Yutae
    BULLETIN OF THE KOREAN MATHEMATICAL SOCIETY, 2019, 56 (06) : 1435 - 1445
  • [33] OPTIMISTIC RECOVERY IN DISTRIBUTED SYSTEMS
    STROM, RE
    YEMINI, S
    ACM TRANSACTIONS ON COMPUTER SYSTEMS, 1985, 3 (03): : 204 - 226
  • [34] Transparent optimistic rollback recovery
    Johnson, David B.
    Zwaenepoel, Willy
    Operating Systems Review (ACM), 1991, 25 (02): : 99 - 102
  • [35] Design and analysis of a hardware-assisted checkpointing and recovery scheme for distributed applications
    Ramamurthy, B
    Upadhyaya, S
    Bhargava, B
    SEVENTEENTH IEEE SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS, PROCEEDINGS, 1998, : 84 - 90
  • [36] CHECKPOINTING AND ROLLBACK-RECOVERY FOR DISTRIBUTED SYSTEMS
    KOO, R
    TOUEG, S
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (01) : 23 - 31
  • [37] Extended mpiJava']Java for distributed checkpointing and recovery
    Hernandez, Emilio
    Cardinale, Yudith
    Pereira, Wilmer
    RECENT ADVANCES IN PARALLEL VIRTUAL MACHINE AND MESSAGE PASSING INTERFACE, 2006, 4192 : 158 - 165
  • [38] An optimistic checkpointing and message logging approach for consistent global checkpoint collection in distributed systems
    Jiang, Qiangfeng
    Luo, Yi
    Manivannan, D.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (12) : 1575 - 1589
  • [39] PREACHES - Portable recovery and checkpointing in heterogeneous systems
    Ssu, KF
    Fuchs, WK
    TWENTY-EIGHTH ANNUAL INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT COMPUTING, DIGEST PAPERS, 1998, : 38 - 47
  • [40] AN EFFICIENT PROTOCOL FOR CHECKPOINTING RECOVERY IN DISTRIBUTED SYSTEMS
    KIM, JL
    PARK, T
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1993, 4 (08) : 955 - 960