A generic approach to structuring and implementing complex fault-tolerant software

被引:3
|
作者
Xu, J [1 ]
Randell, B [1 ]
Romanovsky, A [1 ]
机构
[1] Univ Durham, Durham DH1 3LE, England
关键词
architectural patterns; concurrent and distributed systems; coordinated atomic actions; faulttolerant software; object orientation;
D O I
10.1109/ISORC.2002.1003704
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the practical implementation of means of tolerating residual software faults in complex software systems, especially concurrent and distributed ones. There are several inherent difficulties in implementing such fault-tolerant software systems, including the controlled use of extra redundancy and the mixture of different design concerns. In an attempt to minimise these difficulties, we present a generic implementation approach, composed of a multi-layered reference architecture, a configuration method and an architectural pattern. We evaluate our implementation approach using an industrial control application whose control software we equip with the ability to tolerate a variety of software faults. The preliminary evidence shows that our approach can simplify the implementation process, reduce repetitive development effort and provide high flexibility through a generic interface for a wide range of fault tolerance schemes.
引用
收藏
页码:207 / 214
页数:6
相关论文
共 50 条
  • [31] RELIABILITY-GROWTH OF FAULT-TOLERANT SOFTWARE
    KANOUN, K
    KAANICHE, M
    BEOUNES, C
    LAPRIE, JC
    ARLAT, J
    IEEE TRANSACTIONS ON RELIABILITY, 1993, 42 (02) : 205 - 219
  • [32] Reliability simulation of fault-tolerant software and systems
    Gokhale, SS
    Lyu, MR
    Trivedi, KS
    PACIFIC RIM INTERNATIONAL SYMPOSIUM ON FAULT-TOLERANT SYSTEMS, PROCEEDINGS, 1997, : 167 - 173
  • [33] The effect of testing on reliability of fault-tolerant software
    Popov, P
    Littlewood, B
    2004 INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS, PROCEEDINGS, 2004, : 265 - 274
  • [34] Optimal structure of fault-tolerant software systems
    Levitin, G
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2005, 89 (03) : 286 - 295
  • [35] Fault-tolerant Hybrid Quantum Software Systems
    Scheerer, Max
    Klamroth, Jonas
    Denninger, Oliver
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM SOFTWARE (IEEE QSW 2022), 2022, : 52 - 57
  • [36] Towards reliability optimization of fault-tolerant software
    Belli, F.
    Bremerhaven, Hochschule
    Jedrzejowicz, P.
    IT - Information Technology, 1987, 29 (02): : 61 - 68
  • [37] FAULT-TOLERANT SOFTWARE-RELIABILITY MODELING
    SCOTT, RK
    GAULT, JW
    MCALLISTER, DF
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1987, 13 (05) : 582 - 592
  • [38] STRUCTURING FAULT-TOLERANT OBJECT SYSTEMS FOR MODULARITY IN A DISTRIBUTED ENVIRONMENT
    SHRIVASTAVA, SK
    MCCUE, DL
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 1994, 5 (04) : 421 - 432
  • [39] An Extensible Framework for Implementing Byzantine Fault-Tolerant Protocols
    Gogada, Hanish
    Meling, Hein
    Jehl, Leander
    Olsen, John Ingve
    38TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2023, 2023, : 121 - 124
  • [40] Distal: A Framework for Implementing Fault-tolerant Distributed Algorithms
    Biely, Martin
    Delgado, Pamela
    Milosevic, Zarko
    Schiper, Andre
    2013 43RD ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2013,