A generic approach to structuring and implementing complex fault-tolerant software

被引:3
|
作者
Xu, J [1 ]
Randell, B [1 ]
Romanovsky, A [1 ]
机构
[1] Univ Durham, Durham DH1 3LE, England
关键词
architectural patterns; concurrent and distributed systems; coordinated atomic actions; faulttolerant software; object orientation;
D O I
10.1109/ISORC.2002.1003704
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses the practical implementation of means of tolerating residual software faults in complex software systems, especially concurrent and distributed ones. There are several inherent difficulties in implementing such fault-tolerant software systems, including the controlled use of extra redundancy and the mixture of different design concerns. In an attempt to minimise these difficulties, we present a generic implementation approach, composed of a multi-layered reference architecture, a configuration method and an architectural pattern. We evaluate our implementation approach using an industrial control application whose control software we equip with the ability to tolerate a variety of software faults. The preliminary evidence shows that our approach can simplify the implementation process, reduce repetitive development effort and provide high flexibility through a generic interface for a wide range of fault tolerance schemes.
引用
收藏
页码:207 / 214
页数:6
相关论文
共 50 条