Non-intrusive system level fault-tolerance

Cited by: 0
Authors
Lundqvist, K [1]
Srinivasan, J [1]
Gorelov, S [1]
Affiliation
[1] MIT, Dept Aeronaut & Astronaut, Embedded Syst Lab, Cambridge, MA 02139 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
High-integrity embedded systems operate in multiple modes in order to ensure system availability in the face of faults. Unanticipated state-dependent faults that remain in software after system design and development behave like hardware transient faults: they appear, do the damage and disappear. The conventional approach used for handling task overruns caused by transient faults is to use a single recovery task that implements minimal functionality. This approach provides limited availability and should be used as a last resort in order to keep the system online. Traditional fault detection approaches are often intrusive in that they consume processor resources in order to monitor system behavior. This paper presents a novel approach for fault-monitoring by leveraging the Ravenscar profile, model-checking and a system-on-chip implementation of both the kernel and an execution time monitor. System fault-tolerance is provided through a hierarchical set of operational modes that are based on timing behavior violations of individual tasks within the application. The approach is illustrated through a simple case study of a generic navigation system.
Pages: 156-166
Page count: 11