Efficient Software-Implemented HW Fault Tolerance for TinyML Inference in Safety-critical Applications

被引:3
|
作者
Sharif, Uzair [1 ]
Mueller-Gritschneder, Daniel [1 ]
Stahl, Rafael [1 ]
Schlichtmann, Ulf [1 ]
机构
[1] Tech Univ Munich TUM, Chair Elect Design Automat, Munich, Germany
关键词
TinyML; safety; error detection; soft-error;
D O I
10.23919/DATE56975.2023.10137207
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
TinyML research has mainly focused on optimizing neural network inference in terms of latency, code-size and energy-use for efficient execution on low-power micro-controller units (MCUs). However, distinctive design challenges emerge in safety-critical applications, for example in small unmanned autonomous vehicles such as drones, due to the susceptibility of off-the-shelf MCU devices to soft-errors. We propose three new techniques to protect TinyML inference against random soft errors with the target to reduce run-time overhead: one for protecting fully-connected layers; one adaptation of existing algorithmic fault tolerance techniques to depth-wise convolutions; and an efficient technique to protect the so-called epilogues within TinyML layers. Integrating these layer-wise methods, we derive a full-inference hardening solution for TinyML that achieves run-time efficient soft-error resilience. We evaluate our proposed solution on MLPerf-Tiny benchmarks. Our experimental results show that competitive resilience can be achieved compared with currently available methods, while reducing run-time overheads by similar to 120% for one fully-connected neural network (NN); similar to 20% for the two CNNs with depth-wise convolutions; and similar to 2% for standard CNN. Additionally, we propose selective hardening which reduces the incurred run-time overhead further by similar to 2x for the studied CNNs by focusing exclusively on avoiding mispredictions.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] REVIEW OF FAULT-TOLERANT COMPUTING FOR SAFETY-CRITICAL APPLICATIONS IN JAPAN
    TOHMA, Y
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 1994, 9 (01): : 3 - 10
  • [42] Fault Tolerant Smart Transducer Interfaces for Safety-Critical Avionics Applications
    Bouanen, Safwen
    Thibeault, Claude
    Savaria, Yvon
    Tremblay, Jose-Philippe
    Zhu, Guchuan
    2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
  • [43] Reliability demonstration testing method for safety-critical embedded applications software
    Qin, Zhidong
    Chen, Hui
    Shi, Youqun
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS, 2008, : 481 - 487
  • [44] FAULT TOLERANT SMART TRANSDUCER INTERFACES FOR SAFETY-CRITICAL AVIONICS APPLICATIONS
    Bouanen, Safwen
    Thibeault, Claude
    Savaria, Yvon
    Tremblay, Jose-Philippe
    Zhu, Guchuan
    2013 IEEE/AIAA 32ND DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2013,
  • [45] Comparing software measures with fault counts derived from unit-testing of safety-critical software
    Herzner, W
    Ramberger, S
    Länger, T
    Reumann, C
    Gruber, T
    Sejkora, C
    COMPUTER SAFETY, RELIABILITY, AND SECURITY, PROCEEDINGS, 2005, 3688 : 81 - 93
  • [46] Fault Diagnosis and Fault-tolerant Operation of Current Source Inverter for Safety-Critical Applications
    Fard, Majid Tahmasbi
    He, Jiangbiao
    Wang, Zheng
    2020 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE & EXPO (ITEC), 2020, : 925 - 929
  • [47] Toward Efficient Calculation of Inverses in Control Allocation for Safety-Critical Applications
    Raab, Stefan
    Steinert, Agnes
    Hafner, Simon
    Holzapfel, Florian
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2024, 47 (11) : 2316 - 2332
  • [49] An uncertainty-informed framework for trustworthy fault diagnosis in safety-critical applications
    Zhou, Taotao
    Zhang, Laibin
    Han, Te
    Droguett, Enrique Lopez
    Mosleh, Ali
    Chan, Felix T. S.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 229
  • [50] Fault Assessment of Safety-Critical Applications on Reconfigurable Multi-Core Architecture
    Khamvilai, Thanakorn
    Sutter, Louis
    Magalhaes, Jose M., Jr.
    Syed, Aqib A.
    Feron, Eric
    2019 IEEE/AIAA 38TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2019,