harDNNing: a machine-learning-based framework for fault tolerance assessment and protection of DNNs

被引:5
|
作者
Traiola, Marcello [1 ]
Kritikakou, Angeliki [1 ]
Sentieys, Olivier [1 ]
机构
[1] Univ Rennes, CNRS, INRIA, IRISA, Rennes, France
关键词
Reliability Analysis; Fault Tolerance; Machine Learning; Neural Networks;
D O I
10.1109/ETS56758.2023.10174178
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Deep Neural Networks (DNNs) show promising performance in several application domains, such as robotics, aerospace, smart healthcare, and autonomous driving. Never-theless, DNN results may be incorrect, not only because of the network intrinsic inaccuracy, but also due to faults affecting the hardware. Indeed, hardware faults may impact the DNN inference process and lead to prediction failures. Therefore, ensuring the fault tolerance of DNN is crucial. However, common fault tolerance approaches are not cost-effective for DNNs protection, because of the prohibitive overheads due to the large size of DNNs and of the required memory for parameter storage. In this work, we propose a comprehensive framework to assess the fault tolerance of DNNs and cost-effectively protect them. As a first step, the proposed framework performs datatype-and-layer-based fault injection, driven by the DNN characteristics. As a second step, it uses classification-based machine learning methods in order to predict the criticality, not only of network parameters, but also of their bits. Last, dedicated Error Correction Codes (ECCs) are selectively inserted to protect the critical parameters and bits, hence protecting the DNNs with low cost. Thanks to the proposed framework, we explored and protected two Convolutional Neural Networks (CNNs), each with four different data encoding. The results show that it is possible to protect the critical network parameters with selective ECCs while saving up to 83% memory w.r.t. conventional ECC approaches.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Edge-Computing and Machine-Learning-Based Framework for Software Sensor Development
    Hanzelik, Pal Peter
    Kummer, Alex
    Abonyi, Janos
    SENSORS, 2022, 22 (11)
  • [22] Machine-Learning-Based Predictive Handover
    Masri, Ahmed
    Veijalainen, Teemu
    Martikainen, Henrik
    Mwanje, Stephen
    Ali-Tolppa, Janne
    Kajo, Marton
    2021 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2021), 2021, : 648 - 652
  • [23] Dynamic Learning Framework for Smooth-Aided Machine-Learning-Based Backbone Traffic Forecasts
    Hassan, Mohamed Khalafalla
    Ariffin, Sharifah Hafizah Syed
    Ghazali, N. Effiyana
    Hamad, Mutaz
    Hamdan, Mosab
    Hamdi, Monia
    Hamam, Habib
    Khan, Suleman
    SENSORS, 2022, 22 (09)
  • [24] Machine-learning-based Single-phase-to-ground Fault Detection in Distribution Systems
    Zeng, Xiao-Dan
    Guo, Mou-Fa
    Chen, Duan-Yu
    2017 IEEE CONFERENCE ON ENERGY INTERNET AND ENERGY SYSTEM INTEGRATION (EI2), 2017, : 153 - 158
  • [25] Identifying fluency parameters for a machine-learning-based automated interpreting assessment system
    Wang, Xiaoman
    Wang, Binhua
    PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2024, 32 (02): : 278 - 294
  • [26] Machine-Learning-Based Hazardous Spot Detection Framework by Mobile Sensing and Opportunistic Networks
    Watanabe, Yoshito
    Liu, Wei
    Shoji, Yozo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13646 - 13657
  • [27] Biochar design for antibiotics adsorption via a hybrid machine-learning-based optimization framework
    Li, Jie
    Pan, Lanjia
    Huang, Yahui
    Liu, Xuejiao
    Ye, Zhilong
    Wang, Yin
    SEPARATION AND PURIFICATION TECHNOLOGY, 2024, 348
  • [28] Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels
    Xiao, Lei
    Han, Yubing
    Weng, Zuxin
    REMOTE SENSING, 2022, 14 (20)
  • [29] Towards a Generic Trust Management Framework using a Machine-Learning-Based Trust Model
    Lopez, Jorge
    Maag, Stephane
    2015 IEEE TRUSTCOM/BIGDATASE/ISPA, VOL 1, 2015, : 1343 - 1348
  • [30] A framework to guide the selection and configuration of machine-learning-based data analytics solutions in manufacturing
    Zacarias, Alejandro Gabriel Villanueva
    Reimann, Peter
    Mitschang, Bernhard
    51ST CIRP CONFERENCE ON MANUFACTURING SYSTEMS, 2018, 72 : 153 - 158