Systematic Reliability Evaluation of FPGA Implemented CNN Accelerators

被引:6
|
作者
Gao, Zhen [1 ]
Gao, Shihui [1 ]
Yao, Yi [1 ]
Liu, Qiang [2 ]
Zeng, Shulin [3 ]
Ge, Guangjun [3 ]
Wang, Yu [3 ]
Ullah, Anees [4 ]
Reviriego, Pedro [5 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Tianjin Univ, Sch Microelect, Tianjin 300072, Peoples R China
[3] Tsinghua Univ, Sch Elect Engn, Beijing 100084, Peoples R China
[4] Univ Engn & Technol, Dept Elect Engn, Abbottabad Campus, Abbottabad 220101, Pakistan
[5] Univ Politecn Madrid, Dept Ingeneria Sistemas Telemat, Madrid 28040, Spain
基金
中国国家自然科学基金;
关键词
Reliability; Field programmable gate arrays; Parallel processing; Convolutional neural networks; Reliability engineering; Computer architecture; Neural networks; Convolutional neural networks (CNNs); FPGA accelerator; reliability; single bit upsets (SEUs); fault injection; RADIATION;
D O I
10.1109/TDMR.2023.3235767
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Convolutional neural networks (CNN) have become essential for many scientific and industrial applications, such as image classification and pattern detection. Among the devices that can implement neural networks, SRAM based FPGAs are a popular option due to their excellent parallel computing capability and good flexibility. However, SRAM-FPGAs are susceptible to radiation effects, which limits its application on safety critical applications. In this paper, the reliability of an accelerator based on the advanced Instruction-Set Architecture is evaluated based on hardware fault injection experiments. Each main module of the accelerator is evaluated separately, and the impact of parallelism and model features on the accelerator reliability is also examined. The experimental results reveal some important conclusions in terms of general hardware reliability and also of the particular model reliability. First, over 99% of SEUs on the computation modules will cause accuracy loss, and the reliability improves for higher parallelism. Second, a large portion of SEUs on the data mover and the instruction scheduler will cause system corruptions due to abnormal interactions with the ARM or other modules. Third, nonlinear activation and pooling layers are effective in reducing the effect of SEUs on computation modules, so models that use these layers tend to be more robust. The results provide a deep understanding of the impact of errors on CNNs implemented on ISA based FPGA accelerators (e.g., the Xilinx DPU).
引用
收藏
页码:116 / 126
页数:11
相关论文
共 50 条
  • [1] Performance Modeling for CNN Inference Accelerators on FPGA
    Ma, Yufei
    Cao, Yu
    Vrudhula, Sarma
    Seo, Jae-Sun
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 39 (04) : 843 - 856
  • [2] Optimization of FPGA-based CNN accelerators using metaheuristics
    Sadiq M. Sait
    Aiman El-Maleh
    Mohammad Altakrouri
    Ahmad Shawahna
    The Journal of Supercomputing, 2023, 79 : 4493 - 4533
  • [3] Optimization of FPGA-based CNN accelerators using metaheuristics
    Sait, Sadiq M.
    El-Maleh, Aiman
    Altakrouri, Mohammad
    Shawahna, Ahmad
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (04): : 4493 - 4533
  • [4] Effects of Runtime Reconfiguration on PUFs Implemented as FPGA-Based Accelerators
    Nassar, Hassan
    Bauer, Lars
    Henkel, Joerg
    IEEE EMBEDDED SYSTEMS LETTERS, 2023, 15 (04) : 174 - 177
  • [5] Analyzing the Impact of Approximate Adders on the Reliability of FPGA Accelerators
    Tsounis, Ioannis
    Papadimitriou, Athanasios
    Psarakis, Mihalis
    2021 IEEE EUROPEAN TEST SYMPOSIUM (ETS 2021), 2021,
  • [6] Single Fault Reliability Analysis in FPGA Implemented Circuits
    Jahanirad, Hadi
    Mohammadi, Karim
    Attarsharghi, Pejman
    2012 13TH INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED), 2012, : 49 - 56
  • [7] FlexCNN: An End-to-end Framework for Composing CNN Accelerators on FPGA
    Basalama, Suhail
    Sohrabizadeh, Atefeh
    Wang, Jie
    Guo, Licheng
    Cong, Jason
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2023, 16 (02)
  • [8] Performance Exploration on Pre-implemented CNN Hardware Accelerator on FPGA
    Kwadjo, Danielle Tchuinkou
    Mbongue, Joel Mandebi
    Bobda, Christophe
    2020 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2020), 2020, : 298 - 299
  • [9] Reliability Evaluation and Fault Tolerance Design for FPGA Implemented Reed Solomon (RS) Erasure Decoders
    Gao, Zhen
    Shi, Jinchang
    Liu, Qiang
    Ullah, Anees
    Reviriego, Pedro
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2023, 31 (01) : 142 - 146
  • [10] A statistical study into reliability of FPGA implemented circuits: Simulation and modelling
    Aguirre-Morales, J. D.
    Marc, F.
    MICROELECTRONICS RELIABILITY, 2021, 117