Fault-tolerant deep learning inference on CPU-GPU integrated edge devices with TEEs

被引:0
|
作者
Xu, Hongjian [1 ]
Liao, Longlong [2 ,3 ]
Liu, Xinqi [4 ]
Chen, Shuguang [3 ]
Chen, Jianguo [5 ]
Liang, Zhixuan [6 ]
Yu, Yuanlong [1 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou 350100, Peoples R China
[2] Fuzhou Univ, Fuzhou 350100, Peoples R China
[3] Univ Hong Kong, Hong Kong 999077, Peoples R China
[4] Univ Hong Kong, Dept Civil Engn, Hong Kong 999077, Peoples R China
[5] Sun Yat Sen Univ, Sch Software Engn, Zhuhai 519082, Peoples R China
[6] Hong Kong Polytech Univ, Comp Sci & Technol, Hong Kong 999077, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep learning; Fault-tolerant inference; Fault injection attack; CPU-GPU integrated edge device; Trusted Execution Environment;
D O I
10.1016/j.future.2024.07.027
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
CPU-GPU integrated edge devices and deep learning algorithms have received significant progress in recent years, leading to increasingly widespread application of edge intelligence. However, deep learning inference on these edge devices is vulnerable to Fault Injection Attacks (FIAs) that can modify device memory or execute instructions with errors. We propose DarkneTF, a Fault-Tolerant (FT) deep learning inference framework for CPU-GPU integrated edge devices, to ensure the correctness of model inference results by detecting the threat of FIAs. DarkneTF introduces algorithm-based verification to implement the FT deep learning inference. The verification process involves verifying the integrity of model weights and validating the correctness of time- intensive calculations, such as convolutions. We improve the Freivalds algorithm to enhance the ability to detect tiny perturbations by strengthening randomization. As the verification process is also susceptible to FIAs, DarkneTF offloads the verification process into Trusted Execution Environments (TEEs). This scheme ensures the verification process's security and allows for accelerated model inference using the integrated GPUs. Experimental results show that GPU-accelerated FT inference on HiKey 960 achieves notable speedups ranging from 3.46x to 5.57x compared to FT inference on a standalone CPU. The extra memory overhead incurred FT inference remains at an exceedingly low level, with a range of 0.46% to 10.22%. The round-off error of the improved Freivalds algorithm is below 2.50 . 50 x 10 -4 , and the accuracy of detecting FIAs is above 92.73%.
引用
收藏
页码:404 / 414
页数:11
相关论文
共 50 条
  • [41] Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review
    Shuvo, Md. Maruf Hossain
    Islam, Syed Kamrul
    Cheng, Jianlin
    Morshed, Bashir I.
    PROCEEDINGS OF THE IEEE, 2023, 111 (01) : 42 - 91
  • [42] Learning-based integrated fault-tolerant guidance and control for hypersonic vehicles considering avoidance and penetration
    Wu, Tiancai
    Wang, Honglun
    Ren, Bin
    Liu, Yiheng
    Wu, Xingyu
    Yan, Guocheng
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (15):
  • [43] Carbon-Aware and Fault-Tolerant Migration of Deep Learning Workloads in the Geo-distributed Cloud
    Park, Jeonghyeon
    Kim, Daero
    Kim, Jiseon
    Han, Jungkyu
    Chun, Sejin
    2024 IEEE 17TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, CLOUD 2024, 2024, : 494 - 501
  • [44] Deep Learning-based Reentry Predictor-corrector Fault-tolerant Guidance for Hypersonic Vehicles
    Yu Y.
    Wang H.
    Binggong Xuebao/Acta Armamentarii, 2020, 41 (04): : 656 - 669
  • [45] CoFB: latency-constrained co-scheduling of flows and batches for deep learning inference service on the CPU–GPU system
    Qi Zhang
    Yi Liu
    Tao Liu
    Depei Qian
    The Journal of Supercomputing, 2023, 79 : 14172 - 14199
  • [46] Deep Reinforcement Learning-Based Approach for Fault-Tolerant Control of PV Systems in Smart Grids
    Karaki, Tala
    Saied, Majd
    Shraim, Hassan
    2022 10TH INTERNATIONAL CONFERENCE ON SYSTEMS AND CONTROL (ICSC), 2022, : 283 - 288
  • [47] Fault-Tolerant Control of Programmable Logic Controller-Based Production Systems With Deep Reinforcement Learning
    Zinn, Jonas
    Vogel-Heuser, Birgit
    Gruber, Marius
    JOURNAL OF MECHANICAL DESIGN, 2021, 143 (07)
  • [48] Active fault-tolerant hybrid control integrated with reinforcement learning application to cable-driven parallel robots
    Lu, Yanqi
    Yao, Weiran
    CONTROL ENGINEERING PRACTICE, 2025, 158
  • [49] Energy-efficient and fault-tolerant routing mechanism for WSN using optimizer based deep learning model
    Swathi, B.
    Amanullah, M.
    Kalaiselvan, S. A.
    SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS, 2024, 44
  • [50] Intelligent active fault-tolerant system for multi-source integrated navigation system based on deep neural network
    Chengjun Guo
    Feng Li
    Zhong Tian
    Wei Guo
    Shusen Tan
    Neural Computing and Applications, 2020, 32 : 16857 - 16874