Real-Time Meets Approximate Computing: An Elastic CNN Inference Accelerator with Adaptive Trade-off between QoS and QoR

被引:13
|
作者
Wang, Ying [1 ]
Li, Huawei [1 ]
Li, Xiaowei [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1145/3061639.3062307
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Due to the recent progress in deep learning and neural acceleration architectures, specialized deep neural network or convolutional neural network (CNNs) accelerators are expected to provide an energy-efficient solution for real-time vision/speech processing, recognition and a wide spectrum of approximate computing applications. In addition to their wide applicability scope, we also found that the fascinating feature of deterministic performance and high energy-efficiency, makes such deep learning (DL) accelerators ideal candidates as application-processor IPs in embedded SoCs concerned with real-time processing. However, unlike traditional accelerator designs, DL accelerators introduce a new aspect of design trade-off between real-time processing (QoS) and computation approximation (QoR) into embedded systems. This work proposes an elastic CNN acceleration architecture that automatically adapts to the hard QoS constraint by exploiting the error-resilience in typical approximate computing workloads. For the first time, the proposed design, including network tuning-and-mapping software and reconfigurable accelerator hardware, aims to reconcile the design constraint of QoS and Quality of Result (QoR), which are respectively the key concerns in real-time and approximate computing. It is shown in experiments that the proposed architecture enables the embedded system to work flexibly in an expanded operating space, significantly enhances its real-time ability, and maximizes the energy-efficiency of system within the user-specified QoS-QoR constraint through self-reconfiguration.
引用
收藏
页数:6
相关论文
共 34 条
  • [1] The trade-off between accuracy and the complexity of real-time background subtraction
    Hossain, Md Alamgir
    Nguyen, VanDung
    Huh, Eui-Nam
    IET IMAGE PROCESSING, 2021, 15 (02) : 350 - 368
  • [2] Trade-Off Between Real-Time and Classification Performance in Motor Imagery BCI
    Miladinovic, Aleksandar
    Ajcevic, Milos
    Iscra, Katerina
    Bassi, Francesco
    Raffini, Alessandra
    Jarmolowska, Joanna
    Marusic, Uros
    Accardo, Agostino
    9TH EUROPEAN MEDICAL AND BIOLOGICAL ENGINEERING CONFERENCE, VOL 2, EMBEC 2024, 2024, 113 : 336 - 344
  • [3] High-Level Estimation and Trade-Off Analysis for Adaptive Real-Time Systems
    Sander, Ingo
    Zhu, Jun
    Jantsch, Axel
    Herrholz, Andreas
    Hartmann, Philipp A.
    Nebel, Wolfgang
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 2985 - +
  • [4] Trade-off analysis of real-time control performance and schedulability
    Seto, D
    Lehoczky, JP
    Sha, L
    Shin, KG
    REAL-TIME SYSTEMS, 2001, 21 (03) : 199 - 217
  • [5] Trade-Off Analysis of Real-Time Control Performance and Schedulability*
    Danbing Seto
    John P. Lehoczky
    Lui Sha
    Kang G. Shin
    Real-Time Systems, 2001, 21 : 199 - 217
  • [6] On the trade-off between real-time pricing and the social acceptability costs of demand response
    da Silva, Hendrigo Batista
    Santiago, Leonardo P.
    RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2018, 81 : 1513 - 1521
  • [7] Real-time adaptive QoS prediction using approximate matrix multiplication
    Kurdija, Adrian Satja
    Silic, Marin
    Srbljic, Sinisa
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2018, 14 (02) : 200 - 235
  • [8] Real-time safest route identification: Examining the trade-off between safest and fastest routes
    Ghoul T.
    Sayed T.
    Fu C.
    Analytic Methods in Accident Research, 2023, 39
  • [9] A genetic programming approach for real-time crash prediction to solve trade-off between interpretability and accuracy
    Ma, Xiaochi
    Lu, Jian
    Liu, Xian
    Qu, Weibin
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (04) : 421 - 443
  • [10] Distortion-transmission trade-off in real-time transmission of Markov sources
    Chakravorty, Jhelum
    Mahajan, Aditya
    2015 IEEE INFORMATION THEORY WORKSHOP (ITW), 2015,