A Heterogeneous Inference Framework for a Deep Neural Network

被引:0
|
作者
Gadea-Girones, Rafael [1 ]
Rocabado-Rocha, Jose Luis [1 ]
Fe, Jorge [1 ]
Monzo, Jose M. [1 ]
机构
[1] Univ Politecn Valencia, Inst Mol Imaging Technol I3M, Valencia 46022, Spain
关键词
convolutional neural networks; heterogeneous computation; systolic arrays; FPGA; FIRE DETECTION;
D O I
10.3390/electronics13020348
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Artificial intelligence (AI) is one of the most promising technologies based on machine learning algorithms. In this paper, we propose a workflow for the implementation of deep neural networks. This workflow attempts to combine the flexibility of high-level compilers (HLS)-based networks with the architectural control features of hardware description languages (HDL)-based flows. The architecture consists of a convolutional neural network, SqueezeNet v1.1, and a hard processor system (HPS) that coexists with acceleration hardware to be designed. This methodology allows us to compare solutions based solely on software (PyTorch 1.13.1) and propose heterogeneous inference solutions, taking advantage of the best options within the software and hardware flow. The proposed workflow is implemented on a low-cost field programmable gate array system-on-chip (FPGA SOC) platform, specifically the DE10-Nano development board. We have provided systolic architectural solutions written in OpenCL that are highly flexible and easily tunable to take full advantage of the resources of programmable devices and achieve superior energy efficiencies working with a 32-bit floating point. From a verification point of view, the proposed method is effective, since the reference models in all tests, both for the individual layers and the complete network, have been readily available using packages well known in the development, training, and inference of deep networks.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] EvoDNN - An Evolutionary Deep Neural Network with Heterogeneous Activation Functions
    Cui, Peiyu
    Shabash, Boris
    Wiese, Kay C.
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 2362 - 2369
  • [42] Heterogeneous Modular Deep Neural Network for Diabetic Retinopathy Detection
    Soniya
    Paul, Sandeep
    Singh, Lotika
    2016 IEEE REGION 10 HUMANITARIAN TECHNOLOGY CONFERENCE (R10-HTC), 2016,
  • [43] Deep heterogeneous network embedding based on Siamese Neural Networks
    Zhang, Chen
    Tang, Zhouhua
    Yu, Bin
    Xie, Yu
    Pan, Ke
    NEUROCOMPUTING, 2020, 388 : 1 - 11
  • [44] Multi-accelerator Neural Network Inference in Diversely Heterogeneous Embedded Systems
    Dagli, Ismet
    Belviranli, Mehmet E.
    PROCEEDINGS OF RSDHA 2021: REDEFINING SCALABILITY FOR DIVERSELY HETEROGENEOUS ARCHITECTURES, 2021, : 1 - 7
  • [45] Benchmarking Deep Neural Network Inference Performance on Serverless Environments With MLPerf
    Elordi, Unai
    Unzueta, Luis
    Goenetxea, Jon
    Sanchez-Carvallido, Sergio
    Arganda-Carreras, Ignacio
    Otaegui, Oihana
    IEEE SOFTWARE, 2021, 38 (01) : 81 - 87
  • [46] A hierarchical fused fuzzy deep neural network with heterogeneous network embedding for recommendation
    Pham, Phu
    Nguyen, Loan T. T.
    Nguyen, Ngoc Thanh
    Kozma, Robert
    Vo, Bay
    INFORMATION SCIENCES, 2023, 620 : 105 - 124
  • [47] Automated Search for Deep Neural Network Inference Partitioning on Embedded FPGA
    Kress, Fabian
    Hoefer, Julian
    Hotfilter, Tim
    Walter, Iris
    El Annabi, El Mahdi
    Harbaum, Tanja
    Becker, Juergen
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT I, 2023, 1752 : 557 - 568
  • [48] Throughput optimizations for FPGA-based deep neural network inference
    Posewsky, Thorbjoern
    Ziener, Daniel
    MICROPROCESSORS AND MICROSYSTEMS, 2018, 60 : 151 - 161
  • [49] A Posit8 Decompression Operator for Deep Neural Network Inference
    Desrentes, Oregane
    Resmerita, Diana
    de Dinechin, Benoit Dupont
    NEXT GENERATION ARITHMETIC, CONGA 2022, 2022, 13253 : 14 - 30
  • [50] Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems
    Kress, Fabian
    El Annabi, El Mahdi
    Hotfilter, Tim
    Hoefer, Julian
    Harbaum, Tanja
    Becker, Juergen
    2024 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2024, : 39 - 44