FlexCNN: An End-to-end Framework for Composing CNN Accelerators on FPGA

Cited by: 19
Authors
Basalama, Suhail [1]
Sohrabizadeh, Atefeh [1]
Wang, Jie [1]
Guo, Licheng [1]
Cong, Jason [1]
Affiliations
[1] Univ Calif Los Angeles, 404 Westwood Blvd Engn,6 Room 468, Los Angeles, CA 90095 USA
Keywords
FPGA; CNN; ONNX; systolic array; transposed convolution; dilated convolution; OpenPose; U-Net; E-Net
DOI
10.1145/3570928
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
With reduced data reuse and parallelism, recent convolutional neural networks (CNNs) create new challenges for FPGA acceleration. Systolic arrays (SAs) are efficient, scalable architectures for convolutional layers, but without proper optimizations, their efficiency drops dramatically for three reasons: (1) the differing dimensions among layers of the same type, (2) the variety of convolution layers, especially transposed and dilated convolutions, and (3) the complex dataflow graphs of CNNs. Furthermore, significant overheads arise when integrating FPGAs into machine learning frameworks. Therefore, we present a flexible, composable architecture called FlexCNN, which delivers high computation efficiency by employing dynamic tiling, layer fusion, and data layout optimizations. Additionally, we implement a novel versatile SA to process normal, transposed, and dilated convolutions efficiently. FlexCNN also uses a fully pipelined software-hardware integration that alleviates the software overheads. Moreover, with an automated compilation flow, FlexCNN takes a CNN in the ONNX representation, performs a design space exploration, and generates an FPGA accelerator. The framework is tested using three complex CNNs: OpenPose, U-Net, and E-Net. The architecture optimizations achieve a 2.3x performance improvement. Compared to a standard SA, the versatile SA achieves close-to-ideal speedups of up to 15.98x and 13.42x for transposed and dilated convolutions, respectively, with a 6% average area overhead. The pipelined integration leads to a 5x speedup for OpenPose.
Pages: 32