A High-Throughput Reconfigurable Processing Array for Neural Networks

被引:0
|
作者
Wu, Ephrem [1 ]
Zhang, Xiaoqian [1 ]
Berman, David [1 ]
Cho, Inkeun [1 ]
机构
[1] Xilinx Inc, San Jose, CA 95124 USA
关键词
convolutional neural networks; timing closure; matrix multiplication; FPGA; DSP; cache; memory bandwidth;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGA-based neural-networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper presents a processing array architected to consistently achieve timing closure at 100% of the peak DSP clock rate with standard FPGA tools. In the HDL design environment, our processing array operates at the peak DSP clock rates on Xilinx UltraScale (741 MHz) and UltraScale+ (891 MHz) devices. To enhance portability and consistency of timing closure, this array operates at a high clock rate while data SRAMs run at a fraction of this rate. As a proof of concept, this paper outlines a processing array for matrix multiplication and convolution, the most compute-intensive operations of a convolutional neural network (CNN).
引用
收藏
页数:4
相关论文
共 50 条
  • [31] High-accuracy and high-throughput reactive lymphocyte identification using lightweight neural networks
    Mei, Liye
    Jin, Shuangtong
    Huang, Tingting
    Peng, Haorang
    Zha, Wenqi
    He, Jing
    Zhang, Songsong
    Xu, Chuan
    Yang, Wei
    Shen, Hui
    Lei, Cheng
    Xiong, Bei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 97
  • [32] WidePipe: High-Throughput Deep Learning Inference System on a Cluster of Neural Processing Units
    Ma, Lixian
    Shao, En
    Zhou, Yueyuan
    Tan, Guangming
    2021 IEEE 39TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2021), 2021, : 563 - 566
  • [33] Parallel Tools in HEVC for High-Throughput Processing
    Zhou, Minhua
    Sze, Vivienne
    Budagavi, Madhukar
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXV, 2012, 8499
  • [34] Automatic Processing of Chromatograms in a High-Throughput Environment
    Lytle, Fred E.
    Julian, Randall K.
    CLINICAL CHEMISTRY, 2016, 62 (01) : 144 - 153
  • [35] High-throughput detection of RNA processing in bacteria
    Erin E. Gill
    Luisa S. Chan
    Geoffrey L. Winsor
    Neil Dobson
    Raymond Lo
    Shannan J. Ho Sui
    Bhavjinder K. Dhillon
    Patrick K. Taylor
    Raunak Shrestha
    Cory Spencer
    Robert E. W. Hancock
    Peter J. Unrau
    Fiona S. L. Brinkman
    BMC Genomics, 19
  • [36] High-throughput detection of RNA processing in bacteria
    Gill, Erin E.
    Chan, Luisa S.
    Winsor, Geoffrey L.
    Dobson, Neil
    Lo, Raymond
    Sui, Shannan J. Ho
    Dhillon, Bhavjinder K.
    Taylor, Patrick K.
    Shrestha, Raunak
    Spencer, Cory
    Hancock, Robert E. W.
    Unrau, Peter J.
    Brinkman, Fiona S. L.
    BMC GENOMICS, 2018, 19
  • [37] A Throughput-Optimized Channel-Oriented Processing Element Array for Convolutional Neural Networks
    Chen, Yu-Xian
    Ruan, Shanq-Jang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (02) : 752 - 756
  • [38] Miniaturized Fluid Array for High-Throughput Protein Expression
    Khnouf, Ruba
    Olivero, Daniel
    Jin, Shouguang
    Fan, Z. Hugh
    BIOTECHNOLOGY PROGRESS, 2010, 26 (06) : 1590 - 1596
  • [39] High-throughput impedance spectroscopy biosensor array chip
    Liu, Xiaowen
    Li, Lin
    Mason, Andrew J.
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2014, 372 (2012):
  • [40] Human transcriptome array for high-throughput clinical studies
    Xu, Weihong
    Seok, Junhee
    Mindrinos, Michael N.
    Schweitzer, Anthony C.
    Jiang, Hui
    Wilhelmy, Julie
    Clark, Tyson A.
    Kapur, Karen
    Xing, Yi
    Faham, Malek
    Storey, John D.
    Moldawer, Lyle L.
    Maier, Ronald V.
    Tompkins, Ronald G.
    Wong, Wing Hung
    Davis, Ronald W.
    Xiao, Wenzhong
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (09) : 3707 - 3712