A High-Throughput Reconfigurable Processing Array for Neural Networks

被引:0
|
作者
Wu, Ephrem [1 ]
Zhang, Xiaoqian [1 ]
Berman, David [1 ]
Cho, Inkeun [1 ]
机构
[1] Xilinx Inc, San Jose, CA 95124 USA
关键词
convolutional neural networks; timing closure; matrix multiplication; FPGA; DSP; cache; memory bandwidth;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGA-based neural-networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper presents a processing array architected to consistently achieve timing closure at 100% of the peak DSP clock rate with standard FPGA tools. In the HDL design environment, our processing array operates at the peak DSP clock rates on Xilinx UltraScale (741 MHz) and UltraScale+ (891 MHz) devices. To enhance portability and consistency of timing closure, this array operates at a high clock rate while data SRAMs run at a fraction of this rate. As a proof of concept, this paper outlines a processing array for matrix multiplication and convolution, the most compute-intensive operations of a convolutional neural network (CNN).
引用
收藏
页数:4
相关论文
共 50 条
  • [1] A Reconfigurable ASIP for High-Throughput and Flexible FFT Processing in SDR Environment
    Chen, Ting
    Liu, Hengzhu
    Zhang, Botao
    6TH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2014), 2014, 9159
  • [3] Implementation of High-Throughput FFT Processing on an Application-Specific Reconfigurable Processor
    Liu, Lei
    Yang, Ziyu
    Li, Sikun
    Yan, Ming
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1284 - 1288
  • [4] Multifunctional High-Throughput Single-Cell Analysis using Reconfigurable Amplifier Array
    White, Kevin A.
    Mulberry, Geoffrey
    Kim, Brian N.
    BIOPHYSICAL JOURNAL, 2017, 112 (03) : 461A - 461A
  • [5] Application of artificial neural networks to high-throughput synthesis of zeolites
    Moliner, M
    Serra, JM
    Corma, A
    Argente, E
    Valero, S
    Botti, V
    MICROPOROUS AND MESOPOROUS MATERIALS, 2005, 78 (01) : 73 - 81
  • [6] Analyzing high-throughput screening data by neural networks.
    Gasteiger, J
    Teckentrup, A
    Briem, H
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2000, 219 : U607 - U607
  • [7] MORA: A new coarse-grain reconfigurable array for high throughput multimedia processing
    Lanuzza, Marco
    Perri, Stefania
    Corsonello, Pasquale
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION - PROCEEDINGS, 2007, 4599 : 159 - +
  • [8] CELLULAR LOGIC ARRAY-PROCESSING TECHNIQUES FOR HIGH-THROUGHPUT IMAGE-PROCESSING SYSTEMS
    RAJAN, EG
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1993, 18 : 279 - &
  • [9] High-Throughput Classification of Radiographs Using Deep Convolutional Neural Networks
    Rajkomar, Alvin
    Lingam, Sneha
    Taylor, Andrew G.
    Blum, Michael
    Mongan, John
    JOURNAL OF DIGITAL IMAGING, 2017, 30 (01) : 95 - 101
  • [10] Unconventional Integrated Photonic Accelerators for High-Throughput Convolutional Neural Networks
    Tsirigotis, Aris
    Sarantoglou, George
    Skontranis, Menelaos
    Deligiannidis, Stavros
    Sozos, Kostas
    Tsilikas, Giannis
    Dermanis, Dimitris
    Bogris, Adonis
    Mesaritakis, Charis
    INTELLIGENT COMPUTING, 2023, 2