A High-Throughput Reconfigurable Processing Array for Neural Networks

被引:0
|
作者
Wu, Ephrem [1 ]
Zhang, Xiaoqian [1 ]
Berman, David [1 ]
Cho, Inkeun [1 ]
机构
[1] Xilinx Inc, San Jose, CA 95124 USA
关键词
convolutional neural networks; timing closure; matrix multiplication; FPGA; DSP; cache; memory bandwidth;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
FPGA-based neural-networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper presents a processing array architected to consistently achieve timing closure at 100% of the peak DSP clock rate with standard FPGA tools. In the HDL design environment, our processing array operates at the peak DSP clock rates on Xilinx UltraScale (741 MHz) and UltraScale+ (891 MHz) devices. To enhance portability and consistency of timing closure, this array operates at a high clock rate while data SRAMs run at a fraction of this rate. As a proof of concept, this paper outlines a processing array for matrix multiplication and convolution, the most compute-intensive operations of a convolutional neural network (CNN).
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Reconfigurable microfluidic dilution for high-throughput quantitative assays
    Fan, Jinzhen
    Li, Baoqing
    Xing, Siyuan
    Pan, Tingrui
    LAB ON A CHIP, 2015, 15 (12) : 2670 - 2679
  • [22] High-throughput and compact reconfigurable architectures for recursive filters
    Shinde, Vaishali
    Kumar, Ganesh Jai
    Valencia, Daniel
    Alimohammad, Amirhossein
    IET COMMUNICATIONS, 2018, 12 (13) : 1616 - 1623
  • [23] A HIGH-THROUGHPUT NEURAL NETWORK ACCELERATOR
    Chen, Tianshi
    Du, Zidong
    Sun, Ninghui
    Wang, Jia
    Wu, Chengyong
    Chen, Yunji
    Temam, Olivier
    IEEE MICRO, 2015, 35 (03) : 24 - 32
  • [24] High-throughput mesoscopic optical imaging data processing and parsing using differential-guided filtered neural networks
    Zhang, Hong
    Lu, Zhikang
    Gong, Peicong
    Zhang, Shilong
    Yang, Xiaoquan
    Li, Xiangning
    Feng, Zhao
    Li, Anan
    Xiao, Chi
    BRAIN INFORMATICS, 2024, 11 (01)
  • [25] An Uninterrupted Processing Technique-Based High-Throughput and Energy-Efficient Hardware Accelerator for Convolutional Neural Networks
    Islam, Md Najrul
    Shrestha, Rahul
    Chowdhury, Shubhajit Roy
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (12) : 1891 - 1901
  • [26] Logic Design of Neural Networks for High-Throughput and Low-Power Applications
    Xu, Kangwei
    Zhang, Grace Li
    Schlichtmann, Ulf
    Li, Bing
    29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 902 - 907
  • [27] Evaluation of high-throughput screening hits by means of Kohonen neural networks.
    Barnickel, G
    Anzali, S
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1997, 214 : 29 - CINF
  • [28] Using Artificial Neural Networks to boost high-throughput discovery in heterogeneous catalysis
    Baumes, L
    Farrusseng, D
    Lengliz, M
    Mirodatos, C
    QSAR & COMBINATORIAL SCIENCE, 2004, 23 (09): : 767 - 778
  • [29] High-throughput soybean seeds phenotyping with convolutional neural networks and transfer learning
    Yang, Si
    Zheng, Lihua
    He, Peng
    Wu, Tingting
    Sun, Shi
    Wang, Minjuan
    PLANT METHODS, 2021, 17 (01)
  • [30] High-throughput soybean seeds phenotyping with convolutional neural networks and transfer learning
    Si Yang
    Lihua Zheng
    Peng He
    Tingting Wu
    Shi Sun
    Minjuan Wang
    Plant Methods, 17