A High-Throughput Reconfigurable Processing Array for Neural Networks

被引：0

作者：

Wu, Ephrem ^{[1
]}

Zhang, Xiaoqian ^{[1
]}

Berman, David ^{[1
]}

Cho, Inkeun ^{[1
]}

机构：

[1] Xilinx Inc, San Jose, CA 95124 USA

来源：

2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL) | 2017年

关键词：

convolutional neural networks; timing closure; matrix multiplication; FPGA; DSP; cache; memory bandwidth;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

FPGA-based neural-networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper presents a processing array architected to consistently achieve timing closure at 100% of the peak DSP clock rate with standard FPGA tools. In the HDL design environment, our processing array operates at the peak DSP clock rates on Xilinx UltraScale (741 MHz) and UltraScale+ (891 MHz) devices. To enhance portability and consistency of timing closure, this array operates at a high clock rate while data SRAMs run at a fraction of this rate. As a proof of concept, this paper outlines a processing array for matrix multiplication and convolution, the most compute-intensive operations of a convolutional neural network (CNN).

引用

页数：4

共 50 条

[21] Reconfigurable microfluidic dilution for high-throughput quantitative assays
Fan, Jinzhen
Li, Baoqing
Xing, Siyuan
Pan, Tingrui
LAB ON A CHIP, 2015, 15 (12) : 2670 - 2679
[22] High-throughput and compact reconfigurable architectures for recursive filters
Shinde, Vaishali
Kumar, Ganesh Jai
Valencia, Daniel
Alimohammad, Amirhossein
IET COMMUNICATIONS, 2018, 12 (13) : 1616 - 1623
[23] A HIGH-THROUGHPUT NEURAL NETWORK ACCELERATOR
Chen, Tianshi
Du, Zidong
Sun, Ninghui
Wang, Jia
Wu, Chengyong
Chen, Yunji
Temam, Olivier
IEEE MICRO, 2015, 35 (03) : 24 - 32
[24] High-throughput mesoscopic optical imaging data processing and parsing using differential-guided filtered neural networks
Zhang, Hong
Lu, Zhikang
Gong, Peicong
Zhang, Shilong
Yang, Xiaoquan
Li, Xiangning
Feng, Zhao
Li, Anan
Xiao, Chi
BRAIN INFORMATICS, 2024, 11 (01)
[25] An Uninterrupted Processing Technique-Based High-Throughput and Energy-Efficient Hardware Accelerator for Convolutional Neural Networks
Islam, Md Najrul
Shrestha, Rahul
Chowdhury, Shubhajit Roy
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 30 (12) : 1891 - 1901
[26] Logic Design of Neural Networks for High-Throughput and Low-Power Applications
Xu, Kangwei
Zhang, Grace Li
Schlichtmann, Ulf
Li, Bing
29TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE, ASP-DAC 2024, 2024, : 902 - 907
[27] Evaluation of high-throughput screening hits by means of Kohonen neural networks.
Barnickel, G
Anzali, S
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1997, 214 : 29 - CINF
[28] Using Artificial Neural Networks to boost high-throughput discovery in heterogeneous catalysis
Baumes, L
Farrusseng, D
Lengliz, M
Mirodatos, C
QSAR & COMBINATORIAL SCIENCE, 2004, 23 (09): : 767 - 778
[29] High-throughput soybean seeds phenotyping with convolutional neural networks and transfer learning
Yang, Si
Zheng, Lihua
He, Peng
Wu, Tingting
Sun, Shi
Wang, Minjuan
PLANT METHODS, 2021, 17 (01)
[30] High-throughput soybean seeds phenotyping with convolutional neural networks and transfer learning
Si Yang
Lihua Zheng
Peng He
Tingting Wu
Shi Sun
Minjuan Wang
Plant Methods, 17

← 1 2 3 4 5 →