Performance evaluation of neural network hardware using time-shared bus and integer representation architecture

被引：0

作者：

Yasunaga, M

Ochiai, T

机构：

[1] Univ of Tsukuba, Tsukuba-shi, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 1996年 / E79D卷 / 06期

关键词：

neural networks; parallel computing; parallel programming language; performance evaluation; scalability;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural network hardware using time-shared bus and integer representation architecture has already been fabricated and reported from the design viewpoint. However, nothing related to performance evaluation of hardware has yet been presented. Computation-speed, scalability and learning accuracy of hardware are evaluated theoretically and experimentally using a Back Propagation (BP) algorithm. In addition, a mirror-weight assignment technique is proposed for high-speed computation in the BP. NETTalk, an English-pronunciation-reasoning task, has been chosen as the target application for the BP. In the experiment, recently-developed neuro-hardware based on the above architecture and its parallel programming language are used. An outline of the language is described along with BP programming. Mirror-weight assignment allows maximum speed at 55.0 MCUPS (Million Connections Updated Per Second) using 256 neurons in the hidden-layer (numbers of neurons in input- and output-layers are fixed at 203 and 26 respectively in NETTalk). In addition, if scalability is defined as a function of the number of neurons in the hidden-layer, the machine retains high scalability at 0.5 if such a maximum speed needs to be used. No degradation in learning accuracy occurs when experimental results computed using the neuro-hardware are compared with those obtained by floating-point representation architecture (workstation). The experiment indicates that the present integer representational design of the neuro-hardware is sufficient for NETTalk. Performance has been evaluated theoretically. evaluation purposes, it is assumed that most of the execution-time is taken up by bus cycles. On the basis of this assumption, an analytical model of computation-speed and scalability is proposed. Analytical predictions agreed well with experimental results.

引用

页码：888 / 896

页数：9

共 50 条

[1] IMAGE-PROCESSING SYSTEM WITH TIME-SHARED MULTIFRAME DATA BUS ARCHITECTURE - MFIP
SUGIMOTO, S
MATSUOKA, K
ICHIOKA, Y
PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1983, 435 : 200 - 206
[2] A real-time biomimetic acoustic localizing system using time-shared architecture
Karl, Marianne Nourzad
Karl, Christian
Hubbard, Allyn
UNATTENDED GROUND, SEA, AND AIR SENSOR TECHNOLOGIES AND APPLICATIONS X, 2008, 6963
[3] Performance Evaluation of Long Integer Multiplication Using OpenMP and MPI on Shared Memory Architecture
Tembhurne, Jitendra V.
Sathe, Shailesh R.
2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 283 - 288
[4] Bus Arrival Time Prediction Using Recurrent Neural Network with LSTM Architecture
Agafonov, A. A.
Yumaganov, A. S.
OPTICAL MEMORY AND NEURAL NETWORKS, 2019, 28 (03) : 222 - 230
[5] Bus Arrival Time Prediction Using Recurrent Neural Network with LSTM Architecture
A. A. Agafonov
A. S. Yumaganov
Optical Memory and Neural Networks, 2019, 28 : 222 - 230
[6] Performance Evaluation of Feed-Forward Backpropagation Neural Network for Classification on a Reconfigurable Hardware Architecture
Mohammadi, Mahnaz
Ronge, Rohit
Singapuram, Sanjay S.
Nandy, S. K.
APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 312 - 319
[7] PARALLEL POLYGON SCAN CONVERSION ALGORITHMS - PERFORMANCE EVALUATION ON A SHARED BUS ARCHITECTURE
GHOSAL, D
PATNAIK, LM
COMPUTERS & GRAPHICS, 1986, 10 (01) : 7 - 25
[8] Performance evaluation system for probabilistic neural network hardware
Noriyuki Aibe
Ryosuke Mizuno
Masanori Nakamura
Moritoshi Yasunaga
Ikuo Yoshihara
Artificial Life and Robotics, 2004, 8 (2) : 208 - 213
[9] A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network
An, Fubang
Wang, Lingli
Zhou, Xuegong
ELECTRONICS, 2023, 12 (13)
[10] Creating multiple time-shared laser traps with simultaneous displacement detection using digital signal processing hardware
Guilford, WH
Tournas, JA
Dascalu, D
Watson, DS
ANALYTICAL BIOCHEMISTRY, 2004, 326 (02) : 153 - 166

← 1 2 3 4 5 →