Performance evaluation of neural network hardware using time-shared bus and integer representation architecture

被引:0
|
作者
Yasunaga, M
Ochiai, T
机构
[1] Univ of Tsukuba, Tsukuba-shi, Japan
关键词
neural networks; parallel computing; parallel programming language; performance evaluation; scalability;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural network hardware using time-shared bus and integer representation architecture has already been fabricated and reported from the design viewpoint. However, nothing related to performance evaluation of hardware has yet been presented. Computation-speed, scalability and learning accuracy of hardware are evaluated theoretically and experimentally using a Back Propagation (BP) algorithm. In addition, a mirror-weight assignment technique is proposed for high-speed computation in the BP. NETTalk, an English-pronunciation-reasoning task, has been chosen as the target application for the BP. In the experiment, recently-developed neuro-hardware based on the above architecture and its parallel programming language are used. An outline of the language is described along with BP programming. Mirror-weight assignment allows maximum speed at 55.0 MCUPS (Million Connections Updated Per Second) using 256 neurons in the hidden-layer (numbers of neurons in input- and output-layers are fixed at 203 and 26 respectively in NETTalk). In addition, if scalability is defined as a function of the number of neurons in the hidden-layer, the machine retains high scalability at 0.5 if such a maximum speed needs to be used. No degradation in learning accuracy occurs when experimental results computed using the neuro-hardware are compared with those obtained by floating-point representation architecture (workstation). The experiment indicates that the present integer representational design of the neuro-hardware is sufficient for NETTalk. Performance has been evaluated theoretically. evaluation purposes, it is assumed that most of the execution-time is taken up by bus cycles. On the basis of this assumption, an analytical model of computation-speed and scalability is proposed. Analytical predictions agreed well with experimental results.
引用
收藏
页码:888 / 896
页数:9
相关论文
共 50 条
  • [1] IMAGE-PROCESSING SYSTEM WITH TIME-SHARED MULTIFRAME DATA BUS ARCHITECTURE - MFIP
    SUGIMOTO, S
    MATSUOKA, K
    ICHIOKA, Y
    PROCEEDINGS OF THE SOCIETY OF PHOTO-OPTICAL INSTRUMENTATION ENGINEERS, 1983, 435 : 200 - 206
  • [2] A real-time biomimetic acoustic localizing system using time-shared architecture
    Karl, Marianne Nourzad
    Karl, Christian
    Hubbard, Allyn
    UNATTENDED GROUND, SEA, AND AIR SENSOR TECHNOLOGIES AND APPLICATIONS X, 2008, 6963
  • [3] Performance Evaluation of Long Integer Multiplication Using OpenMP and MPI on Shared Memory Architecture
    Tembhurne, Jitendra V.
    Sathe, Shailesh R.
    2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 283 - 288
  • [4] Bus Arrival Time Prediction Using Recurrent Neural Network with LSTM Architecture
    Agafonov, A. A.
    Yumaganov, A. S.
    OPTICAL MEMORY AND NEURAL NETWORKS, 2019, 28 (03) : 222 - 230
  • [5] Bus Arrival Time Prediction Using Recurrent Neural Network with LSTM Architecture
    A. A. Agafonov
    A. S. Yumaganov
    Optical Memory and Neural Networks, 2019, 28 : 222 - 230
  • [6] Performance Evaluation of Feed-Forward Backpropagation Neural Network for Classification on a Reconfigurable Hardware Architecture
    Mohammadi, Mahnaz
    Ronge, Rohit
    Singapuram, Sanjay S.
    Nandy, S. K.
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 312 - 319
  • [7] PARALLEL POLYGON SCAN CONVERSION ALGORITHMS - PERFORMANCE EVALUATION ON A SHARED BUS ARCHITECTURE
    GHOSAL, D
    PATNAIK, LM
    COMPUTERS & GRAPHICS, 1986, 10 (01) : 7 - 25
  • [8] Performance evaluation system for probabilistic neural network hardware
    Noriyuki Aibe
    Ryosuke Mizuno
    Masanori Nakamura
    Moritoshi Yasunaga
    Ikuo Yoshihara
    Artificial Life and Robotics, 2004, 8 (2) : 208 - 213
  • [9] A High Performance Reconfigurable Hardware Architecture for Lightweight Convolutional Neural Network
    An, Fubang
    Wang, Lingli
    Zhou, Xuegong
    ELECTRONICS, 2023, 12 (13)
  • [10] Creating multiple time-shared laser traps with simultaneous displacement detection using digital signal processing hardware
    Guilford, WH
    Tournas, JA
    Dascalu, D
    Watson, DS
    ANALYTICAL BIOCHEMISTRY, 2004, 326 (02) : 153 - 166