Maximizing System Performance by Balancing Computation Loads in LSTM Accelerators

Authors
Park, Junki [1]
Kung, Jaeha [1]
Yi, Wooseok [1]
Kim, Jae-Joon [1]
Affiliation
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
LSTM is a popular neural network model for modeling and analyzing time-varying data. Its main operation is matrix-vector multiplication, which becomes sparse (spMxV) because of the weight pruning widely adopted in deep learning. This paper presents a new sparse matrix format, named CBSR, to maximize the inference speed of an LSTM accelerator. In the CBSR format, speed-up is achieved by balancing the computation loads across processing elements (PEs). Along with the new format, we present a simple network transformation that completely removes the hardware overhead incurred when using the CBSR format. We also analyze in detail the impact of network size and the number of PEs, which prior work lacks. Simulation results show a 1.6~38% improvement in system performance compared to the well-known CSC/CSR formats. A power analysis in 65 nm CMOS technology shows 9~22% energy savings.
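As a rough illustration of why load balance matters for spMxV, the sketch below builds the baseline CSR format and counts the nonzeros each PE would process. The row-interleaved assignment of rows to PEs, the function names, and the example matrix are assumptions made for illustration. The abstract does not describe the CBSR layout, so this is not the paper's method.

```python
def dense_to_csr(matrix):
    """Convert a dense row-major matrix to CSR (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in matrix:
        for j, w in enumerate(row):
            if w != 0:
                values.append(w)
                col_idx.append(j)
        row_ptr.append(len(values))  # end offset of this row's nonzeros
    return values, col_idx, row_ptr


def csr_spmxv(values, col_idx, row_ptr, x):
    """Sparse matrix-vector product y = W @ x with W stored in CSR."""
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += values[k] * x[col_idx[k]]
        y.append(acc)
    return y


def pe_loads(row_ptr, num_pes):
    """Nonzeros handled by each PE when row i goes to PE (i mod num_pes).

    The row-interleaved mapping is an assumption for illustration only.
    """
    loads = [0] * num_pes
    for i in range(len(row_ptr) - 1):
        loads[i % num_pes] += row_ptr[i + 1] - row_ptr[i]
    return loads


# Hypothetical pruned weight matrix and input vector.
W = [
    [1, 0, 2, 0],
    [0, 0, 0, 3],
    [4, 5, 6, 0],
    [0, 0, 0, 0],
]
x = [1, 2, 3, 4]

values, col_idx, row_ptr = dense_to_csr(W)
y = csr_spmxv(values, col_idx, row_ptr, x)
loads = pe_loads(row_ptr, num_pes=2)
print(y, loads)
```

With two PEs in this toy case, one PE processes five nonzeros and the other only one, so the busier PE sets the completion time even though an even split would need three multiply-accumulates per PE. Evening out such imbalance is the kind of gain the abstract attributes to CBSR.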
Pages: 7-12
Page count: 6