Maximizing System Performance by Balancing Computation Loads in LSTM Accelerators

被引:0
|
作者
Park, Junki [1 ]
Kung, Jaeha [1 ]
Yi, Wooseok [1 ]
Kim, Jae-Joon [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The LSTM is a popular neural network model for modeling or analyzing the time-varying data. The main operation of LSTM is a matrix-vector multiplication and it becomes sparse (spMxV) due to the widely-accepted weight pruning in deep learning. This paper presents a new sparse matrix format, named CBSR, to maximize the inference speed of the LSTM accelerator. In the CBSR format, speed-up is achieved by balancing out the computation loads over PEs. Along with the new format, we present a simple network transformation to completely remove the hardware overhead incurred when using the CBSR format. Also, the detailed analysis on the impact of network size or the number of PEs is performed, which lacks in the prior work. The simulation results show 1.6 similar to 38% improvement in the system performance compared to the well-known CSC/CSR format. The power analysis is also performed in 65nm CMOS technology to show 9 similar to 22% energy savings.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [21] High performance computation and visualization of EMs using an integrated computation system
    Lu, JW
    COUPLING OF FLUIDS, STRUCTURES AND WAVES IN AERONAUTICS, PROCEEDINGS, 2003, 85 : 182 - 195
  • [22] Maximizing Drilling Performance through Enhanced Solid Control System
    Irawan, S.
    Kinif, B. I.
    Bayuaji, R.
    INTERNATIONAL CONFERENCE OF APPLIED SCIENCE AND TECHNOLOGY FOR INFRASTRUCTURE ENGINEERING, 2017, 267
  • [23] JANUS: A Compilation System for Balancing Parallelism and Performance in OpenVX
    Omidian, Hossein
    Lemieux, Guy G. F.
    2ND INTERNATIONAL CONFERENCE ON MACHINE VISION AND INFORMATION TECHNOLOGY (CMVIT 2018), 2018, 1004
  • [24] Performance analysis and portability of the PLUM load balancing system
    Oliker, L
    Biswas, R
    Gabow, HN
    EURO-PAR '98 PARALLEL PROCESSING, 1998, 1470 : 307 - 317
  • [25] Performance evaluation of dynamic load balancing system for clusters
    Tang, Dan
    Jin, Hai
    Zhang, Yong-Kun
    Jisuanji Xuebao/Chinese Journal of Computers, 2004, 27 (06): : 803 - 811
  • [26] Performance Analysis of Standalone PV System at Varying Loads
    Girdhar, Vaishali
    Vadhera, Shelly
    Mittal, Monika
    2018 IEEE 8TH POWER INDIA INTERNATIONAL CONFERENCE (PIICON), 2018,
  • [27] Impact of PHEV Loads on the Dynamic Performance of Power System
    Islam, F. R.
    Pota, H. R.
    Mahmud, M. A.
    Hossain, M. J.
    2010 20TH AUSTRALASIAN UNIVERSITIES POWER ENGINEERING CONFERENCE (AUPEC 2010): POWER QUALITY FOR THE 21ST CENTURY, 2010,
  • [28] RPPC: a Holistic Runtime System for Maximizing Performance under Power Capping
    Park, Jinsu
    Park, Seongbeom
    Baek, Woongki
    2018 18TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2018, : 41 - 50
  • [29] Maximizing critical fan system performance - Improving reliability, saving money
    Kelly, S
    CHEMICAL PROCESSING, 2002, 65 (02): : 36 - 39
  • [30] Maximizing ORC performance with optimal match of working fluid with system design
    Barse, Kirtipal A.
    Mann, Michael D.
    APPLIED THERMAL ENGINEERING, 2016, 100 : 11 - 19