Fast Simulation of a Many-NPL Network-on-Chip for Microarchitectural Design Space Exploration

被引:0
|
作者
Kang, Jintaek [1 ]
Yi, Changjae [1 ]
Lee, Keonjoo [1 ]
Lee, Seungwook [2 ]
Ryu, Soojung [3 ]
Ha, Soonhoi [1 ]
机构
[1] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Samsung Elect, SAIT, Suwon, South Korea
[3] SK Telecom, T3K, Seongnam, South Korea
关键词
Many-NPU System; Simulation Technique; Simulator Acceleration;
D O I
10.1109/DSD53832.2021.00029
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A viable solution to cope with the ever-increasing computation complexity of deep learning applications is to integrate many neural processing units (NPUs) in a chip where a network-on-chip (NoC) is used as the communication fabric. Since the design space of an NoC is huge, the network topology is first selected based on the communication patterns of applications with a high-level performance estimation method. After the network topology is selected, the microarchitectural design space exploration is performed with a cycle-level NoC simulator. However, the existing NoC simulator is so slow that design space exploration of the microarchitecture is usually conducted manually in a narrow space. Since a synthetic trace is used, the simulation accuracy is also limited. To overcome these weaknesses, we present a simulation technique that is fast and accurate enough for microarchitectural design space of an NoC. In the proposed technique, we use the real communication trace from the many-NPU simulation without NoC consideration. To this end, we define the trace format that defines the interface between a many-NPU simulator and the NoC simulator. To accelerate simulation speed, we propose a parallelization technique at the cluster level in the simulation of the hierarchical NoC. The key technique is to manage the timestamps of events at the cluster boundary to do without time synchronization error. And, we adjust the abstraction level of simulation models to reduce the number of modules in the SystcmC NoC simulation. With the proposed technique, we could achieve up to 40 times speed-up for 32 NPU system, compared with the FlexNoC simulator.
引用
收藏
页码:131 / 138
页数:8
相关论文
共 50 条
  • [31] Network-on-Chip design and synthesis outlook
    Atienza, David
    Angiolini, Federico
    Murali, Srinivasan
    Pullini, Antonio
    Benini, Luca
    De Micheli, Giovanni
    INTEGRATION-THE VLSI JOURNAL, 2008, 41 (03) : 340 - 359
  • [32] The Design and Implementation of a Hierarchical Network-on-Chip
    Braham, Chorfi Chandarli
    Ji Weixing
    Zhang Lingyu
    2012 INTERNATIONAL WORKSHOP ON INFORMATION AND ELECTRONICS ENGINEERING, 2012, 29 : 2966 - 2974
  • [33] Network-on-chip architectures and design methods
    Benini, L
    Bertozzi, D
    IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 2005, 152 (02): : 261 - 272
  • [34] Hierarchical network-on-chip design method
    Microprocessor Research and Development Center, Peking University, Beijing 100871, China
    Beijing Daxue Xuebao Ziran Kexue Ban, 2007, 5 (669-676):
  • [35] A Fast, Source-synchronous Ring-based Network-on-Chip Design
    Mandal, Ayan
    Khatri, Sunil P.
    Mahapatra, Rabi N.
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 1489 - 1494
  • [36] Communication analysis for network-on-chip design
    Siebenborn, A
    Bringmann, O
    Rosenstiel, W
    INTERNATIONAL CONFERENCE ON PARALLEL COMPUTING IN ELECTRICAL ENGINEERING, 2004, : 315 - 320
  • [37] Network-on-chip architectures and design methodologies
    Palesi, Maurizio
    Kumar, Shashi
    Marculescu, Radu
    MICROPROCESSORS AND MICROSYSTEMS, 2011, 35 (02) : 83 - 84
  • [38] Chip Multiprocessor Design Space Exploration through Statistical Simulation
    Genbrugge, Davy
    Eeckhout, Lieven
    IEEE TRANSACTIONS ON COMPUTERS, 2009, 58 (12) : 1668 - 1681
  • [39] Floorplanning Exploration and Performance Evaluation of a New Network-on-Chip
    Xue, Licheng
    Ji, Weixing
    Zuo, Qi
    Zhang, Yang
    2011 DESIGN, AUTOMATION & TEST IN EUROPE (DATE), 2011, : 625 - 630
  • [40] Accelerating Network-on-Chip Simulation via Sampling
    Dai, Wenbo
    Jerger, Natalie Enright
    2014 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2014, : 135 - 136