Fast Simulation of a Many-NPU Network-on-Chip for Microarchitectural Design Space Exploration

Authors
Kang, Jintaek [1]
Yi, Changjae [1]
Lee, Keonjoo [1]
Lee, Seungwook [2]
Ryu, Soojung [3]
Ha, Soonhoi [1]
Affiliations
[1] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Samsung Elect, SAIT, Suwon, South Korea
[3] SK Telecom, T3K, Seongnam, South Korea
Keywords
Many-NPU System; Simulation Technique; Simulator Acceleration
DOI
10.1109/DSD53832.2021.00029
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
A viable way to cope with the ever-increasing computational complexity of deep learning applications is to integrate many neural processing units (NPUs) on a chip, with a network-on-chip (NoC) as the communication fabric. Since the design space of an NoC is huge, the network topology is first selected based on the communication patterns of the applications, using a high-level performance estimation method. After the network topology is selected, microarchitectural design space exploration is performed with a cycle-level NoC simulator. However, the existing NoC simulator is so slow that design space exploration of the microarchitecture is usually conducted manually in a narrow space. Since a synthetic trace is used, the simulation accuracy is also limited. To overcome these weaknesses, we present a simulation technique that is fast and accurate enough for microarchitectural design space exploration of an NoC. The proposed technique uses the real communication trace obtained from a many-NPU simulation that does not consider the NoC. To this end, we define a trace format that serves as the interface between the many-NPU simulator and the NoC simulator. To accelerate simulation, we propose a cluster-level parallelization technique for simulating the hierarchical NoC. The key idea is to manage the timestamps of events at the cluster boundary so that no time synchronization error occurs. In addition, we adjust the abstraction level of the simulation models to reduce the number of modules in the SystemC NoC simulation. With the proposed technique, we achieve up to a 40-fold speed-up for a 32-NPU system compared with the FlexNoC simulator.
Pages: 131-138
Number of pages: 8
Related papers
50 in total
  • [21] Fast Simulation Techniques for Design Space Exploration
    Knorreck, Daniel
    Apvrille, Ludovic
    Pacalet, Renaud
    OBJECTS, COMPONENTS, MODELS AND PATTERNS, PROCEEDINGS, 2009, 33 : 308 - 327
  • [22] Efficient Application Specification for Network-on-Chip Exploration
    Zhang, Zhe
    Hu, Xiaoming
    2009 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL 2, PROCEEDINGS, 2009, : 368 - 371
  • [23] A power and energy exploration of Network-on-Chip architectures
    Banerjee, Arnab
    Mullins, Robert
    Moore, Simon
    NOCS 2007: FIRST INTERNATIONAL SYMPOSIUM ON NETWORKS-ON-CHIP, PROCEEDINGS, 2007, : 163 - +
  • [24] A Simulation Environment for Design Space Exploration for Asymmetric 3D-Network-on-Chip
    Joseph, Jan Moritz
    Wrieden, Sven
    Blochwitz, Christopher
    Garcia-Ortiz, Alberto
    Pionteck, Thilo
    2016 11TH INTERNATIONAL SYMPOSIUM ON RECONFIGURABLE COMMUNICATION-CENTRIC SYSTEMS-ON-CHIP (RECOSOC), 2016,
  • [25] An Energy and Performance Exploration of Network-on-Chip Architectures
    Banerjee, Arnab
    Wolkotte, Pascal T.
    Mullins, Robert D.
    Moore, Simon W.
    Smit, Gerard J. M.
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2009, 17 (03) : 319 - 329
  • [26] Asynchronous network node design for network-on-chip
    Wang, X
    Sigüenza-Tortosa, D
    Ahonen, T
    Nurmi, J
    ISSCS 2005: International Symposium on Signals, Circuits and Systems, Vols 1 and 2, Proceedings, 2005, : 55 - 58
  • [27] Switch design and implementation for Network-on-Chip
    Ye, Qiang
    Liu, Jian
    Zheng, Li-Rong
    Proceedings of the Seventh IEEE CPMT Conference on High Density Microsystem Design, Packaging and Failure Analysis (HDP'05), 2005, : 412 - 418
  • [28] Wire-driven microarchitectural design space exploration
    Ekpanyapong, M
    Lim, SK
    Ballapuram, C
    Lee, HHS
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 1867 - 1870
  • [29] A Fast Network-on-Chip Simulator with QEMU and SystemC
    Nakajima, Keita
    Hieda, Takuji
    Taniguchi, Ittetsu
    Tomiyama, Hiroyuki
    Takada, Hiroaki
    2012 THIRD INTERNATIONAL CONFERENCE ON NETWORKING AND COMPUTING (ICNC 2012), 2012, : 298 - 301
  • [30] Reconfigurable Router Design for Network-On-Chip
    Mathew, Minu
    Mugilan, D.
    2014 IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2014), 2014, : 1268 - 1272