COMPESCE: A Co-design Approach for Memory Subsystem Performance Analysis in HPC Many-Cores

被引:0
|
作者
Portero, Antoni [1 ]
Falquez, Carlos [1 ]
Ho, Nam [1 ]
Petrakis, Polydoros [2 ]
Nassyr, Stepan [1 ]
Marazakis, Manolis [2 ]
Dolbeau, Romain [3 ]
Cifuentes, Jorge Alejandro Nocua [4 ]
Alvarez, Luis Bertran [4 ]
Pleiter, Dirk [5 ]
Suarez, Estela [1 ]
机构
[1] Forschungszentrum Julich, Julich Supercomp Ctr, Novel Syst Architectures Design, Julich, Germany
[2] Fdn Res & Technol Hellas FORTH, Inst Comp Sci, Iraklion, Greece
[3] SiPearl, Rennes, France
[4] ATOS, Les Clayes Sous Bois, France
[5] Royal Inst Technol, KTH, Stockholm, Sweden
关键词
Co-design; HPC; Network on Chip; gem5;
D O I
10.1007/978-3-031-42785-5_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the memory subsystem design through gem5 simulations of a non-uniform memory access (NUMA) architecture with ARM cores equipped with vector engines. And connected to a Network-on-Chip (NoC) following the Coherent Hub Interface (CHI) protocol. The study quantifies the benefits of vectorization, prefetching, and multichannel NoC configurations using a benchmark for generating memory patterns and indexed accesses. The outcomes provide insights into improving bus utilization and bandwidth and reducing stalls in the system. The paper proposes hardware/software (HW/SW) advancements to reach and use the HBM device with a higher percentage than 80% at the memory controllers in the simulated manycore system.
引用
收藏
页码:105 / 119
页数:15
相关论文
共 26 条
  • [1] Worst-Case Memory Traffic Analysis for Many-Cores using a Limited Migrative Model
    Nikolic, Borislav
    Yomsi, Patrick Meumeu
    Petters, Stefan M.
    2013 IEEE 19TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA), 2013, : 42 - 51
  • [2] A DECENTRALIZED APPROACH FOR MULTI-SUBSYSTEM CO-DESIGN OPTIMIZATION USING DIRECT COLLOCATION METHOD
    Liu, Tianchen
    Azarm, Shapour
    Chopra, Nikhil
    PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2017, VOL 2A, 2017,
  • [3] Performance Analysis Techniques for the Exascale Co-Design Process
    Schulz, Martin
    Belak, Jim
    Bhatele, Abhinav
    Bremer, Peer-Timo
    Bronevetsky, Greg
    Casas, Marc
    Gamblin, Todd
    Isaacs, Katherine E.
    Laguna, Ignacio
    Levine, Joshua
    Pascucci, Valerio
    Richards, David
    Rountree, Barry
    PARALLEL COMPUTING: ACCELERATING COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, 25 : 19 - 32
  • [4] The integrated care performance assessment tool: a co-design approach
    Dates, Mariana
    Tedeschi, Micol
    Lennox-Chhugani, Niamh
    INTERNATIONAL JOURNAL OF INTEGRATED CARE, 2019, 19
  • [5] Automatic SoC Design Flow on Many-core Processors: a Software Hardware Co-Design Approach for FPGAs
    Liu, Ling
    Morozov, Oleksii
    Han, Yuxing
    Gutknecht, Juerg
    Hunziker, Patrick
    FPGA 11: PROCEEDINGS OF THE 2011 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, 2011, : 37 - 40
  • [6] Adaptive Quantization as a Device-Algorithm Co-Design Approach to Improve the Performance of In-Memory Unsupervised Learning With SNNs
    Shi, Yuhan
    Huang, Zhisheng
    Oh, Sangheon
    Kaslan, Nathan
    Song, Jungwoo
    Kuzum, Duygu
    IEEE TRANSACTIONS ON ELECTRON DEVICES, 2019, 66 (04) : 1722 - 1728
  • [7] High Performance Computing for Precision Landing and Hazard Avoidance and Co-design Approach
    Rutishauser, David
    Prothro, John
    Moore, Ronn
    Yim, Hester
    2019 IEEE AEROSPACE CONFERENCE, 2019,
  • [8] Performance analysis of embedded systems in the virtual component co-design environment
    Garg, P
    Gupta, A
    Rozenblit, JW
    11TH IEEE INTERNATIONAL CONFERENCE AND WORKSHOP ON THE ENGINEERING OF COMPUTER-BASED SYSTEMS, PROCEEDINGS, 2004, : 61 - 68
  • [9] Impact of Small Antenna on Linear Power Amplifier Performance in a Co-design Approach
    Ben Abdallah, Essia
    Giry, Alexandre
    Bories, Serge
    Nicolas, Dominique
    Delaveaud, Christophe
    2015 IEEE 13TH INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS), 2015,
  • [10] Performance evaluation over HW/SW co-design SoC memory transfers for a CNN accelerator
    Rios-Navarro, A.
    Tapiador-Morales, R.
    Jimenez-Fernandez, A.
    Amaya, C.
    Dominguez-Morales, M.
    Delbruck, T.
    Linares-Barranco, A.
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON NANOTECHNOLOGY (IEEE-NANO), 2018,