HeNCoG: A Heterogeneous Near-memory Computing Architecture for Energy Efficient GCN Acceleration

被引:0
|
作者
Hwang, Seung-Eon [1 ]
Song, Duyeong [1 ]
Park, Jongsun [1 ]
机构
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
Graph Convolutional Network; Sparse Matrix Multiplication; Near-memory Computing; Domain Specific Accelerator; PERFORMANCE;
D O I
10.1109/ISCAS58744.2024.10558133
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Graph convolutional network (GCN), which first applies convolutional operations to process graph data, has gained attention in various tasks involving relational data. Previous GCN accelerators have been designed with heterogeneous cores, considering two stages of inference (aggregation and combination), or with a unified core based on the inference of multi layer as an iterative sparse-dense matrix multiplication. However, those prior works have suffered from an unnecessary large number of multiply-accumulate (MAC) operations and/or main memory accesses. In this paper, we propose HeNCoG, a GCN accelerator that utilizes a heterogeneous MAC array core for the combination stage and a near-memory computing core for the aggregation stage. In HeNCoG, considering that the number of MAC operations is significantly reduced when changing the stage execution order, the combination stage is executed first with a row-stationary dataflow. In the aggregation stage, magneto-resistive random-access memory (MRAM)-based near-memory computing is employed to reduce the number of main memory accesses needed to access the adjacency matrix in the graph dataset. Graph partitioning and double buffering techniques are also applied to further improve hardware efficiencies. Simulation results show that the HeNCoG architecture reduces execution cycles by 97% and memory accesses by 42% compared to previous works.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] Performance Estimation and Prototyping of Reconfigurable Near-Memory Computing Systems
    Iskandar, Veronia
    Abd El Ghany, Mohamed A.
    Goehringer, Diana
    2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 357 - 358
  • [22] Near-Memory Computing With Compressed Embedding Table for Personalized Recommendation
    Lim, Jeongmin
    Kim, Young Geun
    Chung, Sung Woo
    Koushanfar, Farinaz
    Kong, Joonho
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (03) : 938 - 951
  • [23] GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering
    Chen, Longlong
    Zhu, Jianfeng
    Peng, Guiqiang
    Liu, Mingxu
    Wei, Shaojun
    Liu, Leibo
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (12) : 3059 - 3072
  • [24] Energy-Efficient Hardware Acceleration through Computing in the Memory
    Paul, Somnath
    Karam, Robert
    Bhunia, Swarup
    Puri, Ruchir
    2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
  • [25] Exploiting Near-Memory Processing Architectures for Bayesian Neural Networks Acceleration
    Zhao, Yinglin
    Yang, Jianlei
    Jia, Xiaotao
    Wang, Xueyan
    Wang, Zhaohao
    Kang, Wang
    Zhang, Youguang
    Zhao, Weisheng
    2019 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2019), 2019, : 204 - 207
  • [26] Application-Transparent Near-Memory Processing Architecture with Memory Channel Network
    Alian, Mohammad
    Min, Seung Won
    Asgharimoghaddam, Hadi
    Dhar, Ashutosh
    Wang, Dong Kai
    Roewer, Thomas
    McPadden, Adam
    O'Halloran, Oliver
    Chen, Deming
    Xiong, Jinjun
    Kim, Daehoon
    Hwu, Wen-mei
    Kim, Nam Sung
    2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 802 - 814
  • [27] Algorithm/Architecture Co-Design for Near-Memory Processing
    Drumond M.
    Daglis A.
    Mirzadeh N.
    Ustiugov D.
    Picorel J.
    Falsafi B.
    Grot B.
    Pnevmatikatos D.
    2018, Association for Computing Machinery (52): : 109 - 122
  • [28] Charon: Specialized Near-Memory Processing Architecture for Clearing Dead Objects in Memory
    Jang, Jaeyoung
    Heo, Jun
    Lee, Yejin
    Won, Jaeyeon
    Kim, Seonghak
    Jung, Sung Jun
    Hakbeom, Jang
    Ham, Tae Jun
    Lee, Jae Woo
    MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 726 - 739
  • [29] NetDIMM: Low-Latency Near-Memory Network Interface Architecture
    Alian, Mohammad
    Kim, Nam Sung
    MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 699 - 711
  • [30] SongC: A Compiler for Hybrid Near-Memory and In-Memory Many-Core Architecture
    Lin, Junfeng
    Qu, Huanyu
    Ma, Songchen
    Ji, Xinglong
    Li, Hongyi
    Li, Xiaochuan
    Song, Chenhang
    Zhang, Weihao
    IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (10) : 2420 - 2433