HeNCoG: A Heterogeneous Near-memory Computing Architecture for Energy Efficient GCN Acceleration

被引：0

作者：

Hwang, Seung-Eon ^{[1
]}

Song, Duyeong ^{[1
]}

Park, Jongsun ^{[1
]}

机构：

[1] Korea Univ, Sch Elect Engn, Seoul, South Korea

来源：

2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024 | 2024年

基金：

新加坡国家研究基金会;

关键词：

Graph Convolutional Network; Sparse Matrix Multiplication; Near-memory Computing; Domain Specific Accelerator; PERFORMANCE;

D O I：

10.1109/ISCAS58744.2024.10558133

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Graph convolutional network (GCN), which first applies convolutional operations to process graph data, has gained attention in various tasks involving relational data. Previous GCN accelerators have been designed with heterogeneous cores, considering two stages of inference (aggregation and combination), or with a unified core based on the inference of multi layer as an iterative sparse-dense matrix multiplication. However, those prior works have suffered from an unnecessary large number of multiply-accumulate (MAC) operations and/or main memory accesses. In this paper, we propose HeNCoG, a GCN accelerator that utilizes a heterogeneous MAC array core for the combination stage and a near-memory computing core for the aggregation stage. In HeNCoG, considering that the number of MAC operations is significantly reduced when changing the stage execution order, the combination stage is executed first with a row-stationary dataflow. In the aggregation stage, magneto-resistive random-access memory (MRAM)-based near-memory computing is employed to reduce the number of main memory accesses needed to access the adjacency matrix in the graph dataset. Graph partitioning and double buffering techniques are also applied to further improve hardware efficiencies. Simulation results show that the HeNCoG architecture reduces execution cycles by 97% and memory accesses by 42% compared to previous works.

引用

页数：5

共 50 条

[21] Performance Estimation and Prototyping of Reconfigurable Near-Memory Computing Systems
Iskandar, Veronia
Abd El Ghany, Mohamed A.
Goehringer, Diana
2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL, 2023, : 357 - 358
[22] Near-Memory Computing With Compressed Embedding Table for Personalized Recommendation
Lim, Jeongmin
Kim, Young Geun
Chung, Sung Woo
Koushanfar, Farinaz
Kong, Joonho
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (03) : 938 - 951
[23] GEM: Ultra-Efficient Near-Memory Reconfigurable Acceleration for Read Mapping by Dividing and Predictive Scattering
Chen, Longlong
Zhu, Jianfeng
Peng, Guiqiang
Liu, Mingxu
Wei, Shaojun
Liu, Leibo
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (12) : 3059 - 3072
[24] Energy-Efficient Hardware Acceleration through Computing in the Memory
Paul, Somnath
Karam, Robert
Bhunia, Swarup
Puri, Ruchir
2014 DESIGN, AUTOMATION AND TEST IN EUROPE CONFERENCE AND EXHIBITION (DATE), 2014,
[25] Exploiting Near-Memory Processing Architectures for Bayesian Neural Networks Acceleration
Zhao, Yinglin
Yang, Jianlei
Jia, Xiaotao
Wang, Xueyan
Wang, Zhaohao
Kang, Wang
Zhang, Youguang
Zhao, Weisheng
2019 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2019), 2019, : 204 - 207
[26] Application-Transparent Near-Memory Processing Architecture with Memory Channel Network
Alian, Mohammad
Min, Seung Won
Asgharimoghaddam, Hadi
Dhar, Ashutosh
Wang, Dong Kai
Roewer, Thomas
McPadden, Adam
O'Halloran, Oliver
Chen, Deming
Xiong, Jinjun
Kim, Daehoon
Hwu, Wen-mei
Kim, Nam Sung
2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2018, : 802 - 814
[27] Algorithm/Architecture Co-Design for Near-Memory Processing
Drumond M.
Daglis A.
Mirzadeh N.
Ustiugov D.
Picorel J.
Falsafi B.
Grot B.
Pnevmatikatos D.
2018, Association for Computing Machinery (52): : 109 - 122
[28] Charon: Specialized Near-Memory Processing Architecture for Clearing Dead Objects in Memory
Jang, Jaeyoung
Heo, Jun
Lee, Yejin
Won, Jaeyeon
Kim, Seonghak
Jung, Sung Jun
Hakbeom, Jang
Ham, Tae Jun
Lee, Jae Woo
MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 726 - 739
[29] NetDIMM: Low-Latency Near-Memory Network Interface Architecture
Alian, Mohammad
Kim, Nam Sung
MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, 2019, : 699 - 711
[30] SongC: A Compiler for Hybrid Near-Memory and In-Memory Many-Core Architecture
Lin, Junfeng
Qu, Huanyu
Ma, Songchen
Ji, Xinglong
Li, Hongyi
Li, Xiaochuan
Song, Chenhang
Zhang, Weihao
IEEE TRANSACTIONS ON COMPUTERS, 2024, 73 (10) : 2420 - 2433

← 1 2 3 4 5 →