Triangle Counting Accelerations: From Algorithm to In-Memory Computing Architecture

被引：13

作者：

Wang, Xueyan ^{[1
]}

Yang, Jianlei ^{[2
]}

Zhao, Yinglin ^{[1
]}

Jia, Xiaotao ^{[1
]}

Yin, Rong ^{[3
]}

Chen, Xuhang ^{[1
]}

Qu, Gang ^{[4
,5
]}

Zhao, Weisheng ^{[1
]}

机构：

[1] Beihang Univ, Sch Integrated Circuit Sci & Engn, MIIT Key Lab Spintron, Beijing 100191, Peoples R China

[2] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm NLSDE, BDBC, Beijing 100191, Peoples R China

[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100049, Peoples R China

[4] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA

[5] Univ Maryland, Inst Syst Res, College Pk, MD 20742 USA

来源：

IEEE TRANSACTIONS ON COMPUTERS | 2022年 / 71卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Triangle counting acceleration; processing-in-memory; algorithm-architecture co-design; graph computing; NONVOLATILE MEMORY; ENERGY;

D O I：

10.1109/TC.2021.3131049

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Triangles are the basic substructure of networks and triangle counting (TC) has been a fundamental graph computing problem in numerous fields such as social network analysis. Nevertheless, like other graph computing problems, due to the high memory-computation ratio and random memory access pattern, TC involves a large amount of data transfers thus suffers from the bandwidth bottleneck in the traditional Von-Neumann architecture. To overcome this challenge, in this paper, we propose to accelerate TC with the emerging processingin-memory (PIM) architecture through an algorithm-architecture co-optimization manner. To enable the efficient in-memory implementations, we come up to reformulate TC with bitwise logic operations (such as AND), and develop customized graph compression and mapping techniques for efficient data flow management. With the emerging computational Spin-Transfer Torque Magnetic RAM(STT-MRAM) array, which is one of the most promising PIM enabling techniques, the device-to-architecture co-simulation results demonstrate that the proposed TC in-memory accelerator outperforms the state-of-the-art GPU and FPGA accelerations by 12.2 x and 31.8 x, respectively, and achieves a 34 x energy efficiency improvement over the FPGA accelerator.

引用

页码：2462 / 2472

页数：11

共 50 条

[1] Graph Algorithm Optimization for Spintronics-based In-memory Computing Architecture
Wang X.
Chen X.
Jia X.
Yang J.
Qu G.
Zhao W.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (09): : 3193 - 3199
[2] A Crossbar-Based In-Memory Computing Architecture
Wang, Xinxin
Zidan, Mohammed A.
Lu, Wei D.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (12) : 4224 - 4232
[3] In-Memory Computing Architecture for Efficient Hardware Security
Ajmi, Hala
Zayer, Fakhreddine
Belgacem, Hamdi
2024 IEEE 7TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES, SIGNAL AND IMAGE PROCESSING, ATSIP 2024, 2024, : 71 - 76
[4] In-Memory Computing Architecture for Efficient Hardware Security
Ajmi, Hala
Zayer, Fakhreddine
Belgacem, Hamdi
arXiv,
[5] A Unified Memory Network Architecture for In-Memory Computing in Commodity Servers
Zhan, Jia
Akgun, Itir
Zhao, Jishen
Davis, Al
Faraboschi, Paolo
Wang, Yuangang
Xie, Yuan
2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
[6] Efficient and lightweight in-memory computing architecture for hardware security
Ajmi, Hala
Zayer, Fakhreddine
Fredj, Amira Hadj
Belgacem, Hamdi
Mohammad, Baker
Werghi, Naoufel
Dias, Jorge
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 190
[7] A Flexible In-Memory Computing Architecture for Heterogeneously Quantized CNNs
Ponzina, Flavio
Rios, Marco
Ansaloni, Giovanni
Levisse, Alexandre
Atienza, David
2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 164 - 169
[8] TAICHI: A Tiled Architecture for In-Memory Computing and Heterogeneous Integration
Wang, Xinxin
Pinkham, Reid
Zidan, Mohammed A.
Meng, Fan-Hsuan
Flynn, Michael P.
Zhang, Zhengya
Lu, Wei D.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 559 - 563
[9] Data movement elimination with dual in-memory computing architecture
Kang, Wang
Kou, Jing
Zhang, Liang
DEVICE, 2024, 2 (12):
[10] ReVAMP : ReRAM based VLIW Architecture for in-Memory comPuting
Bhattacharjee, Debjyoti
Devadoss, Rajeswari
Chattopadhyay, Anupam
PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 782 - 787

← 1 2 3 4 5 →