Triangle Counting Accelerations: From Algorithm to In-Memory Computing Architecture

被引:13
|
作者
Wang, Xueyan [1 ]
Yang, Jianlei [2 ]
Zhao, Yinglin [1 ]
Jia, Xiaotao [1 ]
Yin, Rong [3 ]
Chen, Xuhang [1 ]
Qu, Gang [4 ,5 ]
Zhao, Weisheng [1 ]
机构
[1] Beihang Univ, Sch Integrated Circuit Sci & Engn, MIIT Key Lab Spintron, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm NLSDE, BDBC, Beijing 100191, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing 100049, Peoples R China
[4] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[5] Univ Maryland, Inst Syst Res, College Pk, MD 20742 USA
基金
中国国家自然科学基金;
关键词
Triangle counting acceleration; processing-in-memory; algorithm-architecture co-design; graph computing; NONVOLATILE MEMORY; ENERGY;
D O I
10.1109/TC.2021.3131049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Triangles are the basic substructure of networks and triangle counting (TC) has been a fundamental graph computing problem in numerous fields such as social network analysis. Nevertheless, like other graph computing problems, due to the high memory-computation ratio and random memory access pattern, TC involves a large amount of data transfers thus suffers from the bandwidth bottleneck in the traditional Von-Neumann architecture. To overcome this challenge, in this paper, we propose to accelerate TC with the emerging processingin-memory (PIM) architecture through an algorithm-architecture co-optimization manner. To enable the efficient in-memory implementations, we come up to reformulate TC with bitwise logic operations (such as AND), and develop customized graph compression and mapping techniques for efficient data flow management. With the emerging computational Spin-Transfer Torque Magnetic RAM(STT-MRAM) array, which is one of the most promising PIM enabling techniques, the device-to-architecture co-simulation results demonstrate that the proposed TC in-memory accelerator outperforms the state-of-the-art GPU and FPGA accelerations by 12.2 x and 31.8 x, respectively, and achieves a 34 x energy efficiency improvement over the FPGA accelerator.
引用
收藏
页码:2462 / 2472
页数:11
相关论文
共 50 条
  • [1] Graph Algorithm Optimization for Spintronics-based In-memory Computing Architecture
    Wang X.
    Chen X.
    Jia X.
    Yang J.
    Qu G.
    Zhao W.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (09): : 3193 - 3199
  • [2] A Crossbar-Based In-Memory Computing Architecture
    Wang, Xinxin
    Zidan, Mohammed A.
    Lu, Wei D.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2020, 67 (12) : 4224 - 4232
  • [3] In-Memory Computing Architecture for Efficient Hardware Security
    Ajmi, Hala
    Zayer, Fakhreddine
    Belgacem, Hamdi
    2024 IEEE 7TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES, SIGNAL AND IMAGE PROCESSING, ATSIP 2024, 2024, : 71 - 76
  • [4] In-Memory Computing Architecture for Efficient Hardware Security
    Ajmi, Hala
    Zayer, Fakhreddine
    Belgacem, Hamdi
    arXiv,
  • [5] A Unified Memory Network Architecture for In-Memory Computing in Commodity Servers
    Zhan, Jia
    Akgun, Itir
    Zhao, Jishen
    Davis, Al
    Faraboschi, Paolo
    Wang, Yuangang
    Xie, Yuan
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [6] Efficient and lightweight in-memory computing architecture for hardware security
    Ajmi, Hala
    Zayer, Fakhreddine
    Fredj, Amira Hadj
    Belgacem, Hamdi
    Mohammad, Baker
    Werghi, Naoufel
    Dias, Jorge
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2024, 190
  • [7] A Flexible In-Memory Computing Architecture for Heterogeneously Quantized CNNs
    Ponzina, Flavio
    Rios, Marco
    Ansaloni, Giovanni
    Levisse, Alexandre
    Atienza, David
    2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 164 - 169
  • [8] TAICHI: A Tiled Architecture for In-Memory Computing and Heterogeneous Integration
    Wang, Xinxin
    Pinkham, Reid
    Zidan, Mohammed A.
    Meng, Fan-Hsuan
    Flynn, Michael P.
    Zhang, Zhengya
    Lu, Wei D.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) : 559 - 563
  • [9] Data movement elimination with dual in-memory computing architecture
    Kang, Wang
    Kou, Jing
    Zhang, Liang
    DEVICE, 2024, 2 (12):
  • [10] ReVAMP : ReRAM based VLIW Architecture for in-Memory comPuting
    Bhattacharjee, Debjyoti
    Devadoss, Rajeswari
    Chattopadhyay, Anupam
    PROCEEDINGS OF THE 2017 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2017, : 782 - 787