OPT-GCN: A Unified and Scalable Chiplet-Based Accelerator for High-Performance and Energy-Efficient GCN Computation

被引:0
|
作者
Zhao, Yingnan [1 ]
Wang, Ke [2 ]
Louri, Ahmed [1 ]
机构
[1] George Washington Univ, Dept Elect & Comp Engn, Washington, DC 20052 USA
[2] Univ North Carolina Charlotte, Dept Elect & Comp Engn, Charlotte, NC 28223 USA
关键词
Engines; System-on-chip; Vectors; Inference algorithms; Computer architecture; Energy efficiency; Design automation; Chiplet-based design; graph convolutional network (GCN); hardware accelerator; hardware-algorithm co-design; GRAPH NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1109/TCAD.2024.3401543
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
As the size of real-world graphs continues to grow at an exponential rate, performing the graph convolutional network (GCN) inference efficiently is becoming increasingly challenging. Prior works that employ a unified computing engine with a predefined computation order lack the necessary flexibility and scalability to handle diverse input graph datasets. In this article, we introduce OPT-GCN, a chiplet-based accelerator design that performs GCN inference efficiently while providing flexibility and scalability through an architecture-algorithm co-design. On the architecture side, the proposed design integrates a unified computing engine in each chiplet and an active interposer, both of which are adaptable to efficiently perform the GCN inference and facilitate data communication. On the algorithm side, we propose dynamic scheduling and mapping algorithms to optimize memory access and on-chip computations for diverse GCN applications. Experimental results show that the proposed design provides a memory access reduction by a factor of 11.3x, 3.4x, and 1.4x, and energy savings of 15.2x, 3.7x, and 1.6x on average compared to HyGCN, AWB-GCN, and GCNAX, respectively.
引用
收藏
页码:4827 / 4840
页数:14
相关论文
共 50 条
  • [1] SPRINT: A High-Performance, Energy-Efficient, and Scalable Chiplet-Based Accelerator With Photonic Interconnects for CNN Inference
    Li, Yuan
    Louri, Ahmed
    Karanth, Avinash
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (10) : 2332 - 2345
  • [2] A Flexible Hybrid Interconnection Design for High-Performance and Energy-Efficient Chiplet-Based Systems
    Mahmud, Md Tareq
    Wang, Ke
    IEEE COMPUTER ARCHITECTURE LETTERS, 2024, 23 (02) : 215 - 218
  • [3] SEECHIP: A Scalable and Energy-Efficient Chiplet-based GPU Architecture Using Photonic Links
    Zhang, Hao
    Chen, Yawen
    Huang, Zhiyi
    Zhang, Haibo
    Dai, Fei
    PROCEEDINGS OF THE 52ND INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2023, 2023, : 566 - 575
  • [4] GPACE: An Energy-Efficient PQ-based GCN Accelerator with Redundancy Reduction
    Du, Yibo
    Liang, Shengwen
    Wang, Ying
    Li, Huawei
    Li, Xiaowei
    Han, Yinhe
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [5] ChipAI: A scalable chiplet-based accelerator for efficient DNN inference using silicon photonics
    Zhang, Hao
    Zhang, Haibo
    Huang, Zhiyi
    Chen, Yawen
    JOURNAL OF SYSTEMS ARCHITECTURE, 2025, 158
  • [6] Graphicionado: A High-Performance and Energy-Efficient Accelerator for Graph Analytics
    Ham, Tae Jun
    Wu, Lisa
    Sundaram, Narayanan
    Satish, Nadathur
    Martonosi, Margaret
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [7] HPPI: A High-Performance Photonic Interconnect Design for Chiplet-Based DNN Accelerators
    Li, Guanglong
    Ye, Yaoyao
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, 43 (03) : 812 - 825
  • [8] Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks
    Xie, Xi
    Peng, Hongwu
    Hasan, Amit
    Huang, Shaoyi
    Zhao, Jiahui
    Fang, Haowen
    Zhang, Wei
    Geng, Tong
    Khan, Omer
    Ding, Caiwen
    2023 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2023,
  • [9] TuNao: A High-Performance and Energy-Efficient Reconfigurable Accelerator for Graph Processing
    Zhou, Jinhong
    Liu, Shaoli
    Guo, Qi
    Zhou, Xuda
    Zhi, Tian
    Liu, Daofu
    Wang, Chao
    Zhou, Xuehai
    Chen, Yunji
    Chen, Tianshi
    2017 17TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2017, : 731 - 734
  • [10] High-Performance and Scalable Organosilicon Membranes for Energy-Efficient Alcohol Purification
    Zhu, Tengyang
    Shen, Dongchen
    Dong, Jiayu
    Liu, Huan
    Xia, Qing
    Li, Song
    Shao, Lu
    Wang, Yan
    ADVANCED FUNCTIONAL MATERIALS, 2025, 35 (07)