Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引:1
|
作者
Chen, Qingxiang [1 ]
Zheng, Long [1 ]
Liao, Xiaofei [1 ]
Jin, Hai [1 ]
Wang, Qinggang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph analytics; Dataflow architecture; Heterogeneity; FRAMEWORK;
D O I
10.1007/s42514-020-00041-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.
引用
收藏
页码:362 / 375
页数:14
相关论文
共 50 条
  • [31] A real-time and high-performance MobileNet accelerator based on adaptive dataflow scheduling for image classification
    Xiaoting Sang
    Tao Ruan
    Chunlei Li
    Huanyu Li
    Ruimin Yang
    Zhoufeng Liu
    Journal of Real-Time Image Processing, 2024, 21
  • [32] A Microprogrammable Memory Controller for High-Performance Dataflow Applications
    Martin, Jerome
    Bernard, Christian
    Clermidy, Fabien
    Durand, Yves
    2009 PROCEEDINGS OF ESSCIRC, 2009, : 349 - +
  • [33] Buffer Placement and Sizing for High-Performance Dataflow Circuits
    Josipovic, Lana
    Sheikhha, Shabnam
    Guerrieri, Andrea
    Ienne, Paolo
    Cortadella, Jordi
    2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 186 - 196
  • [34] Buffer Placement and Sizing for High-Performance Dataflow Circuits
    Josipovic, Lana
    Sheikhha, Shabnam
    Guerrieri, Andrea
    Ienne, Paolo
    Cortadella, Jordi
    ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (01)
  • [35] PathQuery Pregel: high-performance graph query with bulk synchronous processing
    Bogdan Arsintescu
    Shardul Deo
    Warren Harris
    Pattern Analysis and Applications, 2020, 23 : 1493 - 1504
  • [36] PathQuery Pregel: high-performance graph query with bulk synchronous processing
    Arsintescu, Bogdan
    Deo, Shardul
    Harris, Warren
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1493 - 1504
  • [37] A High-performance DAG Task Scheduling Algorithm for Heterogeneous Networked Embedded Systems
    Xie, Guoqi
    Li, Renfa
    Xiao, Xiongren
    Chen, Yuekun
    2014 IEEE 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2014, : 1011 - 1016
  • [38] LCC-Graph: A High-Performance Graph-Processing Framework with Low Communication Costs
    Cheng, Yongli
    Wang, Fang
    Jiang, Hong
    Hua, Yu
    Feng, Dan
    Wang, Xiuneng
    2016 IEEE/ACM 24TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2016,
  • [39] Window memoization: an efficient hardware architecture for high-performance image processing
    Khalvati, Farzad
    Aagaard, Mark D.
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2010, 5 (03) : 195 - 212
  • [40] High-Performance Computing Architecture for Sample Value Processing in the Smart Grid
    Sun, Le
    Muguira, Leire
    Jimenez, Jaime
    Astarloa, Armando
    Lazaro, Jesus
    IEEE ACCESS, 2022, 10 : 12208 - 12218