Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引：1

作者：

Chen, Qingxiang ^{[1
]}

Zheng, Long ^{[1
]}

Liao, Xiaofei ^{[1
]}

Jin, Hai ^{[1
]}

Wang, Qinggang ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan 430074, Peoples R China

来源：

CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING | 2020年 / 2卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Graph analytics; Dataflow architecture; Heterogeneity; FRAMEWORK;

D O I：

10.1007/s42514-020-00041-w

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.

引用

页码：362 / 375

页数：14

共 50 条

[31] A real-time and high-performance MobileNet accelerator based on adaptive dataflow scheduling for image classification
Xiaoting Sang
Tao Ruan
Chunlei Li
Huanyu Li
Ruimin Yang
Zhoufeng Liu
Journal of Real-Time Image Processing, 2024, 21
[32] A Microprogrammable Memory Controller for High-Performance Dataflow Applications
Martin, Jerome
Bernard, Christian
Clermidy, Fabien
Durand, Yves
2009 PROCEEDINGS OF ESSCIRC, 2009, : 349 - +
[33] Buffer Placement and Sizing for High-Performance Dataflow Circuits
Josipovic, Lana
Sheikhha, Shabnam
Guerrieri, Andrea
Ienne, Paolo
Cortadella, Jordi
2020 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA '20), 2020, : 186 - 196
[34] Buffer Placement and Sizing for High-Performance Dataflow Circuits
Josipovic, Lana
Sheikhha, Shabnam
Guerrieri, Andrea
Ienne, Paolo
Cortadella, Jordi
ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2022, 15 (01)
[35] PathQuery Pregel: high-performance graph query with bulk synchronous processing
Bogdan Arsintescu
Shardul Deo
Warren Harris
Pattern Analysis and Applications, 2020, 23 : 1493 - 1504
[36] PathQuery Pregel: high-performance graph query with bulk synchronous processing
Arsintescu, Bogdan
Deo, Shardul
Harris, Warren
PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1493 - 1504
[37] A High-performance DAG Task Scheduling Algorithm for Heterogeneous Networked Embedded Systems
Xie, Guoqi
Li, Renfa
Xiao, Xiongren
Chen, Yuekun
2014 IEEE 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2014, : 1011 - 1016
[38] LCC-Graph: A High-Performance Graph-Processing Framework with Low Communication Costs
Cheng, Yongli
Wang, Fang
Jiang, Hong
Hua, Yu
Feng, Dan
Wang, Xiuneng
2016 IEEE/ACM 24TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2016,
[39] Window memoization: an efficient hardware architecture for high-performance image processing
Khalvati, Farzad
Aagaard, Mark D.
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2010, 5 (03) : 195 - 212
[40] High-Performance Computing Architecture for Sample Value Processing in the Smart Grid
Sun, Le
Muguira, Leire
Jimenez, Jaime
Astarloa, Armando
Lazaro, Jesus
IEEE ACCESS, 2022, 10 : 12208 - 12218

← 1 2 3 4 5 →