Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引：1

作者：

Chen, Qingxiang ^{[1
]}

Zheng, Long ^{[1
]}

Liao, Xiaofei ^{[1
]}

Jin, Hai ^{[1
]}

Wang, Qinggang ^{[1
]}

机构：

[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan 430074, Peoples R China

来源：

CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING | 2020年 / 2卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Graph analytics; Dataflow architecture; Heterogeneity; FRAMEWORK;

D O I：

10.1007/s42514-020-00041-w

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.

引用

页码：362 / 375

页数：14

共 50 条

[21] VGL: a high-performance graph processing framework for the NEC SX-Aurora TSUBASA vector architecture
Afanasyev, Ilya V.
Voevodin, Vladimir V.
Komatsu, Kazuhiko
Kobayashi, Hiroaki
JOURNAL OF SUPERCOMPUTING, 2021, 77 (08): : 8694 - 8715
[22] An energy reduction scheduling mechanism for a high-performance SoC architecture
Chu, SL
EMBEDDED AND UBIQUITOUS COMPUTING - EUC 2005, 2005, 3824 : 281 - 290
[23] FAST: A High-Performance Architecture for Heterogeneous Big Data Forensics
Pungila, Ciprian
Negru, Viorel
INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 618 - 627
[24] Hierarchical Parallelization and Runtime Scheduling for Pregel-like Graph Processing Systems
Li, Zengxiang
Duan, Rubing
Wang, Long
Lu, Sifei
Qin, Zheng
Goh, Rick Siow Mong
2014 IEEE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2014, : 491 - 498
[25] MACRO PIPELINING BASED SCHEDULING ON HIGH-PERFORMANCE HETEROGENEOUS MULTIPROCESSOR SYSTEMS
BANERJEE, S
HAMADA, T
CHAU, PM
FELLMAN, RD
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (06) : 1468 - 1484
[26] Evolution of a high-performance PC architecture data processing system
Turri, M
DASIA 99: DATA SYSTEMS IN AEROSPACE, 1999, 447 : 73 - 78
[27] PSS: A novel statement scheduling mechanism for a high-performance SoC architecture
Chu, SL
TENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2004, : 690 - 697
[28] The open runtime platform: a flexible high-performance managed runtime environment
Cierniak, M
Eng, M
Glew, N
Lewis, B
Stichnoth, J
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2005, 17 (5-6): : 617 - 637
[29] A real-time and high-performance MobileNet accelerator based on adaptive dataflow scheduling for image classification
Sang, Xiaoting
Ruan, Tao
Li, Chunlei
Li, Huanyu
Yang, Ruimin
Liu, Zhoufeng
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (01)
[30] High-performance architecture
Sherwin-Williams
不详
Finsh. Today, 2007, 2 (22-24):

← 1 2 3 4 5 →