Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引:1
|
作者
Chen, Qingxiang [1 ]
Zheng, Long [1 ]
Liao, Xiaofei [1 ]
Jin, Hai [1 ]
Wang, Qinggang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph analytics; Dataflow architecture; Heterogeneity; FRAMEWORK;
D O I
10.1007/s42514-020-00041-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.
引用
收藏
页码:362 / 375
页数:14
相关论文
共 50 条
  • [1] Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
    Qingxiang Chen
    Long Zheng
    Xiaofei Liao
    Hai Jin
    Qinggang Wang
    CCF Transactions on High Performance Computing, 2020, 2 : 362 - 375
  • [2] Dynamically reconfigurable dataflow architecture for high-performance digital signal processing
    Voigt, S.
    Baesler, M.
    Teufel, T.
    JOURNAL OF SYSTEMS ARCHITECTURE, 2010, 56 (11) : 561 - 576
  • [3] DFMan: A Graph-based Optimization of Dataflow Scheduling on High-Performance Computing Systems
    Chowdhury, Fahim
    Di Natale, Francesco
    Moody, Adam
    Mohror, Kathryn
    Yu, Weikuan
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 368 - 378
  • [4] Effective High-Level Synthesis for High-Performance Graph Processing
    Tang J.
    Zheng L.
    Liao X.
    Jin H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (03): : 467 - 478
  • [5] Dynamically reconfigurable dataflow architecture for high-performance digital signal processing on multi-FPGA platforms
    Voigt, Sven-Ole
    Teufel, Thomas
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 633 - 637
  • [6] Performance Model for Parallel Matrix Multiplication with Dryad: Dataflow Graph Runtime
    Li, Hui
    Fox, Geoffrey
    Qiu, Judy
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 675 - 683
  • [7] High-performance VLSI architecture for video processing
    Navarro, H
    Montiel-Nelson, JA
    Sosa, J
    García, JC
    Sarmiento, R
    Nooshabadi, S
    VLSI CIRCUITS AND SYSTEMS, 2003, 5117 : 175 - 186
  • [8] High-performance architecture for digital transform processing
    Mora, H.
    Signes-Pont, M. T.
    Jimeno-Morenilla, A.
    Sanchez-Romero, J. L.
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (03): : 1336 - 1349
  • [9] A HIGH-PERFORMANCE RECONFIGURABLE PARALLEL PROCESSING ARCHITECTURE
    SHIVELY, RR
    MORGAN, EB
    COPLEY, TW
    GORIN, AL
    PROCEEDINGS : SUPERCOMPUTING 89, 1989, : 505 - 509
  • [10] HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators
    Ahmad, Masab
    Dogan, Halit
    Michael, Christopher J.
    Khan, Omer
    2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 268 - 281