Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引:0
|
作者
Qingxiang Chen
Long Zheng
Xiaofei Liao
Hai Jin
Qinggang Wang
机构
[1] Huazhong University of Science and Technology,National Engineering Research Center for Big Data Technology and System/Service Computing Technology and System Lab/Cluster and Grid Computing Lab
关键词
Graph analytics; Dataflow architecture; Heterogeneity;
D O I
暂无
中图分类号
学科分类号
摘要
Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.
引用
收藏
页码:362 / 375
页数:13
相关论文
共 50 条
  • [1] Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture
    Chen, Qingxiang
    Zheng, Long
    Liao, Xiaofei
    Jin, Hai
    Wang, Qinggang
    CCF TRANSACTIONS ON HIGH PERFORMANCE COMPUTING, 2020, 2 (04) : 362 - 375
  • [2] Dynamically reconfigurable dataflow architecture for high-performance digital signal processing
    Voigt, S.
    Baesler, M.
    Teufel, T.
    JOURNAL OF SYSTEMS ARCHITECTURE, 2010, 56 (11) : 561 - 576
  • [3] DFMan: A Graph-based Optimization of Dataflow Scheduling on High-Performance Computing Systems
    Chowdhury, Fahim
    Di Natale, Francesco
    Moody, Adam
    Mohror, Kathryn
    Yu, Weikuan
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 368 - 378
  • [4] Effective High-Level Synthesis for High-Performance Graph Processing
    Tang J.
    Zheng L.
    Liao X.
    Jin H.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2021, 58 (03): : 467 - 478
  • [5] Dynamically reconfigurable dataflow architecture for high-performance digital signal processing on multi-FPGA platforms
    Voigt, Sven-Ole
    Teufel, Thomas
    2007 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 2007, : 633 - 637
  • [6] Performance Model for Parallel Matrix Multiplication with Dryad: Dataflow Graph Runtime
    Li, Hui
    Fox, Geoffrey
    Qiu, Judy
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 675 - 683
  • [7] High-performance VLSI architecture for video processing
    Navarro, H
    Montiel-Nelson, JA
    Sosa, J
    García, JC
    Sarmiento, R
    Nooshabadi, S
    VLSI CIRCUITS AND SYSTEMS, 2003, 5117 : 175 - 186
  • [8] High-performance architecture for digital transform processing
    Mora, H.
    Signes-Pont, M. T.
    Jimeno-Morenilla, A.
    Sanchez-Romero, J. L.
    JOURNAL OF SUPERCOMPUTING, 2019, 75 (03): : 1336 - 1349
  • [9] A HIGH-PERFORMANCE RECONFIGURABLE PARALLEL PROCESSING ARCHITECTURE
    SHIVELY, RR
    MORGAN, EB
    COPLEY, TW
    GORIN, AL
    PROCEEDINGS : SUPERCOMPUTING 89, 1989, : 505 - 509
  • [10] HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators
    Ahmad, Masab
    Dogan, Halit
    Michael, Christopher J.
    Khan, Omer
    2019 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS), 2019, : 268 - 281