Effective runtime scheduling for high-performance graph processing on heterogeneous dataflow architecture

被引:1
|
作者
Chen, Qingxiang [1 ]
Zheng, Long [1 ]
Liao, Xiaofei [1 ]
Jin, Hai [1 ]
Wang, Qinggang [1 ]
机构
[1] Huazhong Univ Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Natl Engn Res Ctr Big Data Technol & Syst, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph analytics; Dataflow architecture; Heterogeneity; FRAMEWORK;
D O I
10.1007/s42514-020-00041-w
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Graph processing is widely used in modern society, such as social networks, bioinformatics, and information networks. It is observed that the dataflow architecture has been demonstrated to effectively resolve the challenges of low instruction-level parallelism and branch mispredictions in the existing general-purpose architecture for graph applications. In this paper, toward a customized heterogeneous dataflow architecture that integrates the hardware advantages of both dataflow architecture and traditional control architecture, we propose a novel runtime system that can adaptively offload each subgraph to an appropriate underlying architecture. We also present a hybrid execution model to drive optimal performance. Our implementation on a CPU-FPGA platform shows that our approach achieves 2.2x throughput improvement over a state-of-art CPU-FPGA graph processing accelerator and 2.4x throughput improvement over a state-of-art FPGA-based design.
引用
收藏
页码:362 / 375
页数:14
相关论文
共 50 条
  • [21] VGL: a high-performance graph processing framework for the NEC SX-Aurora TSUBASA vector architecture
    Afanasyev, Ilya V.
    Voevodin, Vladimir V.
    Komatsu, Kazuhiko
    Kobayashi, Hiroaki
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (08): : 8694 - 8715
  • [22] An energy reduction scheduling mechanism for a high-performance SoC architecture
    Chu, SL
    EMBEDDED AND UBIQUITOUS COMPUTING - EUC 2005, 2005, 3824 : 281 - 290
  • [23] FAST: A High-Performance Architecture for Heterogeneous Big Data Forensics
    Pungila, Ciprian
    Negru, Viorel
    INTERNATIONAL JOINT CONFERENCE SOCO'17- CISIS'17-ICEUTE'17 PROCEEDINGS, 2018, 649 : 618 - 627
  • [24] Hierarchical Parallelization and Runtime Scheduling for Pregel-like Graph Processing Systems
    Li, Zengxiang
    Duan, Rubing
    Wang, Long
    Lu, Sifei
    Qin, Zheng
    Goh, Rick Siow Mong
    2014 IEEE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING TECHNOLOGY AND SCIENCE (CLOUDCOM), 2014, : 491 - 498
  • [25] MACRO PIPELINING BASED SCHEDULING ON HIGH-PERFORMANCE HETEROGENEOUS MULTIPROCESSOR SYSTEMS
    BANERJEE, S
    HAMADA, T
    CHAU, PM
    FELLMAN, RD
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (06) : 1468 - 1484
  • [26] Evolution of a high-performance PC architecture data processing system
    Turri, M
    DASIA 99: DATA SYSTEMS IN AEROSPACE, 1999, 447 : 73 - 78
  • [27] PSS: A novel statement scheduling mechanism for a high-performance SoC architecture
    Chu, SL
    TENTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2004, : 690 - 697
  • [28] The open runtime platform: a flexible high-performance managed runtime environment
    Cierniak, M
    Eng, M
    Glew, N
    Lewis, B
    Stichnoth, J
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2005, 17 (5-6): : 617 - 637
  • [29] A real-time and high-performance MobileNet accelerator based on adaptive dataflow scheduling for image classification
    Sang, Xiaoting
    Ruan, Tao
    Li, Chunlei
    Li, Huanyu
    Yang, Ruimin
    Liu, Zhoufeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (01)
  • [30] High-performance architecture
    Sherwin-Williams
    不详
    Finsh. Today, 2007, 2 (22-24):