Multithreaded Pipeline Synthesis for Data-Parallel Kernels

被引:0
|
作者
Tan, Mingxing [1 ]
Liu, Bin [2 ]
Dai, Steve [1 ]
Zhang, Zhiru [1 ]
机构
[1] Cornell Univ, Sch Elect & Comp Engn, Ithaca, NY 14850 USA
[2] Facebook Inc, Menlo Pk, CA USA
关键词
HIGH-LEVEL SYNTHESIS;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Pipelining is an important technique in high-level synthesis, which overlaps the execution of successive loop iterations or threads to achieve high throughput for loop/function kernels. Since existing pipelining techniques typically enforce in-order thread execution, a variable-latency operation in one thread would block all subsequent threads, resulting in considerable performance degradation. In this paper, we propose a multithreaded pipelining approach that enables context switching to allow out-of-order thread execution for data-parallel kernels. To ensure that the synthesized pipeline is complexity effective, we further propose efficient scheduling algorithms for minimizing the hardware overhead associated with context management. Experimental results show that our proposed techniques can significantly improve the effective pipeline throughput over conventional approaches while conserving hardware resources.
引用
收藏
页码:718 / 725
页数:8
相关论文
共 50 条
  • [21] A DATA-PARALLEL SCIENTIFIC MODELING LANGUAGE
    FRANCIS, RS
    MATHIESON, ID
    WHITING, PG
    DIX, MR
    DAVIES, HL
    ROTSTAYN, LD
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 21 (01) : 46 - 60
  • [22] Language bindings for a data-parallel runtime
    Carpenter, B
    Fox, G
    Leskiw, D
    Li, X
    Wen, Y
    Zhang, G
    THIRD INTERNATIONAL WORKSHOP ON HIGH-LEVEL PARALLEL PROGRAMMING MODELS AND SUPPORTIVE ENVIRONMENTS, PROCEEDINGS, 1998, : 42 - 49
  • [23] AN APPROACH TO CORRECTNESS OF DATA-PARALLEL ALGORITHMS
    GABARRO, J
    GAVALDA, R
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 22 (02) : 185 - 201
  • [24] Data-Parallel Sparse LU Factorization
    SIAM J Sci Comput, 2 (584):
  • [25] Data-parallel sparse LU factorization
    Conroy, JM
    Kratzer, SG
    Lucas, RF
    Naiman, AE
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 19 (02): : 584 - 604
  • [26] Data-Parallel Octrees for Surface Reconstruction
    Zhou, Kun
    Gong, Minmin
    Huang, Xin
    Guo, Baining
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2011, 17 (05) : 669 - 681
  • [27] On privatization of variables for data-parallel execution
    Gupta, M
    11TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM, PROCEEDINGS, 1997, : 533 - 541
  • [28] Developing a data-parallel application with DaParT
    Sener, C
    Paker, Y
    Kiper, A
    PARALLEL PROCESSING APPLIED MATHEMATICS, 2002, 2328 : 280 - 287
  • [29] Pipelined execution of data-parallel algorithms
    Gorev, Maksim
    Ubar, Raimund
    2014 PROCEEDINGS OF THE 14TH BIENNIAL BALTIC ELECTRONICS CONFERENCE (BEC 2014), 2014, : 109 - 112
  • [30] A design methodology for data-parallel applications
    Nyland, LS
    Prins, JF
    Goldberg, A
    Mills, PH
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2000, 26 (04) : 293 - 314