The Flexible Preconditions Model for Macro-Dataflow Execution

被引:0
|
作者
Sbirlea, Dragos [1 ]
Sbirlea, Alina [1 ]
Wheeler, Kyle B. [2 ]
Sarkar, Vivek [1 ]
机构
[1] Rice Univ, Houston, TX 77251 USA
[2] Micron Technol Inc, Boise, ID USA
关键词
D O I
10.1109/DFM.2013.13
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose the flexible preconditions model for macro-dataflow execution. Our approach unifies two current approaches for managing task dependences, eager execution vs. strict preconditions. When one of the two outperforms the other, flexible preconditions can always attain, and possibly surpass, the performance of the better approach. This work focuses on the performance of parallel programming models based on macro-dataflow, in which applications are composed of tasks and inter-task dependences. Data-flow models usually make a choice between specifying the task dependences before task creation (as strict preconditions), or during task execution, when they are actually needed (eager execution). This paper shows how the choice between eager execution and strict preconditions affects the performance, memory consumption and expressiveness of macro-dataflow applications. The flexible preconditions model is sufficiently flexible to support both eager execution and strict preconditions, as well as hybrid combinations thereof. This capability enables programmers and future auto-tuning systems to pick the precondition combination that yields the best performance for a given application. The experimental evaluation was performed on a 32-core SMP, and is based on a new macro-dataflow implementation, QtCnC, that supports eager execution, strict preconditions and flexible preconditions in a single framework. (QtCnC is an implementation of the CnC model on the QThreads library.) For applications where all dependences are known ahead of time, flexible and strict preconditions execute up to 56% faster than eager execution (for the benchmarks and platform used in our study). On the other hand, for applications where the complete set of per-task dependences is determined after the tasks are spawned, flexible preconditions and eager execution perform up to 38% better than strict preconditions.
引用
收藏
页码:51 / 58
页数:8
相关论文
共 50 条
  • [41] The Design and Implementation of TIDeFlow: A Dataflow-Inspired Execution Model for Parallel Loops and Task Pipelining
    Orozco, Daniel
    Garcia, Elkin
    Pavel, Robert
    Arteaga, Jaime
    Gao, Guang
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (02) : 278 - 307
  • [42] Scheduled Dataflow: Execution paradigm, architecture, and performance evaluation
    Kavi, KM
    Giorgi, R
    Arul, J
    IEEE TRANSACTIONS ON COMPUTERS, 2001, 50 (08) : 834 - 846
  • [43] The Design and Implementation of TIDeFlow: A Dataflow-Inspired Execution Model for Parallel Loops and Task Pipelining
    Daniel Orozco
    Elkin Garcia
    Robert Pavel
    Jaime Arteaga
    Guang Gao
    International Journal of Parallel Programming, 2016, 44 : 278 - 307
  • [44] Transforming Loop Chains via Macro Dataflow Graphs
    Davis, Eddie C.
    Strout, Michelle Mills
    Olschanowsky, Catherine
    PROCEEDINGS OF THE 2018 INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO'18), 2018, : 277 - 289
  • [45] A flexible dataflow CNN accelerator on FPGA
    Li, Haoran
    Gong, Lei
    Wang, Chao
    Zhou, Xuehai
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING WORKSHOPS, CCGRIDW, 2023, : 302 - 304
  • [46] SYSTEMS DESIGN SPACE EXPLORATION BY SERIAL DATAFLOW PROGRAM EXECUTION
    Casale-Brunet, S.
    Mattavelli, M.
    Alberti, C.
    Janneck, J. W.
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 1805 - 1809
  • [47] Reproducing the behavior of a parallel program by using dataflow execution models
    Takahashi, N
    Miei, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1997, E80D (04) : 495 - 503
  • [48] Utilizing dataflow-based execution for coupled cluster methods
    McCraw, Heike
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2014, 248
  • [49] Parallel Dataflow Execution for Sequential Programs on Reconfigurable Hybrid MPSoCs
    Wang, Chao
    Li, Xi
    Zhou, Xuehai
    Ha, Yajun
    2012 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT'12), 2012, : 53 - 56
  • [50] Towards a Scalable, Flexible and High Performance NFV Execution Model
    Zheng, Peng
    Narayanan, Arvind
    Zhang, Zhi-Li
    CONEXT'19 COMPANION: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON EMERGING NETWORKING EXPERIMENTS AND TECHNOLOGIES, 2019, : 68 - 69