Enhancing iteration performance on distributed task-based workflows

被引:0
|
作者
Barcelo, Alex [1 ]
Queralt, Anna [1 ,2 ]
Cortes, Toni [1 ,2 ]
机构
[1] Barcelona Supercomp Ctr, Barcelona, Spain
[2] Univ Politecn Cataluna, Barcelona, Spain
关键词
Task-based workflows; Distributed computing; Object store; Active storage; Dataset iteration;
D O I
10.1016/j.future.2023.07.032
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Task-based programming models have proven to be a robust and versatile way to approach development of applications for distributed environments. They provide natural programming patterns with high performance. However, execution on this paradigm can be very sensitive to granularity -i.e., the quantity and execution length of tasks. Granularity is often linked with the block size of the data, and finding the optimal block size has several challenges, as it requires inner knowledge of the computing environment.Our proposal is to supplement the task-based programming model with a new mechanism -our SplIter proposal. At its core, the SplIter provides a transparent way to split a collection into partitions (logical groups of blocks, obtained without any transfers nor data rearrangement), which can then be iterated. Tasks are linked to those partitions, which means that SplIter breaks the dependency between block size and task granularity.The evaluation shows that the SplIter is able to achieve performance improvements of over one order of magnitude when compared to the baseline, and it is either competitive or strictly better (depending on application characteristics) to the competitor alternative. We have chosen different applications covering a wide variety of scenarios; those applications are representatives of a broader set of applications and domains. The changes required in the source code of a task-based application are minimal, preserving the high programmability of the programming model. Two different state-of-the-art task-based frameworks have been evaluated for all the applications: COMPSs and Dask, showing that the SplIter can be effectively used within different frameworks.& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页码:359 / 375
页数:17
相关论文
共 50 条
  • [1] Managing Failures in Task-Based Parallel Workflows in Distributed Computing Environments
    Ejarque, Jorge
    Bertran, Marta
    Cid-Fuentes, Javier Alvarez
    Conejero, Javier
    Badia, Rosa M.
    EURO-PAR 2020: PARALLEL PROCESSING, 2020, 12247 : 411 - 425
  • [2] Unified Implementation and Simplification for Task-Based Authorization Security in Workflows
    Zhong, Wenjing
    Zhao, Jinjing
    Hu, Hesuan
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (05) : 3796 - 3811
  • [3] A Programming Model for Hybrid Workflows: combining Task-based Workflows and Dataflows all-in-one
    Ramon-Cortes, Cristian
    Lordan, Francesc
    Ejarque, Jorge
    Badia, Rosa M.
    arXiv, 2020,
  • [4] A programming model for Hybrid Workflows: Combining task-based workflows and dataflows all-in-one
    Ramon-Cortes, Cristian
    Lordan, Francesc
    Ejarque, Jorge
    Badia, Rosa M.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 113 : 281 - 297
  • [5] Distributed Task-Based Training of Tree Models
    Yan, Da
    Chowdhury, Md Mashiur Rahman
    Guo, Guimu
    Kahlil, Jalal
    Jiang, Zhe
    Prasad, Sushil
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2237 - 2249
  • [6] The task is not enough: Processing approaches to task-based performance
    Skehan, Peter
    Xiaoyue, Bei
    Qian, Li
    Wang, Zhan
    LANGUAGE TEACHING RESEARCH, 2012, 16 (02) : 170 - 187
  • [7] ENHANCED VISUALISATION OF POTENTIAL UNPLANNED ITERATION TIME IN TASK-BASED DSMS
    Minogue, Paschal
    PROCEEDINGS OF THE 11TH INTERNATIONAL DSM CONFERENCE, 2009, : 155 - 158
  • [8] Enhancing automaticity through task-based language learning
    De Ridder, Isabelle
    Vangehuchten, Lieve
    Gomez, Marta Sesena
    APPLIED LINGUISTICS, 2007, 28 (02) : 309 - 315
  • [9] Distributed Task-based Runtime Systems - Current State and Micro-Benchmark Performance
    Hoque, Reazul
    Shamis, Pavel
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 934 - 941
  • [10] Implementing the Broadcast Operation in a Distributed Task-based Runtime
    Ceccato, Rodrigo
    Yviquel, Herve
    Pereira, Marcio
    Souza, Alan
    Araujo, Guido
    2022 IEEE 34TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE AND HIGH PERFORMANCE COMPUTING WORKSHOPS (SBAC-PADW 2022), 2022, : 25 - 32