The Tiny-Tasks Granularity Trade-Off: Balancing Overhead Versus Performance in Parallel Systems

被引:3
|
作者
Bora, Stefan [1 ]
Walker, Brenton [1 ]
Fidler, Markus [1 ]
机构
[1] Leibniz Univ Hannover, Inst Commun Technol, Hannover 30167, Germany
关键词
Task analysis; Sparks; Servers; Analytical models; Cluster computing; Parallel processing; Synchronization; Network calculus; parallel processing; performance bounds; processing overhead; Spark; synchronization constraints; task granularity; tiny-tasks; FORK; SYNCHRONIZATION; MPI/OPENMP; APPROXIMATIONS; QUEUES; MODELS; OPENMP;
D O I
10.1109/TPDS.2022.3233712
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
of parallel processing systems typically assume that one has l workers and jobs are split into an equal number of k = l tasks. Splitting jobs into k > l smaller tasks, i.e. using tiny tasks, can yield performance and stability improvements because it reduces the variance in the amount of work assigned to each worker, but ask increases, the overhead involved in scheduling and managing the tasks begins to overtake the performance benefit. We perform extensive experiments on the effects of task granularity on an Apache Spark cluster, and based on these, develop a four parameter model for task and job overhead that, in simulation, produces sojourn time distributions that match those of the real system. We also present analytical results which illustrate how using tiny tasks improves the stability region of split-merge systems, and analytical bounds on the sojourn and waiting time distributions of both split-merge and single-queue fork-join systems with tiny tasks. Finally we combine the overhead model with the analytical models to produce an analytical approximation to the sojourn and waiting time distributions of systems with tiny tasks which include overhead. We also perform analogous tiny-tasks experiments on a hybrid multi-processor shared memory system based on MPI and OpenMP which has no load-balancing between nodes. Though no longer strict analytical bounds, our analytical approximations with overhead match both the Spark and MPI/OpenMP experimental results very well.
引用
收藏
页码:1128 / 1144
页数:17
相关论文
共 50 条
  • [31] Predicting the Performance-Cost Trade-off of Applications Across Multiple Systems
    Nassereldine, Amir
    Diab, Safaa
    Baydoun, Mohammed
    Leach, Kenneth
    Alt, Maxim
    Milojicic, Dejan
    El Hajj, Izzat
    2023 IEEE/ACM 23RD INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING, CCGRID, 2023, : 216 - 228
  • [32] A QoC-based model for performance and QoE trade-off in distributed systems
    D'Amato, Andre
    Dantas, Mario
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2017, 29 (18):
  • [33] On the rate versus diversity trade-off in multi-channel wireless transmission systems
    Nam, Haewoon
    Kot, Young-Chai
    Alouini, Mohamed-Slim
    2007 IEEE WIRELESS COMMUNICATIONS & NETWORKING CONFERENCE, VOLS 1-9, 2007, : 1922 - +
  • [34] Security Versus Accuracy: Trade-Off Data Modeling to Safe Fault Classification Systems
    Zhuo, Yue
    Song, Zhihuan
    Ge, Zhiqiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12095 - 12106
  • [35] Carrier Aggregation for Improved Rate versus Power trade-off in Massive MIMO Systems
    Zappone, Alessio
    Perez, David Lopez
    De Domenico, Antonio
    Piovesan, Nicola
    Bao, Harvey
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1655 - 1660
  • [36] TRADE-OFF ANALYSIS OF BUFFER STOCK VERSUS EMERGENCY DELIVERY IN THE KNOCKDOWN PRODUCTION SYSTEMS
    HIRAKI, S
    ISHII, K
    TAKAHASHI, K
    MURAMATSU, R
    ENGINEERING COSTS AND PRODUCTION ECONOMICS, 1990, 20 (02): : 129 - 138
  • [37] Towards Optimal Performance-Area Trade-Off in Adders by Synthesis of Parallel Prefix Structures
    Roy, Subhendu
    Choudhury, Mihir
    Puri, Ruchir
    Pan, David Z.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2014, 33 (10) : 1517 - 1530
  • [38] A case study in productivity-cost trade-off in the design of paced parallel production systems
    Freiheit, T.
    Wang, W.
    Spicer, Patrick
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2007, 45 (14) : 3263 - 3288
  • [39] Towards Optimal Performance-area Trade-off in Adders by Synthesis of Parallel Prefix Structures
    Roy, Subhendu
    Choudhury, Mihir
    Puri, Ruchir
    Pan, David Z.
    2013 50TH ACM / EDAC / IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2013,
  • [40] Network and Economic Trade-Off Performance Regions of Cognitive Radio Systems with Power Control
    Samano-Robles, Ramiro
    Gameiro, Atilio
    APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (05): : 1755 - 1766