Dynamic Straggler Mitigation for Large-Scale Spatial Simulations

被引:0
|
作者
Bin Khunayn, Eman [1 ]
Xie, Hairuo [2 ]
Karunasekera, Shanika [2 ]
Ramamohanarao, Kotagiri [3 ]
机构
[1] King Abdulaziz City Sci & Technol KACST, Riyadh, Saudi Arabia
[2] Univ Melbourne, Melbourne, Australia
[3] Australian Acad Sci, Canberra, Australia
关键词
Spatial simulation; stragglers; BSP; load balancing; traffic simulation;
D O I
10.1145/3578933
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Spatial simulations have been widely used to study real-world environments, such as transportation systems. Applications like prediction and analysis of transportation require the simulation to handle millions of objects while running faster than real time. Running such large-scale simulation requires high computational power, which can be provided through parallel distributed computing. Implementations of parallel distributed spatial simulations usually follow a bulk synchronous parallel (BSP) model to ensure the correctness of simulation. The processing in BSP is divided into iterations of computation and communication, running on multiple workers, followed by a global barrier synchronisation to ensure that all communications are concluded. Unfortunately, the BSP model is plagued by the straggler problem, where a delay in any worker slows down the entire simulation. Stragglers may occur for many reasons, including imbalanced workload distribution or communication and synchronisation delays. The straggler problem can become more severe with increasing parallelism and continuous change of workload distribution among workers. This article proposes methods to dynamically mitigate stragglers and tackle communication delays. The proposed strategies can rebalance the workload distribution during simulation. These methods employ the spatial properties of the simulated environments to combine a flexible synchronisation model with decentralised dynamic load balancing and on-demand resource allocation. All proposed methods are implemented and evaluated using a microscopic traffic simulator as an example of large-scale spatial simulations. We run traffic simulations for Melbourne, Beijing and New York with different straggler scenarios. Our methods significantly improve simulation performance compared to advanced methods such as global dynamic load balancing.
引用
收藏
页数:34
相关论文
共 50 条
  • [41] LARGE-SCALE COMPUTER-SIMULATIONS OF STATIC AND DYNAMIC PROPERTIES OF DISORDERED MATERIALS
    ARBABI, S
    SAHIMI, M
    MOLECULAR SIMULATION, 1991, 8 (1-2) : 1 - 22
  • [42] A large-scale simulation model of pandemic influenza outbreaks for development of dynamic mitigation strategies
    Das, Tapas K.
    Savachkin, Alex A.
    Zhu, Yiliang
    IIE TRANSACTIONS, 2008, 40 (09) : 893 - 905
  • [43] Straggler-Aware Gradient Aggregation for Large-Scale Distributed Deep Learning System
    Li, Yijun
    Huang, Jiawei
    Li, Zhaoyi
    Liu, Jingling
    Zhou, Shengwen
    Zhang, Tao
    Jiang, Wanchun
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024, 32 (06) : 4917 - 4930
  • [44] Mitigation Method of Rockfall Hazard on Rock Slope Using Large-Scale Field Tests and Numerical Simulations
    Yoon, Jinam
    Ban, Hoki
    Hwang, Youngcheol
    Park, Duhee
    ADVANCES IN CIVIL ENGINEERING, 2020, 2020
  • [45] Large-scale magnetic resonance simulations: A tutorial
    Concilio, Maria Grazia
    MAGNETIC RESONANCE IN CHEMISTRY, 2020, 58 (08) : 691 - 717
  • [46] New eigensolvers for large-scale nanoscience simulations
    Canning, A.
    Marques, O.
    Voemel, C.
    Wang, Lin-Wang
    Dongarra, J.
    Langou, J.
    Tomov, S.
    SCIDAC 2008: SCIENTIFIC DISCOVERY THROUGH ADVANCED COMPUTING, 2008, 125
  • [47] A survey on VV&A of large-scale simulations
    Wang Y.
    Li J.
    Hongbo S.
    Li Y.
    Akhtar F.
    Imran A.
    International Journal of Crowd Science, 2019, 3 (01) : 63 - 86
  • [48] Large-scale simulations of the Zhang sandpile model
    Lubeck, S.
    Physical Review E. Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 1997, 56 (02):
  • [49] Large-scale simulations of concentrated emulsion flows
    Zinchenko, AZ
    Davis, RH
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2003, 361 (1806): : 813 - 845
  • [50] Conservative synchronization of large-scale network simulations
    Park, A
    Fujimoto, RM
    Perumalla, KS
    18TH WORKSHOP ON PARALLEL AND DISTRIBUTED SIMULATION, PROCEEDINGS, 2004, : 153 - 161