Dynamic Straggler Mitigation for Large-Scale Spatial Simulations

被引:0
|
作者
Bin Khunayn, Eman [1 ]
Xie, Hairuo [2 ]
Karunasekera, Shanika [2 ]
Ramamohanarao, Kotagiri [3 ]
机构
[1] King Abdulaziz City Sci & Technol KACST, Riyadh, Saudi Arabia
[2] Univ Melbourne, Melbourne, Australia
[3] Australian Acad Sci, Canberra, Australia
关键词
Spatial simulation; stragglers; BSP; load balancing; traffic simulation;
D O I
10.1145/3578933
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Spatial simulations have been widely used to study real-world environments, such as transportation systems. Applications like prediction and analysis of transportation require the simulation to handle millions of objects while running faster than real time. Running such large-scale simulation requires high computational power, which can be provided through parallel distributed computing. Implementations of parallel distributed spatial simulations usually follow a bulk synchronous parallel (BSP) model to ensure the correctness of simulation. The processing in BSP is divided into iterations of computation and communication, running on multiple workers, followed by a global barrier synchronisation to ensure that all communications are concluded. Unfortunately, the BSP model is plagued by the straggler problem, where a delay in any worker slows down the entire simulation. Stragglers may occur for many reasons, including imbalanced workload distribution or communication and synchronisation delays. The straggler problem can become more severe with increasing parallelism and continuous change of workload distribution among workers. This article proposes methods to dynamically mitigate stragglers and tackle communication delays. The proposed strategies can rebalance the workload distribution during simulation. These methods employ the spatial properties of the simulated environments to combine a flexible synchronisation model with decentralised dynamic load balancing and on-demand resource allocation. All proposed methods are implemented and evaluated using a microscopic traffic simulator as an example of large-scale spatial simulations. We run traffic simulations for Melbourne, Beijing and New York with different straggler scenarios. Our methods significantly improve simulation performance compared to advanced methods such as global dynamic load balancing.
引用
收藏
页数:34
相关论文
共 50 条
  • [31] Large-scale simulations of clusters of galaxies
    Ricker, PM
    Calder, AC
    Dursi, LJ
    Fryxell, B
    Lamb, DQ
    MacNeice, P
    Olson, K
    Rosner, R
    Timmes, FX
    Truran, JW
    Tufo, HM
    Zingale, M
    ADVANCED COMPUTING AND ANALYSIS TECHNIQUES IN PHYSICS RESEARCH, 2001, 583 : 316 - 318
  • [32] Efficient large-scale BGP simulations
    Dimitropoulos, Xenofontas A.
    Riley, George F.
    COMPUTER NETWORKS, 2006, 50 (12) : 2013 - 2027
  • [33] Evaluating large-scale training simulations
    Simpson, H
    Oser, RL
    MILITARY PSYCHOLOGY, 2003, 15 (01) : 25 - 40
  • [34] Large-Scale Simulations of Sky Surveys
    Heitmann, Katrin
    Habib, Salman
    Finkel, Hal
    Frontiere, Nicholas
    Pope, Adrian
    Morozov, Vitali
    Rangel, Steve
    Kovacs, Eve
    Kwan, Juliana
    Li, Nan
    Rizzi, Silvio
    Insley, Joe
    Vishwanath, Venkatram
    Peterka, Tom
    Daniel, David
    Fasel, Patricia
    Zagaris, George
    COMPUTING IN SCIENCE & ENGINEERING, 2014, 16 (05) : 14 - 23
  • [35] Collaborative Learning Based Straggler Prevention in Large-Scale Distributed Computing Framework
    Deshmukh, Shyam
    Thirupathi Rao, Komati
    Shabaz, Mohammad
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [36] Large-scale disasters: prediction, control, and mitigation
    Poolman, Eugene
    SOUTH AFRICAN GEOGRAPHICAL JOURNAL, 2010, 92 (01) : 88 - 89
  • [39] Large-scale dynamic systems
    Haykin, Simon
    Moulines, Eric
    PROCEEDINGS OF THE IEEE, 2007, 95 (05) : 849 - 852
  • [40] Experimental validation of large-scale simulations of dynamic fracture along weak planes
    Chalivendra, Vijaya B.
    Hong, Soonsung
    Arias, Irene
    Knap, Jaroslaw
    Rosakis, Ares
    Ortiz, Michael
    INTERNATIONAL JOURNAL OF IMPACT ENGINEERING, 2009, 36 (07) : 888 - 898