Making Speculative Scheduling Robust to Incomplete Data

被引:3
|
作者
Gainaru, Ana [1 ]
Pallez , Guillaume [2 ,3 ]
机构
[1] Vanderbilt Univ, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Univ Bordeaux, INRIA, Talence, France
[3] Univ Bordeaux, Labri, Talence, France
关键词
HPC scheduling; stochastic applications; performance modeling; discrete and continuous estimators;
D O I
10.1109/ScalA49573.2019.00013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this work, we study the robustness of Speculative Scheduling to data incompleteness. Speculative scheduling has allowed to incorporate future types of applications into the design of HPC schedulers, specifically applications whose runtime is not perfectly known but can be modeled with probability distributions. Preliminary studies show the importance of speculative scheduling in dealing with stochastic applications when the application runtime model is completely known. In this work we show how one can extract enough information even from incomplete behavioral data for a given HPC applications so that speculative scheduling still performs well. Specifically, we show that for synthetic runtimes who follow usual probability distributions such as truncated normal or exponential, we can extract enough data from as little as 10 previous runs, to be within 5% of the solution which has exact information. For real traces of applications, the performance with 10 data points varies with the applications (within 20% of the full-knowledge solution), but converges fast (5% with 100 previous samples). Finally a side effect of this study is to show the importance of the theoretical results obtained on continuous probability distributions for speculative scheduling. Indeed, we observe that the solutions for such distributions are more robust to incomplete data than the solutions for discrete distributions.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [31] A robust classification of galaxy spectra: Dealing with noisy and incomplete data
    Connolly, AJ
    Szalay, AS
    ASTRONOMICAL JOURNAL, 1999, 117 (05): : 2052 - 2062
  • [32] Speculative computation by abduction under incomplete communication environments
    Satoh, K
    Inoue, K
    Iwanuma, K
    Sakama, C
    FOURTH INTERNATIONAL CONFERENCE ON MULTIAGENT SYSTEMS, PROCEEDINGS, 2000, : 263 - 270
  • [33] Be Stable and Fair: Robust Data Scheduling for Vehicular Networks
    Wu, Libing
    Xia, Youhua
    Wang, Zhibo
    Wang, Hao
    IEEE ACCESS, 2018, 6 : 32839 - 32849
  • [34] Data-driven robust flexible personnel scheduling
    Wang, Zilu
    Luo, Zhixing
    Shen, Huaxiao
    COMPUTERS & OPERATIONS RESEARCH, 2025, 176
  • [35] Robust Task Scheduling Strategy for Big Data Clusters
    Wang, Zixiang
    Liu, Zhoubin
    Huan, Zhan
    Kong, Xiaoyun
    Yuan, Xiaolu
    2017 3RD INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING AND COMMUNICATIONS (BIGCOM), 2017, : 305 - 312
  • [36] Code Transformations Based on Speculative SDC Scheduling
    Lattuada, Marco
    Ferrandi, Fabrizio
    2015 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2015, : 71 - 77
  • [37] Robust clustering via mixtures of t factor analyzers with incomplete data
    Wan-Lun Wang
    Tsung-I Lin
    Advances in Data Analysis and Classification, 2022, 16 : 659 - 690
  • [38] Doubly robust pseudo-likelihood for incomplete hierarchical binary data
    Hermans, Lisa
    Ivanova, Anna
    Sotto, Cristina
    Molenberghs, Geert
    Verbeke, Geert
    Kenward, Michael G.
    STATISTICAL MODELLING, 2020, 20 (01) : 42 - 57
  • [39] Towards intelligent robust detection of anatomical structures in incomplete volumetric data
    Ghesu, Florin C.
    Georgescu, Bogdan
    Grbic, Sasa
    Maier, Andreas
    Hornegger, Joachim
    Comaniciu, Dorin
    MEDICAL IMAGE ANALYSIS, 2018, 48 : 203 - 213
  • [40] A unified framework of multiply robust estimation approaches for handling incomplete data
    Chen, Sixia
    Haziza, David
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2023, 179