Making Speculative Scheduling Robust to Incomplete Data

被引:3
|
作者
Gainaru, Ana [1 ]
Pallez , Guillaume [2 ,3 ]
机构
[1] Vanderbilt Univ, 221 Kirkland Hall, Nashville, TN 37235 USA
[2] Univ Bordeaux, INRIA, Talence, France
[3] Univ Bordeaux, Labri, Talence, France
关键词
HPC scheduling; stochastic applications; performance modeling; discrete and continuous estimators;
D O I
10.1109/ScalA49573.2019.00013
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this work, we study the robustness of Speculative Scheduling to data incompleteness. Speculative scheduling has allowed to incorporate future types of applications into the design of HPC schedulers, specifically applications whose runtime is not perfectly known but can be modeled with probability distributions. Preliminary studies show the importance of speculative scheduling in dealing with stochastic applications when the application runtime model is completely known. In this work we show how one can extract enough information even from incomplete behavioral data for a given HPC applications so that speculative scheduling still performs well. Specifically, we show that for synthetic runtimes who follow usual probability distributions such as truncated normal or exponential, we can extract enough data from as little as 10 previous runs, to be within 5% of the solution which has exact information. For real traces of applications, the performance with 10 data points varies with the applications (within 20% of the full-knowledge solution), but converges fast (5% with 100 previous samples). Finally a side effect of this study is to show the importance of the theoretical results obtained on continuous probability distributions for speculative scheduling. Indeed, we observe that the solutions for such distributions are more robust to incomplete data than the solutions for discrete distributions.
引用
收藏
页码:62 / 71
页数:10
相关论文
共 50 条
  • [1] Robust PARAFAC for incomplete data
    Hubert, Mia
    Van Kerckhoven, Johan
    Verdonck, Tim
    JOURNAL OF CHEMOMETRICS, 2012, 26 (06) : 290 - 298
  • [2] SpRay: Speculative Ray Scheduling for Large Data Visualization
    Park, Hyungman
    Fussell, Donald
    Navratil, Paul
    2018 IEEE 8TH SYMPOSIUM ON LARGE DATA ANALYSIS AND VISUALIZATION (LDAV), 2018, : 77 - 86
  • [3] Robust Bayesian Classification with Incomplete Data
    Xunan Zhang
    Shiji Song
    Cheng Wu
    Cognitive Computation, 2013, 5 : 170 - 187
  • [4] Bayesian Robust PCA for Incomplete Data
    Luttinen, Jaakko
    Ilin, Alexander
    Karhunen, Julia
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 66 - 73
  • [5] Robust Feature Selection on Incomplete Data
    Zheng, Wei
    Zhu, Xiaofeng
    Zhu, Yonghua
    Zhang, Shichao
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3191 - 3197
  • [6] On robust linear regression with incomplete data
    Atkinson, AC
    Cheng, TC
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2000, 33 (04) : 361 - 380
  • [7] Bayesian Robust PCA of Incomplete Data
    Luttinen, Jaakko
    Ilin, Alexander
    Karhunen, Juha
    NEURAL PROCESSING LETTERS, 2012, 36 (02) : 189 - 202
  • [8] Bayesian Robust PCA of Incomplete Data
    Jaakko Luttinen
    Alexander Ilin
    Juha Karhunen
    Neural Processing Letters, 2012, 36 : 189 - 202
  • [9] Robust Bayesian Classification with Incomplete Data
    Zhang, Xunan
    Song, Shiji
    Wu, Cheng
    COGNITIVE COMPUTATION, 2013, 5 (02) : 170 - 187
  • [10] Decision making using incomplete data
    Hewett, R
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 182 - 187