Predicting Job Failures in AuverGrid Based on Workload Log Analysis

被引:9
|
作者
Saadatfar, Hamid [1 ]
Fadishei, Hamid [1 ]
Deldari, Hossein [1 ]
机构
[1] Ferdowsi Univ Mashhad, Parallel & Distributed Proc Lab, Dept Comp Engn, Mashhad, Iran
关键词
Job Failure Prediction; Grid Workload Archive; Trace Analysis; Bayesian Networks;
D O I
10.1007/s00354-012-0105-z
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Grid systems are popular today due to their ability to solve large problems in business and science. Job failures which are inherent in any computational environment are more common in grids due to their dynamic and complex nature. Furthermore, traditional methods for job failure recovery have proven costly and thus a need to shift toward proactive and predictive management strategies is necessary in such systems. In this paper, an innovative effort has been made to predict the futurity of jobs in a production grid environment. First of all, we investigated the relationship between workload characteristics and job failures by analyzing workload traces of AuverGrid which is a part of EGEE (Enabling Grids for E-science) project. After the recognition of failure patterns, the success or failure status of jobs during 6 months of AuverGrid activity was predicted with approximately 96% accuracy. The quality of services on the grid can be improved by integrating the result of this work into management services like scheduling and monitoring.
引用
收藏
页码:73 / 94
页数:22
相关论文
共 50 条
  • [41] MTM-based ergonomic workload analysis
    Laring, J
    Forsman, M
    Kadefors, R
    Örtengren, R
    INTERNATIONAL JOURNAL OF INDUSTRIAL ERGONOMICS, 2002, 30 (03) : 135 - 148
  • [42] Mental Workload Detection Based on EEG Analysis
    Yauri, Jose
    Hernandez-Sabate, Aura
    Folch, Pau
    Gil, Debora
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2021, 339 : 268 - 277
  • [43] Predicting Disk Failures with HMM- and HSMM-Based Approaches
    Zhao, Ying
    Liu, Xiang
    Gan, Siqing
    Zheng, Weimin
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, 2010, 6171 : 390 - +
  • [44] Predicting nursing workload in digestive wards based on machine learning: A prospective study
    Song, Yulei
    Zhang, Xueqing
    Luo, Dan
    Shi, Jiarui
    Zang, Qiongqiong
    Wang, Ye
    Yin, Haiyan
    Xu, Guihua
    Bai, Yamei
    BMC NURSING, 2024, 23 (01):
  • [45] A Deep Learning-Based Algorithm for Predicting the Turning Point of Cloud Workload
    Jain, Anmol
    Panda, Sanjaya Kumar
    IFIP Advances in Information and Communication Technology, 723 IFIP : 276 - 287
  • [46] Predicting Nurses Mental health based on Job Satisfaction Components
    Nik-eghba, Nastaran
    Otaghi, Masoumeh
    Azadi, Arman
    Sayehmiri, Kourosh
    ASEAN JOURNAL OF PSYCHIATRY, 2023, 24 (07):
  • [47] Predicting job failure in cloud cluster: based on SVM classification
    Liu C.-H.
    Han J.-J.
    Shang Y.-L.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2016, 39 (05): : 104 - 109
  • [48] Predicting Job Stress Based on Elements of Coping Styles in Nurses
    Sedeh, Mansoureh Nezari
    Tajeri, Biuok
    INTERNATIONAL JOURNAL OF MEDICAL RESEARCH & HEALTH SCIENCES, 2016, 5 (07): : 608 - 612
  • [49] A random forest-based job shop rescheduling decision model with machine failures
    Meng Zhao
    Liang Gao
    Xinyu Li
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 3323 - 3333
  • [50] A random forest-based job shop rescheduling decision model with machine failures
    Zhao, Meng
    Gao, Liang
    Li, Xinyu
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 13 (7) : 3323 - 3333