Job failure prediction in Hadoop based on log file analysis

Cited by: 1
Authors
Shirzad E. [1]
Saadatfar H. [1, 2]
Affiliations
[1] Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
[2] Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
Keywords
cluster workload; data mining; failure prediction; Hadoop; log file; MapReduce job
DOI
10.1080/1206212X.2020.1732081
Abstract
Hadoop is a popular framework, based on the MapReduce programming model, for distributed processing of large datasets across clusters with varying numbers of computer nodes. Like any dynamic computational environment, Hadoop has its problems, one of which is the unsuccessful execution of MapReduce jobs. Job failures can cause significant resource waste, performance deterioration, and user dissatisfaction. Therefore, a proactive and predictive management approach could be very useful in Hadoop systems. In this paper, we try to predict the outcome of MapReduce jobs in the OpenCloud Hadoop cluster by using its log files. OpenCloud is a research cluster managed by CMU’s Parallel Data Lab that uses Hadoop to process big data. We first studied the log files and analyzed the relationship between job, resource, and workload characteristics and the failures in order to discover effective features for the prediction process. After recognizing the job failure patterns, we applied some popular machine learning algorithms to predict the success/failure status of jobs before they start to execute. Finally, we compared the learning methods and showed that the C5.0 algorithm had the best results, with an accuracy of 91.37%, a recall of 74.43%, and a precision of 80.31%. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
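The three figures reported in the abstract (accuracy, recall, precision) are standard binary-classification metrics. The sketch below shows how they are usually computed from true and predicted job statuses. It is an illustration only: the function name `binary_metrics`, the label strings, and the eight sample jobs are hypothetical and are not taken from the paper or the OpenCloud logs, and treating "failed" as the positive class is an assumption, since the abstract does not say which class the recall and precision refer to.

```python
def binary_metrics(y_true, y_pred, positive="failed"):
    """Return (accuracy, recall, precision) for a binary labelling.

    `positive` names the class of interest; using "failed" here is an
    assumption made for illustration, not something the abstract states.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is absent.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    return accuracy, recall, precision


# Hypothetical statuses for eight jobs (not real OpenCloud data).
actual = ["failed", "failed", "failed", "success",
          "success", "success", "success", "success"]
predicted = ["failed", "failed", "success", "failed",
             "success", "success", "success", "success"]

accuracy, recall, precision = binary_metrics(actual, predicted)
print(f"accuracy={accuracy:.2f} recall={recall:.2f} precision={precision:.2f}")
```

In this toy sample the accuracy is higher than the recall, which commonly happens when the positive class is the minority: correctly labelled majority-class jobs raise accuracy without affecting recall.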
Pages: 260-269
Number of pages: 9
Related papers
50 in total
  • [21] Performance Analysis of Hadoop-Based SQL and NoSQL for Processing Log Data
    Son, Siwoon
    Gil, Myeong-Seon
    Moon, Yang-Sae
    Won, Hee-Sun
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2015, 2015, 9052 : 293 - 299
  • [22] A Mind Map Based Framework for Automated Software Log File Analysis
    Jayathilake, Dileepa
    SOFTWARE AND COMPUTER APPLICATIONS, 2011, 9 : 1 - 6
  • [23] MapReduce based Log File Analysis for System Threats and Problem Identification
    Vernekar, Sumeet S.
    Buchade, Amar
    PROCEEDINGS OF THE 2013 3RD IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2013, : 831 - 835
  • [24] Effectiveness of Feedback Based on Log File Analysis in Introductory Programming Courses
    Meier, Heidi
    Lepp, Marina
    JOURNAL OF EDUCATIONAL COMPUTING RESEARCH, 2023, 61 (03) : 696 - 719
  • [25] Hadoop Distributed File System for Big data analysis
    Almansouri, Hatim Talal
    Masmoudi, Youssef
    PROCEEDINGS OF 2019 IEEE 4TH WORLD CONFERENCE ON COMPLEX SYSTEMS (WCCS' 19), 2019, : 257 - 261
  • [26] LOG ANALYSIS IN CLOUD COMPUTING ENVIRONMENT WITH HADOOP AND SPARK
    Lin, Xiuqin
    Wang, Peng
    Wu, Bin
    2013 5TH IEEE INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY (IC-BNMT), 2013, : 273 - 276
  • [28] LogM: Log Analysis for Multiple Components of Hadoop Platform
    Xie, Yuxia
    Yang, Kai
    Luo, Pan
    IEEE ACCESS, 2021, 9 : 73522 - 73532
  • [29] Web log file analysis: backlinks and queries
    Thelwall, M
    ASLIB PROCEEDINGS, 2001, 53 (06): : 217 - 223
  • [30] Predicting Job Failures in AuverGrid Based on Workload Log Analysis
    Saadatfar, Hamid
    Fadishei, Hamid
    Deldari, Hossein
    NEW GENERATION COMPUTING, 2012, 30 (01) : 73 - 94