Job failure prediction in Hadoop based on log file analysis

被引:1
|
作者
Shirzad E. [1 ]
Saadatfar H. [1 ,2 ]
机构
[1] Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
[2] Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
关键词
cluster workload; data mining; failure prediction; Hadoop; log file; MapReduce job;
D O I
10.1080/1206212X.2020.1732081
中图分类号
学科分类号
摘要
Hadoop is a popular framework based on MapReduce programming model to allow for distributed processing of large datasets across clusters with various number of computer nodes. Just like any dynamic computational environment, Hadoop has some problems and one of which is unsuccessful execution of MapReduce jobs. Job failures can cause significant resource wasting, performance deterioration, and user dissatisfaction. Therefore, a proactive and predictive management approach could be very useful in Hadoop systems. In this paper, we try to predict the futurity of MapReduce jobs in OpenCloud Hadoop cluster by using its log files. OpenCloud is a research cluster managed by CMU’s Parallel Data Lab which uses Hadoop to process big data. We first tried to study the log files and analyze the relationship between the jobs, resources, and workload characteristics and the failures in order to discover the effective features for the prediction process. After recognizing the job failure patterns, some popular machine learning algorithms are deployed to predict the success/failure status of the jobs before they start to execute. Eventually, we compared the learning methods and showed that the C5.0 algorithm had the best results with an accuracy of 91.37%, a recall of 74.43%, and a precision of 80.31%. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:260 / 269
页数:9
相关论文
共 50 条
  • [1] Multivariate log file analysis for MLC failure prediction
    Wojtasik, A.
    Bolt, M.
    Clark, C. H.
    Nisbet, A.
    Chen, T.
    RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S355 - S356
  • [2] An Approach for Log Analysis Based Failure Monitoring in Hadoop Cluster
    Mohandas, Madhury
    Dhanya, P. M.
    2013 INTERNATIONAL CONFERENCE ON GREEN COMPUTING, COMMUNICATION AND CONSERVATION OF ENERGY (ICGCE), 2013, : 861 - 867
  • [3] Performance evaluation of cloud-based log file analysis with Apache Hadoop and Apache Spark
    Mavridis, Ilias
    Karatza, Helen
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 125 : 133 - 151
  • [4] Big Data: Mining of Log File through Hadoop
    Kotiyal, Bina
    Kumar, Ankit
    Pant, Bhaskar
    Goudar, R. H.
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,
  • [5] Web Log Analysis Based On Hadoop Technology
    Zhao Yongjian
    2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 587 - 590
  • [6] A Hadoop based Web log Analysis System
    Wang, Chen-Hau
    Tsai, Ching-Tsorng
    Fan, Chia-Chen
    Yuan, Shyan-Ming
    2014 7TH INTERNATIONAL CONFERENCE ON UBI-MEDIA COMPUTING AND WORKSHOPS (UMEDIA), 2014, : 72 - 77
  • [7] Design of a Log Analysis System Based on Hadoop
    Li, Jingmin
    INDUSTRIAL ENGINEERING, MACHINE DESIGN AND AUTOMATION (IEMDA 2014) & COMPUTER SCIENCE AND APPLICATION (CCSA 2014), 2015, : 249 - 256
  • [8] Multivariate log file analysis for multi-leaf collimator failure prediction in radiotherapy delivery
    Wojtasik, Arkadiusz Mariusz
    Bolt, Matthew
    Clark, Catharine H.
    Nisbet, Andrew
    Chen, Tao
    PHYSICS & IMAGING IN RADIATION ONCOLOGY, 2020, 15 : 72 - 76
  • [9] Job-Aware Optimization of File Placement in Hadoop
    Nakagami, Makoto
    Fortes, Jose A. B.
    Yamaguchi, Saneyasu
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, : 664 - 669
  • [10] An approach for MapReduce based Log analysis using Hadoop
    Hingave, Hemant
    Ingle, Rasika
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 1264 - 1268