Job failure prediction in Hadoop based on log file analysis

被引:1
|
作者
Shirzad E. [1 ]
Saadatfar H. [1 ,2 ]
机构
[1] Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
[2] Department of Computer Engineering, Faculty of Electrical and Computer Engineering, University of Birjand, Birjand
关键词
cluster workload; data mining; failure prediction; Hadoop; log file; MapReduce job;
D O I
10.1080/1206212X.2020.1732081
中图分类号
学科分类号
摘要
Hadoop is a popular framework based on MapReduce programming model to allow for distributed processing of large datasets across clusters with various number of computer nodes. Just like any dynamic computational environment, Hadoop has some problems and one of which is unsuccessful execution of MapReduce jobs. Job failures can cause significant resource wasting, performance deterioration, and user dissatisfaction. Therefore, a proactive and predictive management approach could be very useful in Hadoop systems. In this paper, we try to predict the futurity of MapReduce jobs in OpenCloud Hadoop cluster by using its log files. OpenCloud is a research cluster managed by CMU’s Parallel Data Lab which uses Hadoop to process big data. We first tried to study the log files and analyze the relationship between the jobs, resources, and workload characteristics and the failures in order to discover the effective features for the prediction process. After recognizing the job failure patterns, some popular machine learning algorithms are deployed to predict the success/failure status of the jobs before they start to execute. Eventually, we compared the learning methods and showed that the C5.0 algorithm had the best results with an accuracy of 91.37%, a recall of 74.43%, and a precision of 80.31%. © 2020 Informa UK Limited, trading as Taylor & Francis Group.
引用
收藏
页码:260 / 269
页数:9
相关论文
共 50 条
  • [31] Automatic Linac Log File: Analysis and Reporting
    Gainey, M.
    Rothe, T.
    MEDICAL PHYSICS, 2015, 42 (06) : 3364 - 3364
  • [32] MRI log file analysis for workflow improvement
    Petroianu, Larissa P. G.
    Li, Lun
    Mieloszyk, Rebecca J.
    Mastrangelo, Christina M.
    Stapleton, Shawn
    Hall, Christopher
    CURRENT PROBLEMS IN DIAGNOSTIC RADIOLOGY, 2024, 53 (02) : 192 - 200
  • [33] Web log file analysis: Backlinks and queries
    Thelwall, Mike
    2001, JAI Press (53):
  • [34] Naive Bayesian Filters for Log File Analysis
    Havens, Russel W.
    Lunt, Bary
    Teng, Chia-Chi
    2012 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2012, : 627 - 630
  • [35] A Method of Log File Analysis for Test Oracle
    Tu, Dan
    Chen, Rong
    Du, Zhenjun
    Liu, Yaqing
    2009 INTERNATIONAL CONFERENCE ON SCALABLE COMPUTING AND COMMUNICATIONS & EIGHTH INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTING, 2009, : 351 - 354
  • [36] Web Log Mining Algorithm Based on Hadoop and MongoDB
    Xiao Feng
    Xie Jian
    Rong Huigui
    Huo Shengxu
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL AND INFORMATION SCIENCES (ICCIS 2014), 2014, : 246 - 253
  • [37] Metrics for Outpatient Portal Use Based on Log File Analysis: Algorithm Development
    Di Tosto, Gennaro
    McAlearney, Ann Scheck
    Fareed, Naleef
    Huerta, Timothy R.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2020, 22 (06)
  • [38] BUSINESS MANAGEMENT RESULTS OF SOFTWARE CONTROLLABLE WEBSITE BASED ON LOG FILE ANALYSIS
    Simunic, Mislav
    TOURISM AND HOSPITALITY MANAGEMENT-CROATIA, 2005, 11 (01): : 207 - 222
  • [39] Job-Aware File-Storage Optimization for Improved Hadoop I/O Performance
    Nakagami, Makoto
    Fortes, Jose A. B.
    Yamaguchi, Saneyasu
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (10) : 2083 - 2093
  • [40] Complete Data Deletion Based on Hadoop Distributed File System
    Wang, Fulin
    Wu, Shunxiang
    Cai, Jianhuai
    Zhao, Longze
    Liao, Zhendong
    Ming, Daodong
    PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,