Automatic Debugging of Design Faults in MapReduce Applications

Cited by: 0
Authors
Moran, Jesus [1]
Bertolino, Antonia [2]
de la Riva, Claudio [1]
Tuya, Javier [1]
Affiliations
[1] Univ Oviedo, Comp Sci Dept, Oviedo 33203, Spain
[2] CNR, ISTI, I-56124 Pisa, Italy
Keywords
Debugging aids; testing and debugging; localization
DOI
10.1109/TSE.2024.3369766
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Among current technologies for analysing large data sets, the MapReduce processing model stands out in Big Data. MapReduce is implemented in frameworks such as Hadoop, Spark and Flink, which manage program executions according to the resources available at runtime. The developer should design the program so that it supports all possible non-deterministic executions. However, the program may fail due to a design fault. Debugging these kinds of faults is difficult because the data are processed non-deterministically in parallel and the fault is caused not directly by the code but by its design. This paper presents a framework called MRDebug, which includes two debugging techniques focused on MapReduce design faults. A spectrum-based fault localization technique locates the root cause of these faults by analysing several executions of the test case, and a Delta Debugging technique isolates the data relevant to triggering the failure. An empirical evaluation with 13 programs shows that MRDebug is effective in debugging the faults, especially when localization is done with the reduced data. In summary, MRDebug automatically provides valuable information for understanding MapReduce design faults: it helps locate their root cause and obtains minimal data that trigger the failure.
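To illustrate the Delta Debugging idea mentioned in the abstract, the following is a minimal sketch of a generic ddmin-style input reduction in Python. It is not MRDebug's implementation: the function `ddmin`, the toy fault in `average_of_partial_averages` (an average computed from per-partition averages, a typical example of a computation that breaks when data are split differently), the predicate `fails`, and the sample `DATA` are all hypothetical names and values chosen for illustration.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def ddmin(data: Sequence[T], fails: Callable[[List[T]], bool]) -> List[T]:
    """Shrink `data` to a failing sublist from which no single item can be removed."""
    current = list(data)
    assert fails(current), "the full input must trigger the failure"
    n = 2  # current granularity: roughly how many chunks to split into
    while len(current) >= 2:
        chunk = max(1, len(current) // n)
        subsets = [current[i:i + chunk] for i in range(0, len(current), chunk)]
        reduced = False
        for i, subset in enumerate(subsets):
            complement = [x for j, s in enumerate(subsets) if j != i for x in s]
            if fails(subset):
                # One chunk alone reproduces the failure: keep only that chunk.
                current, n, reduced = subset, 2, True
                break
            if fails(complement):
                # The failure survives without this chunk: drop the chunk.
                current, n, reduced = complement, max(n - 1, 2), True
                break
        if not reduced:
            if n >= len(current):
                break  # already at single-item granularity, nothing removable
            n = min(len(current), n * 2)  # retry with finer chunks
    return current


def average_of_partial_averages(values: List[float]) -> float:
    """Toy faulty design (assumption): average the averages of two partitions."""
    mid = len(values) // 2
    parts = [p for p in (values[:mid], values[mid:]) if p]
    partials = [sum(p) / len(p) for p in parts]
    return sum(partials) / len(partials)


def fails(values: List[float]) -> bool:
    """The 'test' fails when the partitioned result differs from the true average."""
    if not values:
        return False
    expected = sum(values) / len(values)
    return abs(average_of_partial_averages(values) - expected) > 1e-9


DATA = [4, 4, 4, 10, 4, 4, 1]  # hypothetical input records
minimal = ddmin(DATA, fails)
print(minimal)
```

The reduced input is easier to inspect than the full data set, which is the motivation the abstract gives for localizing the fault on reduced data. In this sketch the partitioning is fixed and deterministic; a real MapReduce run may split the data differently on each execution.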
Pages: 956-978
Number of pages: 23
Related papers
50 in total
  • [1] Automatic Testing of Design Faults in MapReduce Applications
    Moran, Jesus
    Bertolino, Antonia
    de la Riva, Claudio
    Tuya, Javier
    IEEE TRANSACTIONS ON RELIABILITY, 2018, 67 (03) : 717 - 732
  • [2] PerfXplain: Debugging MapReduce Job Performance
    Khoussainova, Nodira
    Balazinska, Magdalena
    Suciu, Dan
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (07): : 598 - 609
  • [3] Simultaneous debugging of software faults
    Abreu, Rui
    Zoeteweij, Peter
    van Gemund, Arjan J. C.
    JOURNAL OF SYSTEMS AND SOFTWARE, 2011, 84 (04) : 573 - 586
  • [4] USING AUTOMATIC PROCESS CLUSTERING FOR DESIGN RECOVERY AND DISTRIBUTED DEBUGGING
    KUNZ, T
    BLACK, JP
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1995, 21 (06) : 515 - 527
  • [5] Model transformations of MapReduce Design Patterns for automatic development and verification
    Amato, Flora
    Moscato, Francesco
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 110 : 52 - 59
  • [6] A framework for automatic debugging
    Auguston, M
    Jeffery, C
    Underwood, S
    ASE 2002: 17TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, 2002, : 217 - 222
  • [7] Automatic detection and correction of programming faults for software applications
    Deeprasertkul, P
    Bhattarakosol, P
    O'Brien, F
    JOURNAL OF SYSTEMS AND SOFTWARE, 2005, 78 (02) : 101 - 110
  • [8] MRWorkflow: Design and Implementation of a Workflow System for MapReduce Applications
    Kim, Shin-gyu
    Han, Hyuck
    Yeom, Heon Y.
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (04): : 1363 - 1371
  • [9] Event-aware precise dynamic slicing for automatic debugging of Android applications
    Win, Hsu Myat
    Tan, Shin Hwei
    Sui, Yulei
    JOURNAL OF SYSTEMS AND SOFTWARE, 2023, 198
  • [10] Automatic Optimization for MapReduce Programs
    Jahani, Eaman
    Cafarella, Michael J.
    Re, Christopher
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (06): : 385 - 396