Fault tolerance in big data storage and processing systems: A review on challenges and solutions

被引:16
|
作者
Saadoon, Muntadher [1 ]
Ab Hamid, Siti Hafizah [1 ]
Sofian, Hazrina [1 ]
Altarturi, Hamza H. M. [1 ]
Azizul, Zati Hakim [1 ]
Nasuha, Nur [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Software Engn, Kuala Lumpur 50603, Malaysia
关键词
Fault tolerance; Fault detection; Fault recovery; Big data storage; Big data processing; FAILURE RECOVERY; MAPREDUCE; RELIABILITY; AVAILABILITY; REPLICATION; NETWORKS; HADOOP;
D O I
10.1016/j.asej.2021.06.024
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Big data systems are sufficiently stable to store and process a massive volume of rapidly changing data. However, big data systems are composed of large-scale hardware resources that make their subspecies easily fail. Fault tolerance is the main property of such systems because it maintains availability, reliability, and constant performance during faults. Achieving an efficient fault tolerance solution in a big data system is challenging because fault tolerance must meet some constraints related to the system performance and resource consumption. This study aims to provide a consistent understanding of fault tolerance in big data systems and highlights common challenges that hinder the improvement in fault tolerance efficiency. The fault tolerance solutions applied by previous studies intended to address the identified challenges are reviewed. The paper also presents a perceptive discussion of the findings derived from previous studies and proposes a list of future directions to address the fault tolerance challenges. (C) 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Big Data Processing in the Cloud - Challenges and Platforms
    Zhelev, Svetoslav
    Rozeva, Anna
    PROCEEDINGS OF THE 43RD INTERNATIONAL CONFERENCE APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE'17), 2017, 1910
  • [32] Big Data, Big Systems, Big Challenges: A Personal Experience
    Kotov, Vadim E.
    PERSPECTIVES OF SYSTEM INFORMATICS, PSI 2014, 2015, 8974
  • [33] Pathfinder: Fault Tolerance for Stream Processing Systems
    Knasmuller, Bernhard
    Hochreiner, Christoph
    Schulte, Stefan
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 29 - 39
  • [34] On Efficient Hierarchical Storage for Big Data Processing
    Krish, K. R.
    Wadhwa, Bharti
    Iqbal, M. Safdar
    Rafique, M. Mustafa
    Butt, Ali R.
    2016 16TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING (CCGRID), 2016, : 403 - 408
  • [35] Big Data Impacts and Challenges: A Review
    Al-Sai, Zaher Ali
    Abdullah, Rosni
    Husin, Mohd Heikal
    2019 IEEE JORDAN INTERNATIONAL JOINT CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION TECHNOLOGY (JEEIT), 2019, : 150 - 155
  • [36] Challenges with Big Data Mining: A Review
    Sebastian, Libina Rose
    Babu, Sheeba
    Kizhakkethottam, Jubilant J.
    PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORKS SECURITY (ICSNS 2015), 2015,
  • [37] Big Data Challenges and Issues: A Review
    Mathur, Akanksha
    Gupta, C. P.
    PROCEEDING OF THE INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS, BIG DATA AND IOT (ICCBI-2018), 2020, 31 : 446 - 452
  • [38] A Review on Big Data Applications and their Challenges
    Prabhugouda, Amruta
    Asra, Syeda
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2024, 23 (06)
  • [39] Big Data and its Challenges: A Review
    Bagga, Simmi
    Sharma, Anil
    2018 4TH INTERNATIONAL CONFERENCE ON COMPUTING SCIENCES (ICCS), 2018, : 183 - 187
  • [40] Big Data Analysis and Visualization: Challenges and Solutions
    Yoo, Kwan-Hee
    Leung, Carson K.
    Nasridinov, Aziz
    APPLIED SCIENCES-BASEL, 2022, 12 (16):