A Hybrid Approach to High Availability in Stream Processing Systems

被引:36
|
作者
Zhang, Zhe [1 ]
Gu, Yu [2 ]
Ye, Fan [3 ]
Yang, Hao [4 ]
Kim, Minkyong [3 ]
Lei, Hui [3 ]
Liu, Zhen [4 ]
机构
[1] Oak Ridge Natl Lab, Natl Ctr Computat Sci, Oak Ridge, TN 37831 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
[3] IBM T J Watson Res Ctr, Hawthorne, NY USA
[4] Nokia Res Ctr, White Plains, NY USA
关键词
ALGORITHMS;
D O I
10.1109/ICDCS.2010.81
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Stream processing is widely used by today's applications such as financial data analysis and disaster response. In distributed stream processing systems, machine fail-stop events are handled by either active standby or passive standby. However, existing high availability (HA) schemes have not sufficiently addressed the situation when a machine becomes temporarily unavailable due to data rate spikes, intensive analysis or job sharing, which happens frequently but lasts for short time. It is not clear how well active and passive standby fare against such transient unavailability. In this paper, we first critically examine the suitability of active and passive standby against transient unavailability in a real testbed environment. We find that both approaches have advantages and drawbacks, but neither is ideal to provide fast recovery at low overhead as required to handle transient unavailability. Based on the insights gained, we propose a novel hybrid HA method that switches between active and passive standby modes depending on the occurrence of failure events. It presents a desirable tradeoff that is different from existing HA approaches: low overhead during normal conditions and fast recovery upon transient or permanent failure events. We have implemented our hybrid method and compared it with existing HA designs with comprehensive evaluation. The results show that our hybrid method can reduce two-thirds of the recovery time compared to passive standby and 80% message overhead compared to active standby, allowing applications to enjoy uninterrupted processing without paying a high premium.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] An Efficient Approach for Storage of Big Data Streams in Distributed Stream Processing Systems
    Alshamrani, Sultan
    Waseem, Quadri
    Alharbi, Abdullah
    Alosaimi, Wael
    Turabieh, Hamza
    Alyami, Hashem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 91 - 98
  • [22] A Stochastic Optimization Approach to Hybrid Processing in Massive MIMO Systems
    Papageorgiou, Georgios K.
    Sellathurai, Mathini
    Ntougias, Konstantinos
    Papadias, Constantinos B.
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2020, 9 (06) : 770 - 773
  • [23] A survey on the evolution of stream processing systems
    Fragkoulis, Marios
    Carbone, Paris
    Kalavri, Vasiliki
    Katsifodimos, Asterios
    VLDB JOURNAL, 2024, 33 (02): : 507 - 541
  • [24] Signal processing challenges in distributed stream processing systems
    Frossard, Pascal
    Verscheure, Olivier
    Venkatramani, Chitra
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5903 - 5906
  • [25] Towards high performance and high availability clusters of archived stream
    Du, Kai
    Wang, Huaimin
    Yang, Shuqiang
    Deng, Bo
    ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2007, 4505 : 876 - +
  • [26] High Availability Framework and Query Fault Tolerance for Hybrid Distributed Database Systems
    Pasupuleti, Krishna Kantikiran
    Klots, Boris
    Nagarajan, Vijayakrishnan
    Kandukuri, Ananthakiran
    Agarwal, Nipun
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3451 - 3460
  • [27] Incremental Checkpointing for Fault-Tolerant Stream Processing Systems: A Data Structure Approach
    Lin, Chia-Yu
    Wang, Li-Chun
    Chang, Shu-Ping
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (01) : 124 - 136
  • [28] Hybrid Precoding with Data Stream Adaptation for High Throughput mmWave MIMO Systems
    Zhou, Liang
    Ohashi, Yoji
    2016 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, 2016,
  • [29] PROCESSING OF LEAF LITTER BY A STREAM DETRITIVORE - EFFECT ON NUTRIENT AVAILABILITY TO COLLECTORS
    SHORT, RA
    MASLIN, PE
    ECOLOGY, 1977, 58 (04) : 935 - 938
  • [30] Efficient high availability commit processing
    Kolltveit, Heine
    Hvasshovd, Svein-Olaf
    ARES 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON AVAILABILITY, SECURITY AND RELIABILITY, 2008, : 64 - 71