A Hybrid Approach to High Availability in Stream Processing Systems

被引:36
|
作者
Zhang, Zhe [1 ]
Gu, Yu [2 ]
Ye, Fan [3 ]
Yang, Hao [4 ]
Kim, Minkyong [3 ]
Lei, Hui [3 ]
Liu, Zhen [4 ]
机构
[1] Oak Ridge Natl Lab, Natl Ctr Computat Sci, Oak Ridge, TN 37831 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
[3] IBM T J Watson Res Ctr, Hawthorne, NY USA
[4] Nokia Res Ctr, White Plains, NY USA
关键词
ALGORITHMS;
D O I
10.1109/ICDCS.2010.81
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Stream processing is widely used by today's applications such as financial data analysis and disaster response. In distributed stream processing systems, machine fail-stop events are handled by either active standby or passive standby. However, existing high availability (HA) schemes have not sufficiently addressed the situation when a machine becomes temporarily unavailable due to data rate spikes, intensive analysis or job sharing, which happens frequently but lasts for short time. It is not clear how well active and passive standby fare against such transient unavailability. In this paper, we first critically examine the suitability of active and passive standby against transient unavailability in a real testbed environment. We find that both approaches have advantages and drawbacks, but neither is ideal to provide fast recovery at low overhead as required to handle transient unavailability. Based on the insights gained, we propose a novel hybrid HA method that switches between active and passive standby modes depending on the occurrence of failure events. It presents a desirable tradeoff that is different from existing HA approaches: low overhead during normal conditions and fast recovery upon transient or permanent failure events. We have implemented our hybrid method and compared it with existing HA designs with comprehensive evaluation. The results show that our hybrid method can reduce two-thirds of the recovery time compared to passive standby and 80% message overhead compared to active standby, allowing applications to enjoy uninterrupted processing without paying a high premium.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Big Stream Processing Systems: An Experimental Evaluation
    Shahverdi, Elkhan
    Awad, Ahmed
    Sakr, Sherif
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, : 53 - 60
  • [42] Tracing Distributed Data Stream Processing Systems
    Zvara, Zoltan
    Szabo, Peter G. N.
    Hermann, Gabor
    Benczur, Andras
    2017 IEEE 2ND INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2017, : 235 - 242
  • [43] Enabling Deep Analytics in Stream Processing Systems
    Nikolic, Milos
    Chandramouli, Badrish
    Goldstein, Jonathan
    DATA ANALYTICS, 2017, 10365 : 94 - 98
  • [44] Beyond Analytics: The Evolution of Stream Processing Systems
    Carbone, Paris
    Fragkoulis, Marios
    Kalavri, Vasiliki
    Katsifodimos, Asterios
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2651 - 2658
  • [45] Conceptual Survey on Data Stream Processing Systems
    Hesse, Guenter
    Lorenz, Martin
    2015 IEEE 21ST INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2015, : 797 - 802
  • [46] Preferential Resource Allocation in Stream Processing Systems
    Works, Karen
    Rundensteiner, Elke A.
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2014, 23 (04)
  • [47] Spotlight on high availability systems
    New Electronics, 2000, 33 (06):
  • [48] Rethinking the design of distributed stream processing systems
    Zhou, Yongluan
    Aberer, Karl
    Salehi, Ali
    Tan, Kian-Lee
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 182 - +
  • [49] Stream Processing Engines for Smart Healthcare Systems
    Khiati, Rhaed
    Hanif, Muhammed
    Lee, Choonhwa
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC), 2018, : 467 - 471
  • [50] Out-of-Order Processing: A New Architecture for High-Performance Stream Systems
    Li, Jin
    Tufte, Kristin
    Shkapenyuk, Vladislav
    Papadimos, Vassilis
    Johnson, Theodore
    Maier, David
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01): : 274 - 288