Big Stream Processing Systems: An Experimental Evaluation

被引:9
|
作者
Shahverdi, Elkhan [1 ]
Awad, Ahmed [1 ]
Sakr, Sherif [1 ]
机构
[1] Univ Taru, Tartu, Estonia
关键词
D O I
10.1109/ICDEW.2019.00-35
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As the world gets more instrumented and connected, we are witnessing a flood of digital data generated from various hardware (e.g., sensors) or software in the format of flowing streams of data. Real-time processing for such massive amounts of streaming data is a crucial requirement in several application domains including financial markets, surveillance systems, manufacturing, smart cities, and scalable monitoring infrastructure. In the last few years, several big stream processing engines have been introduced to tackle this challenge. In this article, we present an extensive experimental study of five popular systems in this domain, namely, Apache Storm, Apache Rink, Apache Spark, Kafka Streams and Hazelcast Jet. We report and analyze the performance characteristics of these systems. In addition, we report a set of insights and important lessons that we have learned from conducting our experiments.
引用
收藏
页码:53 / 60
页数:8
相关论文
共 50 条
  • [41] Using Microservices and Event Driven Architecture for Big Data Stream Processing
    Zhelev, Svetoslav
    Rozeva, Anna
    PROCEEDINGS OF THE 45TH INTERNATIONAL CONFERENCE ON APPLICATION OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE'19), 2019, 2172
  • [42] Big Log Data Stream Processing: Adapting an Anomaly Detection Technique
    Dietz, Marietheres
    Pernul, Guenther
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 159 - 166
  • [43] An Empirical Evaluation of Intelligent Machine Learning Algorithms under Big Data Processing Systems
    Suleiman, Dima
    Al-Zewairi, Malek
    Naymat, Ghazi
    8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 : 539 - 544
  • [44] Modeling Stream Processing Applications for Dependability Evaluation
    Jacques-Silva, Gabriela
    Kalbarczyk, Zbigniew
    Gedik, Bugra
    Andrade, Henrique
    Wu, Kun-Lung
    Iyer, Ravishankar K.
    2011 IEEE/IFIP 41ST INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN), 2011, : 430 - 441
  • [45] Benchmarking Distributed Stream Data Processing Systems
    Karimov, Jeyhun
    Rabl, Tilmann
    Katsifodimos, Asterios
    Samarev, Roman
    Heiskanen, Henri
    Markl, Volker
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1507 - 1518
  • [46] Accommodating Bursts in Distributed Stream Processing Systems
    Drougas, Yannis
    Kalogeraki, Vana
    2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009, : 362 - 372
  • [47] Tracing Distributed Data Stream Processing Systems
    Zvara, Zoltan
    Szabo, Peter G. N.
    Hermann, Gabor
    Benczur, Andras
    2017 IEEE 2ND INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2017, : 235 - 242
  • [48] Enabling Deep Analytics in Stream Processing Systems
    Nikolic, Milos
    Chandramouli, Badrish
    Goldstein, Jonathan
    DATA ANALYTICS, 2017, 10365 : 94 - 98
  • [49] Beyond Analytics: The Evolution of Stream Processing Systems
    Carbone, Paris
    Fragkoulis, Marios
    Kalavri, Vasiliki
    Katsifodimos, Asterios
    SIGMOD'20: PROCEEDINGS OF THE 2020 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2020, : 2651 - 2658
  • [50] Conceptual Survey on Data Stream Processing Systems
    Hesse, Guenter
    Lorenz, Martin
    2015 IEEE 21ST INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2015, : 797 - 802