Duality-Based Locality-Aware Stream Partitioning in Distributed Stream Processing Engines

被引:0
|
作者
Son, Siwoon [1 ]
Moon, Yang-Sae [1 ]
机构
[1] Kangwon Natl Univ, Chunchon, South Korea
关键词
Distributed processing; Data stream; Locality; Duality;
D O I
10.1007/978-3-030-48340-1_57
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose duality-based locality-aware stream partitioning (LSP) in distributed stream processing engines (DSPEs). In general, LSP directly uses the locality concept of distributed batch processing engines (DBPEs). This concept does not fully take into account the characteristics of DSPEs and therefore does not maximize cluster resource utilization. To solve this problem, we first explain the limitations of existing LSP, and we then propose a duality relationship between DBPEs and DSPEs. We finally propose a simple but efficient ping-based mechanism to maximize the locality of DSPEs based on the duality. The insights uncovered in this paper can maximize the throughput and minimize the latency in stream partitioning.
引用
收藏
页码:725 / 730
页数:6
相关论文
共 50 条
  • [21] Context-Aware Stream Processing for Distributed IoT Applications
    Akbar, Adnan
    Carrez, Francois
    Moessner, Klaus
    Sancho, Juan
    Rico, Juan
    2015 IEEE 2ND WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2015, : 663 - 668
  • [22] High Performance Stream Query Processing With Correlation-Aware Partitioning
    Cao, Lei
    Rundensteiner, Elke A.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 7 (04): : 265 - 276
  • [23] Stream-aware indexing for distributed inequality join processing
    Aslam, Adeel
    Simonini, Giovanni
    Gagliardelli, Luca
    Zecchini, Luca
    Bergamaschi, Sonia
    INFORMATION SYSTEMS, 2024, 125
  • [24] Network-Aware Grouping in Distributed Stream Processing Systems
    Chen, Fei
    Wu, Song
    Jin, Hai
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 3 - 18
  • [25] Theodolite: Scalability Benchmarking of Distributed Stream Processing Engines in Microservice Architectures
    Henning, Soeren
    Hasselbring, Wilhelm
    BIG DATA RESEARCH, 2021, 25
  • [26] Stochastic distributed data stream partitioning using task locality: design, implementation, and optimization
    Son, Siwoon
    Im, Hyeonseung
    Moon, Yang-Sae
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (10): : 11353 - 11389
  • [27] Stochastic distributed data stream partitioning using task locality: design, implementation, and optimization
    Siwoon Son
    Hyeonseung Im
    Yang-Sae Moon
    The Journal of Supercomputing, 2021, 77 : 11353 - 11389
  • [28] Query-Centric Failure Recovery for Distributed Stream Processing Engines
    Su, Li
    Zhou, Yongluan
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2018, : 1276 - 1279
  • [29] Locality-aware fountain codes for massive distributed storage systems
    Okpotse, Toritseju
    Yousefi, Shahram
    2015 IEEE 14TH CANADIAN WORKSHOP ON INFORMATION THEORY (CWIT), 2015, : 18 - 21
  • [30] A Stream Partitioning Approach to Processing Large Scale Distributed Graph Datasets
    Wang, Rui
    Chiu, Kenneth
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,