Priority-based Resource Scheduling in Distributed Stream Processing Systems for Big Data Applications

被引:0
|
作者
Bellavista, Paolo [1 ]
Corradi, Antonio [1 ]
Reale, Andrea [1 ]
Ticca, Nicola [1 ]
机构
[1] Univ Bologna, Dept Comp Sci & Engn, Bologna, Italy
关键词
Distributed Stream Processing; Big Data; Priority-based Resource Scheduling; Application-level and Application-specific Scheduling; Cloud Computing Optimization; Vehicular Traffic Analysis;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Distributed Stream Processing Systems (DSPSs) are attracting increasing industrial and academic interest as flexible tools to implement scalable and cost-effective on-line analytics applications over Big Data streams. Often hosted in private/public cloud deployment environments, DSPSs offer datastream processing services that transparently exploit the distributed computing resources made available to them at runtime. Given the volume of data of interest, possible (hard/soft) real-time processing requirements, and the time-variable characteristics of input datastreams, it is very important for DSPSs to use smart and innovative scheduling techniques that allocate computing resources properly and avoid static over-provisioning. In this paper, we originally investigate the suitability of exploiting application-level indications about differentiated priorities of different stream processing tasks to enable application-specific DSPS resource scheduling, e.g., capable of re-shaping processing resources in order to dynamically follow input data peaks of prioritized tasks, with no static over-provisioning. We originally propose a general and simple technique to design and implement priority-based resource scheduling in flow-graph-based DSPSs, by allowing application developers to augment DSPS graphs with priority metadata and by introducing an extensible set of priority schemas to be automatically handled by the extended DSPS. In addition, we show the effectiveness of our approach via its implementation and integration in our Quasit DSPS and through experimental evaluation of this prototype on a real-world stream processing application of Big Data vehicular traffic analysis.
引用
收藏
页码:363 / 370
页数:8
相关论文
共 50 条
  • [21] Congestion games with priority-based scheduling
    Bilo, Vittorio
    Vinci, Cosimo
    THEORETICAL COMPUTER SCIENCE, 2023, 974
  • [22] Congestion Games with Priority-Based Scheduling
    Bilo, Vittorio
    Vinci, Cosimo
    ALGORITHMIC GAME THEORY, SAGT 2020, 2020, 12283 : 67 - 82
  • [23] DSPBench: A Suite of Benchmark Applications for Distributed Data Stream Processing Systems
    Bordin, Maycon Viana
    Griebler, Dalvan
    Mencagli, Gabriele
    Geyer, Claudio F. R.
    Fernandes, Luiz Gustavo L.
    IEEE ACCESS, 2020, 8 : 222900 - 222917
  • [24] An Adaptive Priority-Based Heuristic Approach for Scheduling DAG Applications with Uncertainties
    Zheng, Wei
    Zhang, Xinbo
    Tang, Lu
    Zhang, Dongzhan
    Chen, Jinjun
    2017 15TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS AND 2017 16TH IEEE INTERNATIONAL CONFERENCE ON UBIQUITOUS COMPUTING AND COMMUNICATIONS (ISPA/IUCC 2017), 2017, : 72 - 79
  • [25] Poster: Iterative Scheduling for Distributed Stream Processing Systems
    Eskandari, Leila
    Mair, Jason
    Huang, Zhiyi
    Eyers, David
    DEBS'18: PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2018, : 234 - 237
  • [26] Scheduling of big data applications on distributed cloud based on QoS parameters
    Rajinder Sandhu
    Sandeep K. Sood
    Cluster Computing, 2015, 18 : 817 - 828
  • [27] Scheduling of big data applications on distributed cloud based on QoS parameters
    Sandhu, Rajinder
    Sood, Sandeep K.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (02): : 817 - 828
  • [28] Resource allocation and aging priority-based scheduling of linear workflow applications with transient failures and selective imprecise computations
    Karatza, Helen D.
    Stavrinides, Georgios L.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (04): : 5473 - 5488
  • [29] Reliable stream data processing for elastic distributed stream processing systems
    Xiaohui Wei
    Yuan Zhuang
    Hongliang Li
    Zhiliang Liu
    Cluster Computing, 2020, 23 : 555 - 574
  • [30] Priority-based Application Level Multicast for Distributed Interactive Applications based on Distance
    Chen, Ren
    Li, Gui Sen
    Dai, Bin
    Zhu, Shun-Zhi
    2017 IEEE 8TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2017, : 410 - 415