Real-time processing of streaming big data

被引:0
|
作者
Ali A. Safaei
机构
[1] Tarbiat Modares University,Department of Medical Informatics, Faculty of Medical Sciences
来源
Real-Time Systems | 2017年 / 53卷
关键词
Streaming big data; Hybrid multiprocessor real-time scheduling; Clustering; Deadline-aware dispatching; Periodic continuous queries;
D O I
暂无
中图分类号
学科分类号
摘要
In the era of data explosion, high volume of various data is generated rapidly at each moment of time; and if not processed, the profits of their latent information would be missed. This is the main current challenge of most enterprises and Internet mega-companies (also known as the big data problem). Big data is composed of three dimensions: Volume, Variety, and Velocity. The velocity refers to the high speed, both in data arrival rate (e.g., streaming data) and in data processing (i.e., real-time processing). In this paper, the velocity dimension of big data is concerned; so, real-time processing of streaming big data is addressed in detail. For each real-time system, to be fast is inevitable and a necessary condition (although it is not sufficient and some other concerns e.g., real-time scheduling must be issued, too). Fast processing is achieved by parallelism via the proposed deadline-aware dispatching method. For the other prerequisite of real-time processing (i.e., real-time scheduling of the tasks), a hybrid clustering multiprocessor real-time scheduling algorithm is proposed in which both the partitioning and global real-time scheduling approaches are employed to have better schedulablity and resource utilization, with a tolerable overhead. The other components required for real-time processing of streaming big data are also designed and proposed as real time streaming big data (RT-SBD) processing engine. Its prototype is implemented and experimentally evaluated and compared with the Storm, a well-known real-time streaming big data processing engine. Experimental results show that the proposed RT-SBD significantly outperforms the Storm engine in terms of proportional deadline miss ratio, tuple latency and system throughput.
引用
收藏
页码:1 / 44
页数:43
相关论文
共 50 条
  • [31] A spark-based big data analysis framework for real-time sentiment prediction on streaming data
    Kilinc, Deniz
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (09): : 1352 - 1364
  • [32] Soft Real-Time Hadoop Scheduler for Big Data Processing in Smart Cities
    Barbieru, Ciprian
    Pop, Florin
    IEEE 30TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS IEEE AINA 2016, 2016, : 863 - 870
  • [33] Multi-GPUs Gaussian Filtering for Real-Time Big Data Processing
    Zhang, Chaolong
    Xu, Yuanping
    He, Jia
    Lu, Jun
    Lu, Li
    Xu, Zhijie
    PROCEEDINGS OF 2016 10TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT & APPLICATIONS (SKIMA), 2016, : 231 - 236
  • [34] Predicting Coronavirus Pandemic in Real-Time Using Machine Learning and Big Data Streaming System
    Zhang, Xiongwei
    Saleh, Hager
    Younis, Eman M. G.
    Sahal, Radhya
    Ali, Abdelmgeid A.
    COMPLEXITY, 2020, 2020
  • [35] Optimization of real-time ultrasound PCIe data streaming and OpenCL processing for SAFT imaging
    Walczak, M.
    Lewandowski, M.
    Zolek, N.
    2013 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2013, : 2064 - 2067
  • [36] Real-time big data processing for instantaneous marketing decisions: A problematization approach
    Jabbar, Abdul
    Akhtar, Pervaiz
    Dani, Samir
    INDUSTRIAL MARKETING MANAGEMENT, 2020, 90 : 558 - 569
  • [37] A scalable and real-time system for disease prediction using big data processing
    Ed-daoudy, Abderrahmane
    Maalmi, Khalil
    El Ouaazizi, Aziza
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (20) : 30405 - 30434
  • [38] Near Real-Time Big Data Stream Processing Platform Using Cassandra
    Pal, Gautam
    Li, Gangmin
    Atkinson, Katie
    2018 4TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2018,
  • [39] A review on big data real-time stream processing and its scheduling techniques
    Tantalaki, Nicoleta
    Souravlas, Stavros
    Roumeliotis, Manos
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2020, 35 (05) : 571 - 601
  • [40] A scalable and real-time system for disease prediction using big data processing
    Abderrahmane Ed-daoudy
    Khalil Maalmi
    Aziza El Ouaazizi
    Multimedia Tools and Applications, 2023, 82 : 30405 - 30434