Survey of Real-time Processing Systems for Big Data

被引:42
|
作者
Liu, Xiufeng [1 ]
Iftikhar, Nadeem [2 ]
Xie, Xike [3 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
[2] Univ Coll Northern, Hjorring, Denmark
[3] Aalborg Univ, Aalborg, Denmark
关键词
Survey; Real-time; Big data; Architectures; Systems; MAPREDUCE;
D O I
10.1145/2628194.2628251
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, real-time processing and analytics systems for big data-in the context of Business Intelligence (BI)-have received a growing attention. The traditional BI platforms that perform regular updates on daily, weekly or monthly basis are no longer adequate to satisfy the fast-changing business environments. However, due to the nature of big data, it has become a challenge to achieve the real-time capability using the traditional technologies. The recent distributed computing technology, MapReduce, provides off-the-shelf high scalability that can significantly shorten the processing time for big data; Its open-source implementation such as Hadoop has become the de-facto standard for processing big data, however, Hadoop has the limitation of supporting real-time updates. The improvements in Hadoop for the real-time capability, and the other alternative real-time frameworks have been emerging in recent years. This paper presents a survey of the open source technologies that support big data processing in a real-time/near real-time fashion, including their system architectures and platforms.
引用
收藏
页码:356 / 361
页数:6
相关论文
共 50 条
  • [21] InfoFrame table access method for real-time processing of big data
    Oosawa, Hideki
    Miyata, Tsuyoshi
    NEC Technical Journal, 2012, 7 (02): : 23 - 27
  • [22] Real-time intelligent big data processing:technology, platform, and applications
    Tongya ZHENG
    Gang CHEN
    Xinyu WANG
    Chun CHEN
    Xingen WANG
    Sihui LUO
    ScienceChina(InformationSciences), 2019, 62 (08) : 102 - 113
  • [23] Beyond Batch Processing: Towards Real-Time and Streaming Big Data
    Shahrivari, Saeed
    COMPUTERS, 2014, 3 (04) : 117 - 129
  • [24] Real-Time Data ETL Framework for Big Real-Time Data Analysis
    Li, Xiaofang
    Mao, Yingchi
    2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, 2015, : 1289 - 1294
  • [25] Survey on Real-time Anomaly Detection Technology for Big Data Streams
    Luo, Yuanvan
    Du, Xuehui
    Sun, Yi
    PROCEEDINGS OF 2018 12TH IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY, AND IDENTIFICATION (ASID), 2018, : 26 - 30
  • [26] Parallel Job Processing Technique for Real-time Big-Data Processing Framework
    Son, Jae Gi
    Kang, Ji-Woo
    An, Jae-Hoon
    Ahn, Hyung-Joo
    Chun, Hyo-Jung
    Kim, Jung-Guk
    2016 RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS, 2016, : 226 - 229
  • [27] Load Balancing Scheme for Supporting Real-time Processing of Big Data in Distributed In-Memory Systems
    Bok, Kyoungsoo
    Choi, Kitae
    Lim, Jongtae
    Yoo, Jaesoo
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 170 - 174
  • [28] REAL-TIME DATA-PROCESSING SYSTEMS - INTRODUCTORY CONCEPTS
    WILLIAMS, RE
    NUCLEAR APPLICATIONS, 1965, 1 (04): : 381 - &
  • [29] AutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems
    Demirbaga, Umit
    Wen, Zhenyu
    Noor, Ayman
    Mitra, Karan
    Alwasel, Khaled
    Garg, Saurabh
    Zomaya, Albert Y.
    Ranjan, Rajiv
    IEEE TRANSACTIONS ON COMPUTERS, 2022, 71 (05) : 1035 - 1048
  • [30] Near real-time analysis of big fusion data on HPC systems
    Kube, Ralph
    Churchill, R. Michael
    Choi, Jong
    Wang, Ruonan
    Choi, Minjun
    Klasky, Scott
    Chang, C. S.
    PROCEEDINGS OF URGENTHPC 2020: THE IEEE/ACM INTERNATIONAL WORKSHOPS ON URGENT AND INTERACTIVE HPC, 2020, : 55 - 63