Alovera: A Fast Stream Processing System for Large-Scale Data

被引:0
|
作者
Zhang, Zhen'An [1 ]
Zhang, Dongjie [2 ]
Yu, Xiaopeng [1 ]
Wang, Jing [2 ]
He, Chunjiang [3 ]
Yuan, Pingpeng [2 ]
Jin, Hai [2 ]
机构
[1] HAEPC Elect Power Res Inst, Zhengzhou, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[3] China Elect Power Res Inst, Beijing 100085, Peoples R China
关键词
Large-scale data analysis; query execution; columnar store; stream processing;
D O I
10.1109/ChinaGrid.2013.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Growing of data volume poses challenges to data processing system. In this paper, Alovera, a fast stream processing system for large-scale data is presented. By using columnar data layout and stream processing, it is capable of pipelining data processing efficiently. It can process part of data instead of waiting for all data to be ready for the next operation. Thus, it can reduce the query time dramatically. Experimental results indicate significant performance improvement in a variety of tasks. In the experiments, we also evaluate our methods with different systems including HadoopDB and Hive. The extensive experiments confirm efficiency and better performance of our system.
引用
收藏
页码:74 / 79
页数:6
相关论文
共 50 条
  • [1] Optimizing data stream processing for large-scale applications
    Cappellari, Paolo
    Roantree, Mark
    Chun, Soon Ae
    SOFTWARE-PRACTICE & EXPERIENCE, 2018, 48 (09): : 1607 - 1641
  • [2] Dynamic and fast processing of queries on large-scale RDF data
    Yuan, Pingpeng
    Xie, Changfeng
    Jin, Hai
    Liu, Ling
    Yang, Guang
    Shi, Xuanhua
    KNOWLEDGE AND INFORMATION SYSTEMS, 2014, 41 (02) : 311 - 334
  • [3] Dynamic and fast processing of queries on large-scale RDF data
    Pingpeng Yuan
    Changfeng Xie
    Hai Jin
    Ling Liu
    Guang Yang
    Xuanhua Shi
    Knowledge and Information Systems, 2014, 41 : 311 - 334
  • [4] Adaptive correlated prefetch with large-scale hybrid memory system for stream processing
    Lee, Sung Min
    Yoon, Su-Kyung
    Kim, Jeong-Geun
    Kim, Shin-Dug
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (09): : 4746 - 4770
  • [5] Adaptive correlated prefetch with large-scale hybrid memory system for stream processing
    Sung Min Lee
    Su-Kyung Yoon
    Jeong-Geun Kim
    Shin-Dug Kim
    The Journal of Supercomputing, 2018, 74 : 4746 - 4770
  • [6] Towards Large-Scale Graph Stream Processing Platform
    Suzumura, Toyotaro
    Nishii, Shunsuke
    Ganse, Masaru
    WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 1321 - 1326
  • [7] A Hybrid Processing System for Large-Scale Traffic Sensor Data
    Zhao, Zhuofeng
    Ding, Weilong
    Wang, Jianwu
    Han, Yanbo
    IEEE ACCESS, 2015, 3 : 2341 - 2351
  • [8] ShuffleBench: A Benchmark for Large-Scale Data Shuffling Operations with Distributed Stream Processing Frameworks
    Henning, Soeren
    Vogel, Adriano
    Leichtfried, Michael
    Ertl, Otmar
    Rabiser, Rick
    PROCEEDINGS OF THE 15TH ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING, ICPE 2024, 2024, : 2 - 13
  • [9] Fast and exact processing of large-scale video data based on matrix operation
    1600, Inst. of Image Information and Television Engineers (67):
  • [10] Boosting Algorithms for Large-Scale Data and Data Batch Stream
    Yoon, Young Joo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2010, 23 (01) : 197 - 206