Alovera: A Fast Stream Processing System for Large-Scale Data

被引:0
|
作者
Zhang, Zhen'An [1 ]
Zhang, Dongjie [2 ]
Yu, Xiaopeng [1 ]
Wang, Jing [2 ]
He, Chunjiang [3 ]
Yuan, Pingpeng [2 ]
Jin, Hai [2 ]
机构
[1] HAEPC Elect Power Res Inst, Zhengzhou, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Cluster & Grid Comp Lab, Serv Comp Technol & Syst Lab, Wuhan 430074, Peoples R China
[3] China Elect Power Res Inst, Beijing 100085, Peoples R China
关键词
Large-scale data analysis; query execution; columnar store; stream processing;
D O I
10.1109/ChinaGrid.2013.9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Growing of data volume poses challenges to data processing system. In this paper, Alovera, a fast stream processing system for large-scale data is presented. By using columnar data layout and stream processing, it is capable of pipelining data processing efficiently. It can process part of data instead of waiting for all data to be ready for the next operation. Thus, it can reduce the query time dramatically. Experimental results indicate significant performance improvement in a variety of tasks. In the experiments, we also evaluate our methods with different systems including HadoopDB and Hive. The extensive experiments confirm efficiency and better performance of our system.
引用
收藏
页码:74 / 79
页数:6
相关论文
共 50 条
  • [21] Predicting the Stability of Large-scale Distributed Stream Processing Systems on the Cloud
    Tri Minh Truong
    Harwood, Aaron
    Sinnott, Richard O.
    CLOSER: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, 2017, : 575 - 582
  • [22] Performance Analysis of Large-scale Distributed Stream Processing Systems on the Cloud
    Tri Minh Truong
    Harwood, Aaron
    Sinnott, Richard O.
    Chen, Shiping
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 754 - 761
  • [23] A United Framework for Large-Scale Resource Description Framework Stream Processing
    Fang, Hong
    Zhao, Bo
    Zhang, Xiao-Wang
    Yang, Xuan-Xing
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2019, 34 (04) : 762 - 774
  • [24] Storage optimization for large-scale distributed stream-processing systems
    Hildrum, Kirsten
    Douglis, Fred
    Wolf, Joel L.
    Yu, Philip S.
    Fleischer, Lisa
    Katta, Akshay
    ACM Transactions on Storage, 2008, 3 (04)
  • [25] A United Framework for Large-Scale Resource Description Framework Stream Processing
    Hong Fang
    Bo Zhao
    Xiao-Wang Zhang
    Xuan-Xing Yang
    Journal of Computer Science and Technology, 2019, 34 : 762 - 774
  • [26] A Fast Retrieval Algorithm for Large-Scale XML Data
    Tanioka, Hiroki
    FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 129 - 137
  • [27] RESEARCH IMPLEMENTATION OF BEIDOU DATA FAST PROCESSING OF LARGE-SCALE GNSS REFERENCE STATION NETWORK
    Zhang Qinglan
    Zhang Peng
    Sun Zhanyi
    Wu Junli
    Chen Hua
    Wang Xiaoqing
    Qi Ligang
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION IV, 2022, 43-B4 : 203 - 208
  • [28] Fast Processing Method of SAR Raw Data Simulation For Large-scale Forest Stand Application
    Sun, Hanwei
    Tian, Weiming
    Hu, Cheng
    Zeng, Tao
    Long, Teng
    2011 IEEE RADAR CONFERENCE (RADAR), 2011, : 299 - 302
  • [29] Large-Scale Data Processing for Information Retrieval Applications
    Khandel, Pooya
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3489 - 3489
  • [30] Parallel Strategy for the Large-Scale Data Streams Processing
    Yuan, Ya-Juan
    Ma, Guo-Jie
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND INFORMATION SYSTEMS, 2016, 52 : 232 - 234