Distributed stream processing analysis in high availability context

被引:0
|
作者
Gorawski, Marcin [1 ]
Marks, Pawel [1 ]
机构
[1] Silesian Tech Univ, Inst Comp Sci, Akademicka 16, PL-44100 Gliwice, Poland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Not so long ago data warehouses were used to process data sets loaded periodically during ETL process (Extraction, Transformation and Loading). We could distinguish two kinds of ETL processes: full and incremental. Now we often have to process real-time data and analyse them almost on-the-fly, so the analyses are always up to date. There are many possible applications for real-time data warehouses. In most cases two features are important: delivering data to the warehouse as quick as possible, and not losing any tuple in case of failures. In this paper we propose an architecture for gathering and processing data from geographically distributed data sources. We present theoretical analysis, mathematical model of a data source, some rules of system modules configuration and results of experiments. At the end of the paper our future plans are described briefly.
引用
收藏
页码:61 / +
页数:2
相关论文
共 50 条
  • [1] High-availability algorithms for distributed stream processing
    Hwang, JH
    Balazinska, M
    Rasin, A
    Çetintemel, U
    Stonebraker, M
    Zdonik, S
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 779 - 790
  • [2] Load management and high availability in the Borealis distributed stream processing engine
    Tatbul, Nesime
    Ahmad, Yanif
    Cetintemel, Ugur
    Hwang, Jeong-Hyon
    Xing, Ying
    Zdonik, Stan
    GEOSENSOR NETWORKS, 2008, 4540 : 66 - +
  • [3] Context-Aware Stream Processing for Distributed IoT Applications
    Akbar, Adnan
    Carrez, Francois
    Moessner, Klaus
    Sancho, Juan
    Rico, Juan
    2015 IEEE 2ND WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2015, : 663 - 668
  • [4] A Hybrid Approach to High Availability in Stream Processing Systems
    Zhang, Zhe
    Gu, Yu
    Ye, Fan
    Yang, Hao
    Kim, Minkyong
    Lei, Hui
    Liu, Zhen
    2010 INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS ICDCS 2010, 2010,
  • [5] A High Throughput Distributed Log Stream Processing System for Network Security Analysis
    Zhao, Jingfen
    Zhang, Peng
    Sun, Yong
    Liu, Qingyun
    Tan, Guolin
    Li, Zhengmin
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1092 - 1096
  • [6] Recovery Processing for High Availability Stream Processing Systems in Local Area Networks
    Aritsugi, Masayoshi
    Nagano, Kyoko
    TENCON 2010: 2010 IEEE REGION 10 CONFERENCE, 2010, : 1036 - 1041
  • [7] Research on High Availability Mechanism in Distributed Data Stream Management System
    Liu Wei
    Qi Jun
    Xu LiZhen
    2013 10TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA 2013), 2013, : 289 - +
  • [8] AVAILABILITY ANALYSIS FOR THE DESIGN OF DISTRIBUTED-PROCESSING NETWORKS
    TSUCHIYA, M
    JOURNAL OF SYSTEMS AND SOFTWARE, 1985, 5 (03) : 221 - 227
  • [9] Dragon: A Lightweight, High Performance Distributed Stream Processing Engine
    Harwood, Aaron
    Read, Maria Rodriguez
    Amarasinghe, Gayashan Niroshana
    2020 IEEE 40TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2020, : 1344 - 1351
  • [10] Robust Distributed Stream Processing
    Lei, Chuan
    Rundensteiner, Elke A.
    Guttman, Joshua D.
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 817 - 828