s2p: Provenance Research for Stream Processing System

Cited by: 3
Authors
Ye, Qian [1 ,2 ]
Lu, Minyan [1 ,2 ]
Affiliations
[1] Beihang Univ, Key Lab Reliabil & Environm Engn Technol, Beijing 100191, Peoples R China
[2] Beihang Univ, Sch Reliabil & Syst Engn, Beijing 100191, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2021 / Vol. 11 / Issue 12
Keywords
stream provenance; fine-grained provenance; coarse-grained provenance; replay; checkpoint; MAPREDUCE; MODEL;
DOI
10.3390/app11125523
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The main purpose of our provenance research for DSP (distributed stream processing) systems is to analyze abnormal results. Provenance for these systems is nontrivial because of the ephemerality of stream data and the instant data processing mode of modern DSP systems. Challenges include, but are not limited to, avoiding excessive runtime overhead, reducing provenance-related data storage, and providing provenance in an easy-to-use fashion. Without any prior knowledge about which kinds of data may finally lead to abnormal results, we have to track all transformations in detail, which potentially imposes a heavy burden on the system. This paper proposes s2p (Stream Process Provenance), which mainly consists of online provenance and offline provenance, to provide fine- and coarse-grained provenance at different levels of precision. We base our design of s2p on the fact that, for a mature online DSP system, abnormal results are rare, and the results that require a detailed analysis are even rarer. We also consider state transition in our provenance explanation. We implement s2p on Apache Flink, named s2p-flink, and conduct three experiments to evaluate its scalability, efficiency, and overhead in terms of end-to-end cost, throughput, and space overhead. Our evaluation shows that s2p-flink incurs a 13% to 32% cost overhead, an 11% to 24% decline in throughput, and little additional space cost in the online provenance phase. Experiments also demonstrate that s2p-flink scales well. A case study is presented to demonstrate the feasibility of the whole s2p solution.
Pages: 33
Related papers
50 in total
  • [31] S2P: A stable 2-pole RC delay and coupling noise metric
    Acar, E
    Odabasioglu, A
    Celik, M
    Pileggi, LT
    NINTH GREAT LAKES SYMPOSIUM ON VLSI, PROCEEDINGS, 1999, : 60 - 63
  • [32] S1P/S2P wave separation and shear wave splitting correction for SVP data
    Yue, Yuanyuan
    Qian, Zhongping
    Nie, Hongmei
    Sun, Pengyuan
    Deng, Zhiwen
    Li, Jianfeng
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2023, 58 (06): : 1374 - 1381
  • [33] LIGAND DISPLACEMENT FROM TETRAKIS(OO'-DIETHYL PHOSPHORODITHIOATE)-LANTHANOID(III) ANIONS BY TRIPHENYLPHOSPHINE OXIDE - X-RAY CRYSTAL-STRUCTURE OF [LA[S2P(OET)2]3(POPH3)2] AND [SM[S2P(OET)2]2(POPH3)3]-[S2P(OET)2]
    PINKERTON, AA
    SCHWARZENBACH, D
    JOURNAL OF THE CHEMICAL SOCIETY-DALTON TRANSACTIONS, 1976, (23): : 2466 - 2471
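  • [34] S1P/S2P wave separation and SVP shear-wave splitting correction
    Yue, Yuanyuan
    Qian, Zhongping
    Nie, Hongmei
    Sun, Pengyuan
    Deng, Zhiwen
    Li, Jianfeng
    Oil Geophysical Prospecting, 2023, 58 (06) : 1374 - 1381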
  • [35] S2P: A Desktop Application for Fast and Easy Processing of 2D-Gel and MALDI-Based Mass Spectrometry Protein Data
    Lopez-Fernandez, Hugo
    Araujo, Jose E.
    Glez-Pena, Daniel
    Reboiro-Jato, Miguel
    Fdez-Riverola, Florentino
    Capelo-Martinez, Jose L.
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 1 - 8
  • [36] Biochemical Characterization of Function and Structure of RseP, an Escherichia coli S2P Protease
    Hizukuri, Y.
    Akiyama, K.
    Akiyama, Y.
    ENZYMOLOGY AT THE MEMBRANE INTERFACE: INTRAMEMBRANE PROTEASES, 2017, 584 : 1 - 33
  • [37] The anatomy of a stream processing system
    Gilani, Altaf
    Sonune, Satyajeet
    Kendai, Balakumar
    Chakravarthy, Sharma
    FLEXIBLE AND EFFICIENT INFORMATION HANDLING, 2006, 4042 : 232 - 239
  • [38] EVIDENCE FOR EXISTENCE OF MOO2[S2P(OET)2]2 FROM DISSOCIATION OF MO2O3[S2P(OET)2]4 AND FORMATION OF MIXED-LIGAND MO(V) COMPLEXES
    CHEN, GJJ
    MCDONALD, JW
    NEWTON, WE
    INORGANIC & NUCLEAR CHEMISTRY LETTERS, 1976, 12 (09): : 697 - 702
  • [39] The role of p53 in the DNA damage-related ubiquitylation of S2P RNAPII
    Borsos, Barbara N.
    Pantazi, Vasiliki
    Pahi, Zoltan G.
    Majoros, Hajnalka
    Ujfaludi, Zsuzsanna
    Berzsenyi, Ivett
    Pankotai, Tibor
    PLOS ONE, 2022, 17 (05):
  • [40] Using S2P for routing awareness in tuple-based pervasive systems
    Rahmani, Shahpour
    Sharifi, Mohsen
    Kolahdooz, Saman
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2009, 4 (02) : 91 - 98