Evaluating the Reproducibility cost of the scientific workflows

被引:0
|
作者
Banati, Anna [1 ]
Kacsuk, Peter [2 ,3 ]
Kozlovszky, Miklos [1 ,2 ]
机构
[1] Obuda Univ, John von Neumann Fac Informat, Biotech Lab, Becsi Str 96-B, H-1034 Budapest, Hungary
[2] MTA SZTAKI, Pf 63, H-1518 Budapest, Hungary
[3] Univ Westminster, 115 New Cavendish St, London W1W 6UW, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In almost all research field scientific studies can be implemented by in silico experiments. They are modelled by scientific workflows which describes the data or control flow between the consecutive computational tasks. Since these experiments are data and compute intensive they need parallel and distributed infrastructures to be enacted (grids, clusters, clouds and supercomputers). The complexity of the infrastructures and the continuously changing environment faces us a big challenge in reproducibility, which is often needed for results sharing or for judging scientific claims in the scientists' community. The necessary parameters of reproducible workflows can be originated from different sources (infrastructural, third party, or related to the binaries), which may change or become unavailable during the process of re-execution. However in most cases the lack of the original parameters can be compensated by replacing, evaluating or simulating the value of the descriptors with some extra cost in order to make it reproducible. In this paper we give the expected cost of making a workflow reproducible or more precisely to determine the probability of making a workflow reproducible with more than a predefined cost C.
引用
收藏
页码:187 / 190
页数:4
相关论文
共 50 条
  • [1] Evaluating the Average Reproducibility Cost of the Scientific Workflows
    Banati, Anna
    Karasz, Peter
    Kacsuk, Peter
    Kozlovszky, Miklos
    2016 IEEE 14TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY), 2016, : 79 - 84
  • [2] Reproducibility Analysis of Scientific Workflows
    Banati, Anna
    Kacsuk, Peter
    Kozlovszky, Miklos
    ACTA POLYTECHNICA HUNGARICA, 2017, 14 (02) : 201 - 217
  • [3] Dealing with Reusability and Reproducibility for Scientific Workflows
    Lifschitz, Sergio
    Gomes, Luciana
    Rehen, Stevens K.
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 625 - 632
  • [4] Computational reproducibility of scientific workflows at extreme scales
    Pouchard, Line
    Baldwin, Sterling
    Elsethagen, Todd
    Jha, Shantenu
    Raju, Bibi
    Stephan, Eric
    Tang, Li
    Van Dam, Kerstin Kleese
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2019, 33 (05): : 763 - 776
  • [5] Classification of Scientific Workflows Based on Reproducibility Analysis
    Banati, A.
    Kacsuk, P.
    Kozlovszky, M.
    2016 39TH INTERNATIONAL CONVENTION ON INFORMATION AND COMMUNICATION TECHNOLOGY, ELECTRONICS AND MICROELECTRONICS (MIPRO), 2016, : 327 - 331
  • [6] Experiences with Reproducibility: Case Studies from Scientific Workflows
    Ghoshal, Devarshi
    Paine, Drew
    Pastorello, Gilberto
    Elbashandy, Abdelrahman
    Gunter, Dan
    Amusat, Oluwamayowa
    Ramakrishnan, Lavanya
    PROCEEDINGS OF THE 4TH INTERNATIONAL WORKSHOP ON PRACTICAL REPRODUCIBLE EVALUATION OF COMPUTER SYSTEMS, P-RECS 2021, 2021, : 3 - 8
  • [7] Science Capsule: Towards Sharing and Reproducibility of Scientific Workflows
    Ghoshal, Devarshi
    Bianchi, Ludovico
    Essiari, Abdelilah
    Paine, Drew
    Poon, Sarah S.
    Beach, Michael
    N'Diaye, Alpha T.
    Huck, Patrick
    Ramakrishnan, Lavanya
    PROCEEDINGS OF 16TH WORKSHOP ON WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS21), 2021, : 66 - 73
  • [8] Facilitating the Reproducibility of Scientific Workflows with Execution Environment Specifications
    Meng, Haiyan
    Thain, Douglas
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE (ICCS 2017), 2017, 108 : 705 - 714
  • [9] Data Provenance and Reproducibility in Grid Based Scientific Workflows
    Tylissanakis, G.
    Cotronis, Y.
    2009 4TH INTERNATIONAL CONFERENCE ON GRID AND PERVASIVE COMPUTING WORKSHOPS: (GPC WORKSHOPS), 2009, : 40 - 47
  • [10] Towards Reproducibility in Scientific Workflows: An Infrastructure-Based Approach
    Santana-Perez, Idafen
    Perez-Hernandez, Maria S.
    SCIENTIFIC PROGRAMMING, 2015, 2015