A simulation provenance data management system for efficient job execution on an online computational science engineering platform

被引:1
|
作者
Ma, Jin [1 ]
Lee, Sik [1 ]
Cho, Kum Won [1 ]
Suh, Young-Kyoon [2 ]
机构
[1] Korea Inst Sci & Technol Informat, Natl Inst Supercomp & Networking, Daejeon, South Korea
[2] Kyungpook Natl Univ, Sch Comp Sci & Engn, Daegu, South Korea
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2019年 / 22卷 / 01期
基金
新加坡国家研究基金会;
关键词
EDISON; HPC; PROV; Provenance; Science app; Simulation;
D O I
10.1007/s10586-018-2827-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the past few years an online simulation service platform (named EDISON) has been applauded by several computational science and engineering communities in several countries. Though armed with multiple computing clusters and high-end storage resources, the platform has suffered from handling a huge amount of CPU-/IO-bound simulations that are most duplicated. Such intense simulations are normally admitted with no duplicate elimination and thus can adversely affect the performance of the platform. To address this performance concern, we propose a novel system, termed SuperMan, to seamlessly record and retrieve the provenances of previously executed simulations, and so prevent users from initiating duplicate and/or similar simulations using the limited computing resources. The system collects the simulation provenances based on a variant of a de-facto standard form, thereby offering interoperability. Based on the stored provenances, the system can provide useful simulation run statistics for users that need assistance. SuperMan also applies a hash-based duplicate elimination technique, resulting in making more efficient simulations on the platform. Finally, we show that the proposed proposed system could remove slightly over half of duplicate simulations on a variety of simulation software while obtaining about overall elapsed time savings of 30% and queuing time savings of 25%.
引用
收藏
页码:147 / 159
页数:13
相关论文
共 20 条
  • [1] A simulation provenance data management system for efficient job execution on an online computational science engineering platform
    Jin Ma
    Sik Lee
    Kum Won Cho
    Young-Kyoon Suh
    Cluster Computing, 2019, 22 : 147 - 159
  • [2] Provenance Data in the Machine Learning Lifecycle in Computational Science and Engineering
    Souza, Renan
    Azevedo, Leonardo
    Lourenco, Vitor
    Soares, Elton
    Thiago, Raphael
    Brandao, Rafael
    Civitarese, Daniel
    Brazil, Emilio Vital
    Moreno, Marcio
    Valduriez, Patrick
    Mattoso, Marta
    Cerqueira, Renato
    Netto, Marco A. S.
    PROCEEDINGS OF WORKS19: THE 2019 14TH IEEE/ACM WORKFLOWS IN SUPPORT OF LARGE-SCALE SCIENCE (WORKS), 2019, : 1 - 10
  • [3] Capturing Provenance for Runtime Data Analysis in Computational Science and Engineering Applications
    Silva, Vitor
    Souza, Renan
    Camata, Jose
    de Oliveira, Daniel
    Valduriez, Patrick
    Coutinho, Alvaro L. G. A.
    Mattoso, Marta
    PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, IPAW 2018, 2018, 11017 : 183 - 187
  • [4] EXTES: An Execution-Time Estimation Scheme for Efficient Computational Science and Engineering Simulation via Machine Learning
    Kim, Seounghyeon
    Suh, Young-Kyoon
    Kim, Jeeyoung
    IEEE ACCESS, 2019, 7 : 98993 - 99002
  • [5] Computational Science Engineering Platform Integration of heterogeneous systems for simulation service
    Kwon, Yejin
    Jeon, Inho
    Seo, Jerry H.
    Lee, Jongsuk R.
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1269 - 1273
  • [6] Simulation Data Lifecycle Management Based On Engineering Flight Simulation Platform
    Yang, Penglin
    Xu, Hong
    Shen, Weiqun
    MECHANICAL AND ELECTRONICS ENGINEERING VI, 2014, 654 : 385 - 390
  • [7] Effectiveness analysis of web based simulation for computational science and engineering on Improvement EDISON Platform
    Kwon, Yejin
    Jeon, Inho
    Lee, Jongsuk R.
    Seo, Jerry H.
    2018 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2018, : 27 - 33
  • [8] PEGASEF: A Provenance-Based Big Data Service Framework for Efficient Simulation Execution on Shared Computing Clusters
    Suh, Young-Kyoon
    Lee, Ki Yong
    Baek, Nakhoon
    BIG DATA APPLICATIONS AND SERVICES 2017, 2019, 770 : 175 - 182
  • [9] Effectiveness of Online Simulation-Based E-Learning for Computational Science and Engineering in Higher Education
    Park, Sun-Rae
    Lee, Jongsuk Ruth
    Cho, Kumwon
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2257 - 2261
  • [10] SUPERMAN: A Novel System for Storing and Retrieving Scientific-Simulation Provenance for Efficient Job Executions on Computing Clusters
    Suh, Young-Kyoon
    Ma, Jin
    2017 IEEE 2ND INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2017, : 283 - 288