vivaGen - a survival data set generator for software testing

被引:0
|
作者
Gietzelt, Matthias [1 ,2 ,3 ]
Karmen, Christian [1 ]
Knaup-Gregori, Petra [1 ]
Ganzinger, Matthias [1 ]
机构
[1] Heidelberg Univ, Inst Med Biometry & Informat, Neuenheimer Feld 130-3, D-69120 Heidelberg, Germany
[2] TU Braunschweig, Peter L Reichertz Inst Med Informat, Carl Neuberg Str 1, D-30625 Hannover, Germany
[3] Hannover Med Sch, Carl Neuberg Str 1, D-30625 Hannover, Germany
关键词
Data set generator; Survival data; Biomarker; !text type='Java']Java[!/text;
D O I
10.1186/s12859-020-3478-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Software testing is an essential part of the software development process, but real-world data may not be suited or available for testing purposes. In the medical context, it can be especially hard to get the necessary test data for various reasons such as privacy concerns. To overcome these obstacles and provide data for the necessary thorough tests of software, the generation of simulated data sets can be a solution. In this paper, we focus on the challenging task of generating such survival data sets containing known effects. So far, no user-friendly software exists for the simulation of survival data, as they are typically derived from clinical trials with follow-ups. Results: To overcome these shortcomings, we developed an easy to use software package called vivaGen. In our Java software, parameters of survival time distributions are replaced by comprehensive measures that can be configured more intuitive by practitioners. vivaGen is equipped with a graphical frontend that allows users to adjust parameters and visualize the results in survival plots of the simulated cohorts. Conclusions: vivaGen is freely available and published as open source. It provides a novel way to generate test data sets based on probability distributions in a comprehensive and user-friendly way.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] vivaGen – a survival data set generator for software testing
    Matthias Gietzelt
    Christian Karmen
    Petra Knaup-Gregori
    Matthias Ganzinger
    BMC Bioinformatics, 21
  • [2] DDRAGE: A data set generator to evaluate ddRADseq analysis software
    Timm, Henning
    Weigand, Hannah
    Weiss, Martina
    Leese, Florian
    Rahmann, Sven
    MOLECULAR ECOLOGY RESOURCES, 2018, 18 (03) : 681 - 690
  • [3] Development of a synthetic data set generator for building and testing information discovery systems
    Lin, Pengyue J.
    Samadi, Behrokh
    Cipolone, Alan
    Jeske, Daniel R.
    Cox, Sean
    Rendon, Carlos
    Holt, Douglas
    Xiao, Rui
    THIRD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: NEW GENERATIONS, PROCEEDINGS, 2006, : 707 - +
  • [4] Code Generator for ADAS Software Testing
    Mihalj, Andrija
    Grbic, Ratko
    Lukic, Nemanja
    Kaprocki, Zvonimir
    2020 ZOOMING INNOVATION IN CONSUMER TECHNOLOGIES CONFERENCE (ZINC), 2020, : 184 - 189
  • [5] A software data generator for radiographic imaging investigations
    Lazos, D
    Kolitsi, Z
    Pallikarakis, N
    IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2000, 4 (01): : 76 - 79
  • [6] Using the SEEM software for laser SET testing and analysis
    Pouget, Vincent
    Fouillat, Pascal
    Lewis, Dean
    RADIATION EFFECTS ON EMBEDDED SYSTEMS, 2007, : 259 - +
  • [7] Integrated Test Platform of One Diesel Generator Set Embedded Software
    Zheng, Yuan-jian
    Huang, Zheng
    Ni, He
    2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, SIMULATION AND MODELLING (AMSM 2017), 2017, 162 : 355 - 360
  • [8] Blue Pages: Software as a Service Data Set
    Alkalbani, Asma Musabah
    Ghamry, Ahmed Mohamed
    Hussain, Farookh Khadeer
    Hussain, Omar Khadeer
    2015 10TH INTERNATIONAL CONFERENCE ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2015), 2015, : 269 - 274
  • [9] Spiked proteomic standard data set for testing label-free quantitative software and statistical methods
    Ramus, Claire
    Hovasse, Agnes
    Marcellin, Marlene
    Hesse, Anne-Marie
    Mouton-Barbosa, Emmanuelle
    Bouyssie, David
    Vaca, Sebastian
    Carapito, Christine
    Chaoui, Karima
    Bruley, Christophe
    Garin, Jerome
    Cianferani, Sarah
    Ferro, Myriam
    Van Dorssaeler, Alain
    Burlet-Schiltz, Odile
    Schaeffer, Christine
    Coute, Yohann
    de Peredo, Anne Gonzalez
    DATA IN BRIEF, 2016, 6 : 286 - 294
  • [10] MSTGen: Simulated Data Generator for Multistage Testing
    Han, Kyung T.
    APPLIED PSYCHOLOGICAL MEASUREMENT, 2013, 37 (08) : 666 - 668