Six years later: testing vs. model checking

被引:0
|
作者
Beyer, Dirk [1 ]
Lemberger, Thomas [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
关键词
Software verification; Model checking; Program analysis; Test generation; Testing; Fuzzing;
D O I
10.1007/s10009-024-00769-8
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Six years ago, we performed the first large-scale comparison of automated test generators and software model checkers with respect to bug-finding capabilities on a benchmark set with 5693 C programs. Since then, the International Competition on Software Testing (Test-Comp) has established standardized formats and community-agreed rules for the experimental comparison of test generators. With this new context, it is time to revisit our initial question: Model checkers or test generators-which tools are more effective in finding bugs in software? To answer this, we perform a comparative analysis on the tools and existing data published by two competitions, the International Competition on Software Verification (SV-COMP) and Test-Comp. The results provide two insights: (1) Almost all test generators that participate in Test-Comp use hybrid approaches that include formal methods, and (2) although the considered model checkers are still highly competitive, they are now outperformed by the bug-finding capabilities of the considered test generators.
引用
收藏
页码:633 / 646
页数:14
相关论文
共 50 条
  • [31] Dying Revisited: Understanding the Process, Six Years Later
    Ornstein, Moshe C.
    JOURNAL OF PALLIATIVE MEDICINE, 2013, 16 (11) : 1479 - 1480
  • [32] RS3PE:: six years later
    Gumá, M
    Casado, E
    Tena, X
    Olivé, A
    ANNALS OF THE RHEUMATIC DISEASES, 1999, 58 (11) : 722 - 722
  • [33] Scrap tires. Six years later and counting
    Trojak, Larry
    Waste Age, 1991, 22 (08):
  • [34] Present focus and billing systems: Testing 'pay-as-you-go' vs. 'pay-later'
    Werthschulte, Madeline
    JOURNAL OF ECONOMIC BEHAVIOR & ORGANIZATION, 2023, 212 : 108 - 121
  • [35] BDD vs. constraint-based model checking: An experimental evaluation for asynchronous concurrent systems
    Bultan, T
    TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, 2000, 1785 : 441 - 455
  • [36] BDDs vs. zero-suppressed BDDs: for CTL symbolic model checking of Petri nets
    Yoneda, T
    Hatori, H
    Takahara, A
    Minato, S
    FORMAL METHODS IN COMPUTER-AIDED DESIGN, 1996, 1166 : 435 - 449
  • [37] Anaplastic Lymphoma Kinase Testing: IHC vs. FISH vs. NGS
    Niu, Xiaomin
    Chuang, Jody C.
    Berry, Gerald J.
    Wakelee, Heather A.
    CURRENT TREATMENT OPTIONS IN ONCOLOGY, 2017, 18 (12)
  • [38] Anaplastic Lymphoma Kinase Testing: IHC vs. FISH vs. NGS
    Xiaomin Niu
    Jody C. Chuang
    Gerald J. Berry
    Heather A. Wakelee
    Current Treatment Options in Oncology, 2017, 18
  • [39] Observation error model selection by information criteria vs. normality testing
    Rüdiger Lehmann
    Studia Geophysica et Geodaetica, 2015, 59 : 489 - 504
  • [40] Freedom vs. intervention: Six tough cases
    Nelson, Janet R.
    JOURNAL OF THE SOCIETY OF CHRISTIAN ETHICS, 2006, 26 (02) : 223 - 225