Cost of Flaky Tests in Continuous Integration: An Industrial Case Study

被引:0
|
作者
Leinen, Fabian [1 ]
Elsner, Daniel [1 ]
Pretschner, Alexander [1 ]
Stahlbauer, Andreas [2 ]
Sailer, Michael [2 ]
Juergens, Elmar [2 ]
机构
[1] Tech Univ Munich, Munich, Germany
[2] CQSE GmbH, Munich, Germany
关键词
flaky tests; continuous integration; regression testing; cost modeling; industrial case study;
D O I
10.1109/ICST60714.2024.00037
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Researchers and practitioners alike increasingly often perceive flaky tests as a major challenge in software engineering. They spend a lot of effort trying to detect, repair, and mitigate the negative effects of flaky tests. However, it is yet unclear where and to what extent the costs of flaky tests manifest in industrial Continuous Integration (CI) development processes. In this study, we compile cost factors introduced by flaky tests in CI development from research and practice and derive a cost model that allows gaining insight into the costs incurred. We then instantiate this model in a case study of a large, commercial software project with similar to 30 developers and similar to 1M SLoC. We analyze five years of development history, including CI test logs, commits from the Version Control System (VCS), issue tickets, and tracked work time to quantify the cost factors implied by flaky tests. We find that the time spent dealing with flaky tests in the studied project represents at least 2.5% of the productive developer time. This effort is divided into investigating potentially flaky test failures, which accounts for 1.1% of the total time spent, repairing flaky tests adds another 1.3%, and developing tools to monitor flaky tests adds 0.1%. Contrary to most other studies, we find the cost for rerunning tests to be negligible and inexpensive. Automatically rerunning a test costs 0.02 cents, while not rerunning and thus letting the pipeline fail results in a manual investigation costing $5.67 in our context. The insights gained from our case study have led to the decision to shift effort from investigation and repair to automatically rerunning tests. Our cost model can help practitioners analyze the cost of flaky tests in their context and make informed decisions. Furthermore, our case study provides a first step to better understand the costs of flaky tests, which can lead researchers to industry-relevant problems.
引用
收藏
页码:329 / 340
页数:12
相关论文
共 50 条
  • [21] Virtual commissioning for the control of the continuous industrial processes - case study
    Fratczak, Michal
    Nowak, Pawel
    Klopot, Tomasz
    Czeczot, Jacek
    Bysko, Szymon
    Opilski, Bartosz
    2015 20TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2015, : 1032 - 1037
  • [22] Industrial customer outage cost analysis: a case study of Nepal
    Karki, Nava Raj
    Mishra, Arbind Kumar
    Shrestha, Jayandra
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2010, 1 (01) : 44 - 51
  • [23] Cost and Schedule Monitoring of Industrial Building Projects: Case Study
    De Marco, Alberto
    Briccarello, Diego
    Rafele, Carlo
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT-ASCE, 2009, 135 (09): : 853 - 862
  • [24] Transformers for GUI Testing: A Plausible Solution to Automated Test Case Generation and Flaky Tests
    Khaliq, Zubair
    Farooq, Sheikh Umar
    Khan, Dawood Ashraf
    COMPUTER, 2022, 55 (03) : 64 - 73
  • [25] Simulations on Consumer Tests: A Systematic Evaluation Approach in an Industrial Case Study
    Berger, Christian
    Block, Delf
    Heeren, Soenke
    Hons, Christian
    Kuehnel, Stefan
    Leschke, Andre
    Plotnikov, Dimitri
    Rumpe, Bernhard
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2014, : 1474 - 1480
  • [26] Simulations on Consumer Tests: A Systematic Evaluation Approach in an Industrial Case Study
    Berger, Christian
    Block, Delf
    Heeren, Soenke
    Hons, Christian
    Kuehnel, Stefan
    Leschke, Andre
    Plotnikov, Dimitri
    Rumpe, Bernhard
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2015, 7 (04) : 24 - 36
  • [27] Case study to promote the use of industrial byproducts: The relevance of performance tests
    Gomes Correia, A.
    Roque, António J.
    Reis Ferreira, S.M.
    Fortunato, Eduardo
    Journal of ASTM International, 2012, 9 (02):
  • [28] Comparing the effort and effectiveness of automated and manual tests An industrial case study
    Dobles, Ignacio
    Martinez, Alexandra
    Quesada-Lopez, Christian
    2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2019,
  • [29] Integration of Urban Design and Industrial Estate Planning, a Case Study: Medan Industrial Estate, Indonesia
    Fachrudin, Hilma Tamiami
    Ali, Kausar Hj
    INTERNATIONAL TRANSACTION JOURNAL OF ENGINEERING MANAGEMENT & APPLIED SCIENCES & TECHNOLOGIES, 2012, 3 (04): : 423 - 437
  • [30] Integration and Development of Industrial Organization: The Case Study of Sugar Industry in China
    Wang, Yaxin
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCE, EDUCATION MANAGEMENT AND SPORTS EDUCATION, 2015, 39 : 1544 - 1550