Markov model based coverage testing of deep learning software systems

被引:0
|
作者
Shi, Ying [1 ]
Yin, Beibei [1 ]
Shi, Jing-Ao [1 ]
机构
[1] Beihang Univ BUAA, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
关键词
Deep learning software systems; Deep learning testing; Markov chains; Coverage criteria; Information theory; REPRESENTATION; FRAMEWORK;
D O I
10.1016/j.infsof.2024.107628
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Context: Deep Learning (DL) software systems have been widely deployed in safety and security-critical domains, which calls for systematic testing to guarantee their accuracy and reliability. Objective measurement of test quality is one of the key issues in software testing. Recently, many coverage criteria have been proposed to measure the testing adequacy of Deep Neural Networks (DNNs). Objective: Recent research demonstrates that existing criteria have some limitations on interpreting the increasingly diverse behaviors of DNNs or clarifying the relationship between the coverage and the decision logic of DNNs. Moreover, some evaluations argue against the correlation between coverage and defect detection. In this paper, a novel coverage approach is proposed to interpret the internal information of programs. Methods: The process of coverage testing is formalized and quantified by constructing Markov models based on critical neurons extracted using Layer-wise Relevance Propagation in the structure of DNNs. The difference in the transition matrix of Markov chains between training and testing data is measured by KL divergence, and it is developed as a coverage criterion. Results: The values of the proposed coverage increase as the number of classes increases. The values are different for various test suites, and they become higher with the addition of new samples. Higher coverage values are observed to correlate with an increased fault detection capability. Conclusion: The experimental results illustrate that the proposed approach can effectively measure actual diversity and exhibit more adaptability to additional test cases. Furthermore, there is a positive correlation between the proposed coverage and fault detection, which provides support for test case selection guided by coverage.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] DeepBoundary: A Coverage Testing Method of Deep Learning Software based on Decision Boundary Representation
    Liu, Yue
    Feng, Lichao
    Wang, Xingya
    Zhang, Shiyu
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY, AND SECURITY COMPANION, QRS-C, 2022, : 166 - 172
  • [2] DeepCon: Contribution Coverage Testing for Deep Learning Systems
    Zhou, Zhiyang
    Dou, Wensheng
    Liu, Jie
    Zhang, Chenxin
    Wei, Jun
    Ye, Dan
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION AND REENGINEERING (SANER 2021), 2021, : 189 - 200
  • [3] Model-based testing of software for automation systems using heuristics and coverage criterion
    Sarmento Peixoto, Rodrigo Jose
    da Silva, Leandro Dias
    Perkusich, Angelo
    SOFTWARE AND SYSTEMS MODELING, 2019, 18 (02): : 797 - 823
  • [4] Model-based testing of software for automation systems using heuristics and coverage criterion
    Rodrigo José Sarmento Peixoto
    Leandro Dias da Silva
    Angelo Perkusich
    Software & Systems Modeling, 2019, 18 : 797 - 823
  • [5] Coverage Guided Differential Adversarial Testing of Deep Learning Systems
    Guo, Jianmin
    Zhao, Yue
    Song, Houbing
    Jiang, Yu
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2021, 8 (02): : 933 - 942
  • [6] Prioritizing software regression testing using reinforcement learning and hidden Markov model
    Rawat N.
    Somani V.
    Tripathi A.K.
    International Journal of Computers and Applications, 2023, 45 (12) : 748 - 754
  • [7] Markov approach for quantifying the software code coverage using genetic algorithm in software testing
    Boopathi, M.
    Sujatha, R.
    Kumar, C. Senthil
    Narasimman, S.
    Rajan, A.
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2019, 14 (01) : 27 - 45
  • [8] A binomial software reliability model based on coverage of structural testing criteria
    Adalberto Nobiato Crespo
    Mario Jino
    Alberto Pasquini
    José Carlos Maldonado
    Empirical Software Engineering, 2008, 13
  • [9] A binomial software reliability model based on coverage of structural testing criteria
    Crespo, Adalberto Nobiato
    Jino, Mario
    Pasquini, Alberto
    Maldonado, Jose Carlos
    EMPIRICAL SOFTWARE ENGINEERING, 2008, 13 (02) : 185 - 209
  • [10] Heuristics for Improving Model Learning Based Software Testing
    Irfan, Muhammad Naeem
    2009 TESTING: ACADEMIC AND INDUSTRIAL CONFERENCE-PRACTICE AND RESEARCH TECHNIQUES, TAIC PART 2009, 2009, : 127 - 128