Mutation-Based White Box Testing of Deep Neural Networks

被引：1

作者：

Cetiner, Gokhan ^{[1
]}

Yayan, Ugur ^{[2
]}

Yazici, Ahmet ^{[1
]}

机构：

[1] Univ Eskisehir Osmangazi, Comp Engn Dept, TR-26040 Eskisehir, Turkiye

[2] Univ Eskisehir Osmangazi, Software Engn Dept, TR-26040 Eskisehir, Turkiye

来源：

IEEE ACCESS | 2024年 / 12卷

关键词：

Testing; Artificial neural networks; Robustness; Software testing; Long short term memory; Accuracy; Transformers; Predictive models; Libraries; Convolutional neural networks; Reinforcement learning; Convolutional neural network; deep neural networks; long short-term memory; machine learning; mutation-based testing; reinforcement learning; transformers;

D O I：

10.1109/ACCESS.2024.3482114

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Deep Neural Networks (DNNs) are used in many critical areas, such as autonomous vehicles, generative AI systems, etc. Therefore, testing DNNs is vital, especially for models used in critical areas. Mutation-based testing is a very successful technique for testing DNNs by mutating their complex structures. Deep Mutation Module was developed to address mutation-based testing and the robustness challenges of DNNs. It analyses the structures of DNNs in detail. It tests models by applying mutation to parameters and structures using its fault library. Testing DNN structures and detecting faults is a highly complex and open-ended challenge. The method proposed in this study applies mutations to DNN parameters to expose faults and weaknesses in the models, thereby testing their robustness. The paper focuses on mutation-based tests of an Reinforce Learning (RL) model developed for electric vehicle routing, a Long Short-Term Memory (LSTM) model developed for prognostic predictions, and a Transformer-based neural network model for electric vehicle routing tasks. The best mutation scores for the LSTM model were measured as 96%, 91.02%, 71.19%, and 68.77%. The test results for the RL model resulted in mutation scores of 93.20%, 72.13%, 77.47%, 79.28%, and 55.74%. The mutation scores of the Transformer model were 75.87%, 76.36%, and 74.93%. These results show that the module can successfully test the targeted models and generate mutants classified as "survived mutants" that outperform the original models. In this way, it provides critical information to researchers to improve the overall performance of the models. Conducting these tests before using them in real-world applications minimizes faults and maximizes model success.

引用

页码：160156 / 160174

页数：19

共 50 条

[1] Mutation-based Fault Localization of Deep Neural Networks
Ghanbari, Ali
Thomas, Deepak-George
Arshad, Muhammad Arbab
Rajan, Hridesh
2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1301 - 1313
[2] A White-Box Testing for Deep Neural Networks Based on Neuron Coverage
Yu, Jing
Duan, Shukai
Ye, Xiaojun
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 9185 - 9197
[3] MBEANN: Mutation-based evolving artificial neural networks
Ohkura, Kazuhiro
Yasuda, Toshiyuki
Kawamatsu, Yuichi
Matsumura, Yoshiyuki
Ueda, Kanji
ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 936 - +
[4] Test4Deep: an Effective White-box Testing for Deep Neural Networks
Yu, Jing
Fu, Yao
Zheng, Yanan
Zheng, Wang
Ye, Xiaojun
2019 22ND IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (IEEE CSE 2019) AND 17TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (IEEE EUC 2019), 2019, : 16 - 23
[5] Mutation-Based Deep Learning Framework Testing Method in JavaScript Environment
Zou, Yinglong
Liu, Jiawei
Zhai, Juan
Zheng, Tao
Fang, Chunrong
Chen, Zhenyu
arXiv,
[6] Investigating fault injection techniques in hardware-based deep neural networks and mutation-based fault localization
Le Traon, Yves
Xie, Tao
SOFTWARE TESTING VERIFICATION & RELIABILITY, 2024, 34 (04):
[7] Black-Box Testing of Deep Neural Networks
Byun, Taejoon
Rayadurgam, Sanjai
Heimdahl, Mats P. E.
2021 IEEE 32ND INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE 2021), 2021, : 309 - 320
[8] GenMuNN: A mutation-based approach to repair deep neural network models
Wu, Huanhuan
Li, Zheng
Cui, Zhanqi
Liu, Jianbin
INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2022, 13 (02)
[9] Mutation-based genetic neural network
Palmes, PP
Hayasaka, T
Usui, S
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (03): : 587 - 600
[10] GraphPrior: Mutation-based Test Input Prioritization for Graph Neural Networks
Dang, Xueqi
Li, Yinghua
Papadakis, Mike
Klein, Jacques
Bissyande, Tegawende F.
Le Traon, Yves
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (01)

← 1 2 3 4 5 →