Bug characterization in machine learning-based systems

Cited by: 0

Authors:

Mohammad Mehdi Morovati

Amin Nikanjam

Florian Tambon

Foutse Khomh

Zhen Ming (Jack) Jiang

Affiliations:

[1] Polytechnique Montréal, SWAT Lab

[2] York University

Source:

Empirical Software Engineering | 2024 / Vol. 29

Keywords:

Software bug; Software testing; ML-based systems; ML bug; Deep learning; Software maintenance; Empirical study

DOI:

Not available

Abstract:

The rapid growth of Machine Learning (ML) applications in different domains, especially in safety-critical areas, increases the need for reliable ML components, i.e., software components that operate based on ML. Since corrective maintenance, i.e., identifying and resolving system bugs, is a key task in the software development process for delivering reliable software components, it is necessary to investigate the usage of ML components from the software maintenance perspective. Understanding the characteristics of bugs and the maintenance challenges in ML-based systems can help developers of these systems identify where to focus maintenance and testing efforts, by giving insights into the most error-prone components, the most common bugs, etc. In this paper, we investigate the characteristics of bugs in ML-based software systems and the difference between ML and non-ML bugs from the maintenance viewpoint. We extracted 447,948 GitHub repositories that used one of the three most popular ML frameworks, i.e., TensorFlow, Keras, and PyTorch. After multiple filtering steps, we selected the top 300 repositories with the highest number of closed issues. We manually investigated the extracted repositories to exclude non-ML-based systems. Our investigation involved a manual inspection of 386 sampled reported issues in the identified ML-based systems to determine whether they affect ML components or not. Our analysis shows that nearly half of the real issues reported in ML-based systems are ML bugs, indicating that ML components are more error-prone than non-ML components. Next, we thoroughly examined 109 identified ML bugs to identify their root causes and symptoms and to calculate their required fixing time. The results also revealed that ML bugs have significantly different characteristics compared to non-ML bugs in terms of the complexity of bug-fixing (number of commits, changed files, and changed lines of code). Based on our results, fixing ML bugs is more costly than fixing non-ML bugs, and ML components are more error-prone than non-ML components. Hence, paying significant attention to the reliability of the ML components is crucial in ML-based systems. These results deepen the understanding of ML bugs, and we hope that our findings help shed light on opportunities for designing effective tools for testing and debugging ML-based systems.
