Semantic Data Augmentation for Deep Learning Testing using Generative AI

被引:1
|
作者
Missaoui, Sondess [1 ]
Gerasimou, Simos [1 ]
Matragkas, Nicholas [2 ]
机构
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Paris Saclay, CEA, List, Paris, France
关键词
Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI;
D O I
10.1109/ASE56229.2023.00194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique in alleviating data scarcity, reducing the time-consuming and expensive data collection and labelling processes. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, like noise, flipping and resizing, producing datasets with limited diversity. When the augmented dataset is used for testing the Deep Learning models, the derived results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets leading up to a 26% increase in widely-used coverage criteria.
引用
收藏
页码:1694 / 1698
页数:5
相关论文
共 50 条
  • [1] Data Augmentation for Sparse Multidimensional Learning Performance Data Using Generative AI
    Zhang, Liang
    Lin, Jionghao
    Sabatini, John
    Borchers, Conrad
    Weitekamp, Daniel
    Cao, Meng
    Hollander, John
    Hu, Xiangen
    Graesser, Arthur C.
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2025, 18 : 145 - 164
  • [2] Data Augmentation for the Femoral Head Using Generative Deep Learning Models
    Won, Joon Hee
    Goh, Tae Sik
    Lee, Jung Sub
    Lim, Hee Chang
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS B, 2025, 49 (02) : 109 - 119
  • [3] Deep Generative Models for Data Synthesis and Augmentation in Machine Learning
    Adavala, Kiran Mayee
    Vhatkar, Sangeeta
    Ruprah, Taranpreet Singh
    Bhatia, Sukhwinder Kaur
    Kumar, Vipin
    Sharma, Dharmendra
    Praveen, B. Shyam
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 1242 - 1249
  • [4] Boosting Deep Reinforcement Learning Agents with Generative Data Augmentation
    Papagiannis, Tasos
    Alexandridis, Georgios
    Stafylopatis, Andreas
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [5] Distributed Raman Spectrum Data Augmentation System Using Federated Learning with Deep Generative Models
    Kim, Yaeran
    Lee, Woonghee
    SENSORS, 2022, 22 (24)
  • [6] AI4AVP: an antiviral peptides predictor in deep learning approach with generative adversarial network data augmentation
    Lin, Tzu-Tang
    Sun, Yih-Yun
    Wang, Ching-Tien
    Cheng, Wen-Chih
    Lu, I-Hsuan
    Lin, Chung-Yen
    Chen, Shu-Hwa
    Mulder, Nicola
    BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [7] Data Augmentation With Semantic Enrichment for Deep Learning Invoice Text Classification
    Chi, Wei Wen
    Tang, Tiong Yew
    Salleh, Narishah Mohamed
    Mukred, Muaadh
    Alsalman, Hussain
    Zohaib, Muhammad
    IEEE ACCESS, 2024, 12 : 57326 - 57344
  • [8] An Explainable Deep Learning-Based Method for Schizophrenia Diagnosis Using Generative Data-Augmentation
    Saadatinia, Mehrshad
    Salimi-Badr, Armin
    IEEE ACCESS, 2024, 12 : 98379 - 98392
  • [9] Music Generation Using Deep Learning and Generative AI: A Systematic Review
    Mitra, Rohan
    Zualkernan, Imran
    IEEE ACCESS, 2025, 13 : 18079 - 18106
  • [10] Geometric Morphometric Data Augmentation Using Generative Computational Learning Algorithms
    Courtenay, Lloyd A.
    Gonzalez-Aguilera, Diego
    APPLIED SCIENCES-BASEL, 2020, 10 (24): : 1 - 25