Semantic Data Augmentation for Deep Learning Testing using Generative AI

被引:1
|
作者
Missaoui, Sondess [1 ]
Gerasimou, Simos [1 ]
Matragkas, Nicholas [2 ]
机构
[1] Univ York, Dept Comp Sci, York, N Yorkshire, England
[2] Univ Paris Saclay, CEA, List, Paris, France
关键词
Generative AI; Deep Learning Testing; Coverage Guided Fuzzing; Data Augmentation; Safe AI;
D O I
10.1109/ASE56229.2023.00194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of state-of-the-art Deep Learning models heavily depends on the availability of well-curated training and testing datasets that sufficiently capture the operational domain. Data augmentation is an effective technique in alleviating data scarcity, reducing the time-consuming and expensive data collection and labelling processes. Despite their potential, existing data augmentation techniques primarily focus on simple geometric and colour space transformations, like noise, flipping and resizing, producing datasets with limited diversity. When the augmented dataset is used for testing the Deep Learning models, the derived results are typically uninformative about the robustness of the models. We address this gap by introducing GENFUZZER, a novel coverage-guided data augmentation fuzzing technique for Deep Learning models underpinned by generative AI. We demonstrate our approach using widely-adopted datasets and models employed for image classification, illustrating its effectiveness in generating informative datasets leading up to a 26% increase in widely-used coverage criteria.
引用
收藏
页码:1694 / 1698
页数:5
相关论文
共 50 条
  • [41] Panel: Using Generative AI in Teaching and Learning
    Sumner, Mary
    Van Slyke, Craig
    Galletta, Dennis F.
    Niederman, Fred
    PROCEEDINGS OF THE 2024 COMPUTERS AND PEOPLE RESEARCH CONFERENCE, SIGMIS-CPR 2024, 2024,
  • [42] Towards enhanced assessment question classification: a study using machine learning, deep learning, and generative AI
    Gani, Mohammed Osman
    Ayyasamy, Ramesh Kumar
    Alhashmi, Saadat M.
    Alam, Khondaker Sajid
    Sangodiah, Anbuselvan
    Khaleduzzman, Khondaker
    Ponnusamy, Chinnasamy
    CONNECTION SCIENCE, 2025, 37 (01)
  • [43] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
    Xiang, Yixiao
    Tian, Lihua
    Li, Chen
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [44] A Domain Adaptive Semantic Segmentation Method Using Contrastive Learning and Data Augmentation
    Yixiao Xiang
    Lihua Tian
    Chen Li
    Neural Processing Letters, 56
  • [46] Using Deep Learning in Semantic Classification for Point Cloud Data
    Yao, Xuanxia
    Guo, Jia
    Hu, Juan
    Cao, Qixuan
    IEEE ACCESS, 2019, 7 : 37121 - 37130
  • [47] Enhancing Text Classification Models with Generative AI-aided Data Augmentation
    Zhao, Huanhuan
    Chen, Haihua
    Yoon, Hong-Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE TESTING, AITEST, 2023, : 138 - 145
  • [48] Seismic data interpolation using deep learning with generative adversarial networks
    Kaur, Harpreet
    Pham, Nam
    Fomel, Sergey
    GEOPHYSICAL PROSPECTING, 2021, 69 (02) : 307 - 326
  • [49] Generative AI-Driven Data Augmentation for Crack Detection in Physical Structures
    Kim, Jinwook
    Seon, Joonho
    Kim, Soohyun
    Sun, Youngghyu
    Lee, Seongwoo
    Kim, Jeongho
    Hwang, Byungsun
    Kim, Jinyoung
    ELECTRONICS, 2024, 13 (19)
  • [50] Data augmentation using conditional generative adversarial network (cGAN): applications for sewer condition classification and testing using different machine learning techniques
    Woldesellasse, Haile
    Tesfamariam, Solomon
    JOURNAL OF HYDROINFORMATICS, 2024, 26 (07) : 1471 - 1489