Binary Classification Optimisation with AI-Generated Data

被引:0
|
作者
Mazon, Manuel Jesus Cerezo [1 ]
Garcia, Ricardo Moya [1 ]
Garcia, Ekaitz Arriola [1 ]
del Castillo, Miguel Herencia Garcia [1 ]
Iglesias, Guillermo [2 ]
机构
[1] Ainovis, Colquide 6, Madrid 28231, Spain
[2] Univ Politecn Madrid, Madrid, Spain
来源
关键词
Machine Learning; Synthetic Data; GAN; Skin Lesion Classification; Data Augmentation; FID; ISIC; Medical Imaging; AUGMENTATION;
D O I
10.1007/978-3-031-80889-0_15
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In the field of machine learning, obtaining sufficient and high-quality data is a persistent challenge. This report explores the innovative solution of using synthetic data generated from existing datasets to overcome this limitation. By employing synthetic data, we not only increase the quantity of available information but also maintain the integrity and essential characteristics of natural data. This methodology allows the application of conventional data augmentation techniques, ensuring a more robust and efficient learning process. The study is based on a dataset provided by the International Skin Imaging Collaboration (ISIC), consisting of 3,323 cases divided equally between melanomas and Basal Cell Carcinoma (BCC). Using Generative Adversarial Networks (GANs), specifically StyleGAN2 with transfer learning from the Flickr-Faces-HQ (FFHQ) model, synthetic images were generated, expanding the dataset fourfold to a total of 26,584 synthetic records. The quality of the synthetic images was ensured using the Frechet Inception Distance (FID) metric [5], with BCC obtaining 22.2534 and melanomas obtaining 20.4577 according to this metric. Models trained with a hybrid approach using both real and synthetic data showed improved performance metrics (F1 0.71 to 0.79), highlighting the effectiveness of this method in enhancing binary classification tasks in medical imaging. The source code for all the research, along with the generated dataset is publicly available.
引用
收藏
页码:210 / 216
页数:7
相关论文
共 50 条
  • [41] Presentation matters for AI-generated clinical advice
    Marzyeh Ghassemi
    Nature Human Behaviour, 2023, 7 : 1833 - 1835
  • [42] Towards Detection of AI-Generated Texts and Misinformation
    Najee-Ullah, Ahmad
    Landeros, Luis
    Balytskyi, Yaroslav
    Chang, Sang-Yoon
    SOCIO-TECHNICAL ASPECTS IN SECURITY, STAST 2021, 2022, 13176 : 194 - 205
  • [43] Astronomers explore uses for AI-generated images
    Castelvecchi, Davide
    NATURE, 2017, 542 (7639) : 16 - 17
  • [44] Appeal and quality assessment for AI-generated images
    Goering, Steve
    Rao, Rakesh Ramachandra Rao
    Merten, Rasmus
    Raake, Alexander
    2023 15TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE, QOMEX, 2023, : 115 - 118
  • [45] Human heuristics for AI-generated language are flawed
    Jakesch, Maurice
    Hancock, Jeffrey T.
    Naaman, Mor
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (11)
  • [46] Fact-Checking of AI-Generated Reports
    Mahmood, Razi
    Wang, Ge
    Kalra, Mannudeep
    Yan, Pingkun
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2023, PT II, 2024, 14349 : 214 - 223
  • [47] Learning to Evaluate the Artness of AI-Generated Images
    Chen, Junyu
    An, Jie
    Lyu, Hanjia
    Kanan, Christopher
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 10731 - 10740
  • [48] Testing of detection tools for AI-generated text
    Weber-Wulff, Debora
    Anohina-Naumeca, Alla
    Bjelobaba, Sonja
    Foltynek, Tomas
    Guerrero-Dib, Jean
    Popoola, Olumide
    Sigut, Petr
    Waddington, Lorna
    INTERNATIONAL JOURNAL FOR EDUCATIONAL INTEGRITY, 2023, 19 (01)
  • [49] The Nature and Ownership of Copyright for AI-Generated Works
    Tian, Jinyang
    NTUT JOURNAL OF INTELLECTUAL PROPERTY LAW AND MANAGEMENT, 2024, 13 (01):
  • [50] Testing of detection tools for AI-generated text
    Debora Weber-Wulff
    Alla Anohina-Naumeca
    Sonja Bjelobaba
    Tomáš Foltýnek
    Jean Guerrero-Dib
    Olumide Popoola
    Petr Šigut
    Lorna Waddington
    International Journal for Educational Integrity, 19