Gan-based data augmentation to improve breast ultrasound and mammography mass classification

被引:4
|
作者
Jimenez-Gaona, Yuliana [1 ,3 ,5 ]
Carrion-Figueroa, Diana [2 ]
Lakshminarayanan, Vasudevan [3 ,4 ]
Rodriguez-Alvarez, Maria Jose [5 ]
机构
[1] Univ Tecn Particular Loja, Dept Quim & Ciencias Exactas, San Cayetano Alto S-N CP1101608, Loja, Ecuador
[2] Hosp Gen Sur Quito IESS, Calle Moraspungo & Pinllopata, Quito 170111, Ecuador
[3] Univ Waterloo, Sch Optometry & Vis Sci, Theoret & Expt Epistemol Lab, Waterloo, ON N2L 3G1, Canada
[4] Univ Waterloo, Dept Syst Design Engn Phys & Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[5] Univ Politecn Valencia, Inst Instrumentac Imagen Mol I3M, E-46022 Valencia, Spain
关键词
Breast cancer; Data augmentation; Deep learning algorithms; Generative Adversarial Networks (GAN); Mammography; Ultrasound; GENERATIVE ADVERSARIAL NETWORK;
D O I
10.1016/j.bspc.2024.106255
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Data imbalance is a common problem in breast cancer diagnosis, to address this challenge, the research explores the use of Generative Adversarial Networks (GANs) to generate synthetic medical data. Various GAN methods, including Wasserstein GAN with Gradient Penalty (WGAN-GP), Cycle GAN, Conditional GAN, and Spectral Normalization GAN (SNGAN), were tested for data augmentation in breast regions of interest (ROIs) using mammography and ultrasound databases. The study employed real, synthetic, and hybrid ROIs (128x128 pixels) to train a Resnet network for classifying as benign (B) or malignant (M) classes. The quality and diversity of the synthetic data were assessed using several metrics: Fre <acute accent>chet Inception Distance (FID), Kernel Inception Distance (KID), Structural Similarity Index (SSIM), Multi -Scale SSIM (MS-SSIM), Blind Reference Image Spatial Quality Evaluator (BRISQUE), Naturalness Image Quality Evaluator (NIQE), and Perception -based Image Quality Evaluator (PIQE).Results revealed that the SNGAN model (FID = 52.89) was most effective for augmenting mammography data, while CGAN (FID = 116.03) excelled with ultrasound data. Cycle GAN and WGAN-GP, though demonstrating lower KID values, did not perform better than SNGAN and CGAN. The lower average MS-SSIM values suggested that SNGAN and CGAN produced a high diversity of synthetic images. However, lower SSIM, BRISQUE, NIQE, and PIQE values indicated poor quality in both real and synthetic images. Classification results showed high accuracy without data augmentation in both US (93.1 %B/94.9 %M) and mammography (80.9 %B/76.9 %M). The research concludes that preprocessing and characterizing ROIs by abnormality type is crucial to generate diverse synthetic data and improve accuracy in the classification process using combined GANs and CNN models.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] GAN-Based Data Augmentation Technique for Various Transmission Line Fault Data
    Lee, Kyeong-Yeong
    Lim, Se-Heon
    Kim, Tae-Geun
    Song, Kyung-Min
    Yoon, Sung-Guk
    Transactions of the Korean Institute of Electrical Engineers, 2024, 73 (08): : 1318 - 1326
  • [22] GAN-based data augmentation to improve 3D chromatin features identification in Hi-C data
    Li, Chong
    Mohammad, Erfan
    Song, Chen
    Shi, Xinghua
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 297 - 298
  • [23] An improved GAN-based data augmentation model for addressing data scarcity in SRMs
    Yang, Huixin
    Xiang, Zijian
    Li, Xiang
    Zhang, Wei
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (02)
  • [24] A NOVEL GAN-BASED DATA AUGMENTATION ALGORITHM FOR SEMICONDUCTOR DEFECT INSPECTION
    Liu, Yang
    Guan, Yuanjun
    Han, Tianyan
    Ma, Can
    Wang, Jiayi
    Wang, Tao
    Yi, Qianchuan
    Hu, Lilei
    CONFERENCE OF SCIENCE & TECHNOLOGY FOR INTEGRATED CIRCUITS, 2024 CSTIC, 2024,
  • [25] A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem
    Farahanipad, Farnaz
    Rezaei, Mohammad
    Nasr, Mohammad Sadegh
    Kamangar, Farhad
    Athitsos, Vassilis
    TECHNOLOGIES, 2022, 10 (02)
  • [26] On Constructing Vessel Dataset Structure Using GAN-based Data Augmentation
    Oh, Ah Reum
    Lee, Jiwon
    Moon, Sung-Won
    Lee, Jung Soo
    Nam, Do-Won
    Yoo, Wonyoung
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1700 - 1702
  • [27] A new method for GAN-based data augmentation for classes with distinct clusters
    Kuntalp, Mehmet
    Duzyel, Okan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [28] Evolutionary GAN-Based Data Augmentation for Cardiac Magnetic Resonance Image
    Fu, Ying
    Gong, Minxue
    Yang, Guang
    Wei, Hong
    Zhou, Jiliu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 1359 - 1374
  • [29] GAN-Based Synthetic Data Augmentation for Infrared Small Target Detection
    Kim, Jun-Hyung
    Hwang, Youngbae
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [30] GAN-Based Data Augmentation for Prediction Improvement Using Gene Expression Data in Cancer
    Moreno-Barea, Francisco J.
    Jerez, Jose M.
    Franco, Leonardo
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 28 - 42