Diffusion Model-Based Data Augmentation for Lung Ultrasound Classification with Limited Data

Cited by: 0
Authors
Zhang, Xiaohui [1]
Gangopadhyay, Ahana [2]
Chang, Hsi-Ming [2]
Soni, Ravi [2]
Affiliations
[1] Univ Illinois, Champaign, IL 61820 USA
[2] GE HealthCare, Chicago, IL USA
Keywords
Single Image Denoising Diffusion Model; Synthetic Image Generation; Data Augmentation; Lung Ultrasound Classification; Limited Data; Class Imbalance; DIAGNOSIS;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Deep learning models typically require large quantities of data to generalize well. However, acquiring labeled medical imaging data is expensive, particularly for rare pathologies. While standard data augmentation is routinely performed to improve data variety, it may not be sufficient to improve the performance of downstream tasks with a clinical diagnostic purpose. Here we investigate the applicability of SinDDM (Kulikov et al., 2023), a single-image denoising diffusion model, for medical image data augmentation with lung ultrasound (LUS) images. Qualitative and quantitative evaluations of the perceptual quality of the generated images were conducted. A multi-class classification task to detect various pathologies from LUS images was also employed to demonstrate the effectiveness of synthetic data augmentation using SinDDM. We further evaluated the image generation performance of FewDDM, an extended version of SinDDM trained on a limited number of images instead of a single image. Our results show that both SinDDM and FewDDM generate images of higher quality than single-image generative adversarial networks (GANs), and that both are highly effective in augmenting medical imaging data with a limited number of samples to improve downstream task performance.
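As a rough illustration of how synthetic images might offset class imbalance in a multi-class training set, the sketch below computes how many generated images each class would need to match the largest class. It is not the authors' procedure: the function name `synthetic_quota`, the class names, the counts, and the "match the largest class" rule are all assumptions made for illustration, since the abstract does not state how many synthetic images were added per class.

```python
from collections import Counter


def synthetic_quota(labels, target=None):
    """Return how many synthetic images each class needs to reach `target`.

    `target` defaults to the size of the largest class. This balancing rule
    is an assumption; the source does not specify one.
    """
    counts = Counter(labels)
    if target is None:
        target = max(counts.values())
    # Classes already at or above the target need no synthetic images.
    return {cls: max(0, target - n) for cls, n in counts.items()}


# Hypothetical label distribution, for illustration only.
labels = ["normal"] * 40 + ["b_lines"] * 12 + ["consolidation"] * 5
quota = synthetic_quota(labels)
print(quota)
```

In practice the quota for each class would be filled by sampling from a generative model trained on that class's images, and the classifier would then be trained on the combined real and synthetic set.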
Pages: 664-676
Number of pages: 13