Exploring synthetic datasets for computer-aided detection: a case study using phantom scan data for enhanced lung nodule false positive reduction

被引:0
|
作者
Farhangi, Mohammad Mehdi [1 ]
Maynord, Michael [1 ,2 ]
Fermuller, Cornelia [2 ]
Aloimonos, Yiannis [2 ]
Sahiner, Berkman [1 ]
Petrick, Nicholas [1 ]
机构
[1] US FDA, Div Imaging Diagnost & Software Reliabil, CDRH, OSEL, Silver Spring, MD 20993 USA
[2] Univ Maryland, Iribe Ctr Comp Sci & Engn, Comp Sci Dept, College Pk, MD USA
关键词
physical phantom; lung nodule detection; image transformation; CT scan; semi-supervised learning; AUTOMATIC DETECTION; PULMONARY NODULES; CT; VALIDATION; RESOURCE; IMAGES;
D O I
10.1117/1.JMI.11.4.044507
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Purpose: Synthetic datasets hold the potential to offer cost-effective alternatives to clinical data, ensuring privacy protections and potentially addressing biases in clinical data. We present a method leveraging such datasets to train a machine learning algorithm applied as part of a computer-aided detection (CADe) system. Approach: Our proposed approach utilizes clinically acquired computed tomography (CT) scans of a physical anthropomorphic phantom into which manufactured lesions were inserted to train a machine learning algorithm. We treated the training database obtained from the anthropomorphic phantom as a simplified representation of clinical data and increased the variability in this dataset using a set of randomized and parameterized augmentations. Furthermore, to mitigate the inherent differences between phantom and clinical datasets, we investigated adding unlabeled clinical data into the training pipeline. Results: We apply our proposed method to the false positive reduction stage of a lung nodule CADe system in CT scans, in which regions of interest containing potential lesions are classified as nodule or non-nodule regions. Experimental results demonstrate the effectiveness of the proposed method; the system trained on labeled data from physical phantom scans and unlabeled clinical data achieves a sensitivity of 90% at eight false positives per scan. Furthermore, the experimental results demonstrate the benefit of the physical phantom in which the performance in terms of competitive performance metric increased by 6% when a training set consisting of 50 clinical CT scans was enlarged by the scans obtained from the physical phantom. Conclusions: The scalability of synthetic datasets can lead to improved CADe performance, particularly in scenarios in which the size of the labeled clinical data is limited or subject to inherent bias. Our proposed approach demonstrates an effective utilization of synthetic datasets for training machine learning algorithms.
引用
收藏
页数:14
相关论文
共 40 条
  • [21] Computer-aided detection of lung nodules on multidetector row computed tomography using three-dimensional analysis of nodule candidates and their surroundings
    Matsumoto, Sumiaki
    Ohno, Yoshiharu
    Yamagata, Hitoshi
    Takenaka, Daisuke
    Sugimura, Kazuro
    RADIATION MEDICINE, 2008, 26 (09): : 562 - 569
  • [22] Computer-aided detection of lung nodules on multidetector row computed tomography using three-dimensional analysis of nodule candidates and their surroundings
    Sumiaki Matsumoto
    Yoshiharu Ohno
    Hitoshi Yamagata
    Daisuke Takenaka
    Kazuro Sugimura
    Radiation Medicine, 2008, 26 : 562 - 569
  • [23] AN EFFICIENT MULTI-SCALE DATA REPRESENTATION METHOD FOR LUNG NODULE FALSE POSITIVE REDUCTION USING CONVOLUTIONAL NEURAL NETWORKS
    Augusto, Dario
    Oliveira, Borges
    Viana, Matheus Palhares
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 269 - 272
  • [24] A comparative study of database reduction methods for case-based computer-aided detection systems: preliminary results
    Mazurowski, Maciej A.
    Malof, Jordan M.
    Zurada, Jacek M.
    Tourassi, Georgia D.
    MEDICAL IMAGING 2009: COMPUTER-AIDED DIAGNOSIS, 2009, 7260
  • [25] Performance comparison between two computer-aided detection colonoscopy models by trainees using different false positive thresholds: a cross-sectional study in Thailand
    Tiankanon, Kasenee
    Karuehardsuwan, Julalak
    Aniwan, Satimai
    Mekaroonkamol, Parit
    Sunthornwechapong, Panukorn
    Navadurong, Huttakan
    Tantitanawat, Kittithat
    Mekritthikrai, Krittaya
    Samutrangsi, Salin
    Vateekul, Peerapon
    Rerknimitr, Rungsun
    CLINICAL ENDOSCOPY, 2024, 57 (02) : 217 - 225
  • [26] A Dynamic Probabilistic Model for Heterogeneous Data Fusion: A Pilot Case Study from Computer-Aided Detection of Depression
    Vitale, Federica
    Carbonaro, Bruno
    Esposito, Anna
    BRAIN SCIENCES, 2023, 13 (09)
  • [27] Computer aided detection of breast masses on full-field digital mammograms: false positive reduction using gradient field analysis
    Wei, J
    Sahiner, B
    Hadjiiski, LM
    Chan, HP
    MEDICAL IMAGING 2004: IMAGE PROCESSING, PTS 1-3, 2004, 5370 : 992 - 998
  • [28] IMAGE FEATURE ANALYSIS AND COMPUTER-AIDED DIAGNOSIS IN MAMMOGRAPHY - REDUCTION OF FALSE-POSITIVE CLUSTERED MICROCALCIFICATIONS USING LOCAL EDGE-GRADIENT ANALYSIS
    EMA, T
    DOI, K
    NISHIKAWA, RM
    JIANG, YL
    PAPAIOANNOU, J
    MEDICAL PHYSICS, 1995, 22 (02) : 161 - 169
  • [29] Principal-Component Massive-Training Machine-Learning Regression for False-Positive Reduction in Computer-Aided Detection of Polyps in CT Colonography
    Suzuki, Kenji
    Xu, Jianwu
    Zhang, Jun
    Sheu, Ivan
    MACHINE LEARNING IN MEDICAL IMAGING, 2010, 6357 : 182 - 189
  • [30] Multiresolution local binary pattern texture analysis combined with variable selection for application to false-positive reduction in computer-aided detection of breast masses on mammograms
    Choi, Jae Young
    Ro, Yong Man
    PHYSICS IN MEDICINE AND BIOLOGY, 2012, 57 (21): : 7029 - 7052