Addressing challenges in radiomics research: systematic review and repository of open-access cancer imaging datasets

被引:8
|
作者
Woznicki, Piotr [1 ]
Laqua, Fabian Christopher [1 ]
Al-Haj, Adam [2 ]
Bley, Thorsten [1 ]
Baessler, Bettina [1 ]
机构
[1] Univ Hosp Wurzburg, Dept Diagnost & Intervent Radiol, Wurzburg, Germany
[2] Med Univ Warsaw, Fac Med, Warsaw, Poland
关键词
Radiomics; Radiology; Cancer imaging; Machine learning; Reproducibility of results; BIOMARKERS; TEXTURE; MODEL; HEAD;
D O I
10.1186/s13244-023-01556-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectivesOpen-access cancer imaging datasets have become integral for evaluating novel AI approaches in radiology. However, their use in quantitative analysis with radiomics features presents unique challenges, such as incomplete documentation, low visibility, non-uniform data formats, data inhomogeneity, and complex preprocessing. These issues may cause problems with reproducibility and standardization in radiomics studies.MethodsWe systematically reviewed imaging datasets with public copyright licenses, published up to March 2023 across four large online cancer imaging archives. We included only datasets with tomographic images (CT, MRI, or PET), segmentations, and clinical annotations, specifically identifying those suitable for radiomics research. Reproducible preprocessing and feature extraction were performed for each dataset to enable their easy reuse.ResultsWe discovered 29 datasets with corresponding segmentations and labels in the form of health outcomes, tumor pathology, staging, imaging-based scores, genetic markers, or repeated imaging. We compiled a repository encompassing 10,354 patients and 49,515 scans. Of the 29 datasets, 15 were licensed under Creative Commons licenses, allowing both non-commercial and commercial usage and redistribution, while others featured custom or restricted licenses. Studies spanned from the early 1990s to 2021, with the majority concluding after 2013. Seven different formats were used for the imaging data. Preprocessing and feature extraction were successfully performed for each dataset.ConclusionRadiomicsHub is a comprehensive public repository with radiomics features derived from a systematic review of public cancer imaging datasets. By converting all datasets to a standardized format and ensuring reproducible and traceable processing, RadiomicsHub addresses key reproducibility and standardization challenges in radiomics.Critical relevance statementThis study critically addresses the challenges associated with locating, preprocessing, and extracting quantitative features from open-access datasets, to facilitate more robust and reliable evaluations of radiomics models.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction. Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Challenges to access and provision of palliative care for people who are homeless: a systematic review of qualitative research
    Briony F. Hudson
    Kate Flemming
    Caroline Shulman
    Bridget Candy
    BMC Palliative Care, 15
  • [32] Challenges to access and provision of palliative care for people who are homeless: a systematic review of qualitative research
    Hudson, Briony F.
    Flemming, Kate
    Shulman, Caroline
    Candy, Bridget
    BMC PALLIATIVE CARE, 2016, 15
  • [33] The application of radiomics in cancer imaging with a focus on lung cancer, renal cell carcinoma, gastrointestinal cancer, and head and neck cancer: A systematic review
    Fusco, Roberta
    Granata, Vincenza
    Setola, Sergio Venanzio
    Trovato, Piero
    Galdiero, Roberta
    Raso, Mauro Mattace
    Maio, Francesca
    Porto, Annamaria
    Pariante, Paolo
    Cerciello, Vincenzo
    Sorgente, Eugenio
    Pecori, Biagio
    Castaldo, Mimma
    Izzo, Francesco
    Petrillo, Antonella
    PHYSICA MEDICA-EUROPEAN JOURNAL OF MEDICAL PHYSICS, 2025, 130
  • [34] Deep learning techniques for solar tracking systems: A systematic literature review, research challenges, and open research directions
    Phiri, Musa
    Mulenga, Mwenge
    Zimba, Aaron
    Eke, Christopher Ifeanyi
    SOLAR ENERGY, 2023, 262
  • [35] Challenges of server consolidation in virtualized data centers and open research issues: a systematic literature review
    Abadi, Reza Mohamadi Bahram
    Rahmani, Amir Masoud
    Alizadeh, Sasan Hossein
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (04): : 2876 - 2927
  • [36] Risk-adapted screening in bladder Cancer with the open-access tool RiskCheck Bladder Cancer: Proof of principle by the health service research foundation IQUO
    Geiges, Goetz
    Koenig, Frank
    Luedecke, Gerson
    JOURNAL OF CLINICAL ONCOLOGY, 2013, 31 (15)
  • [37] Radiomics in prostate cancer imaging for a personalized treatment approach - current aspects of methodology and a systematic review on validated studies
    Spohn, Simon K. B.
    Bettermann, Alisa S.
    Bamberg, Fabian
    Benndorf, Matthias
    Mix, Michael
    Nicolay, Nils H.
    Fechter, Tobias
    Hoelscher, Tobias
    Grosu, Radu
    Chiti, Arturo
    Grosu, Anca L.
    Zamboglou, Constantinos
    THERANOSTICS, 2021, 11 (16): : 8027 - 8042
  • [38] Security challenges for routing protocols in mobile ad hoc network: a systematic review and open research issues
    Jose, Mitha Rachel
    Singh, J. Amar Pratap
    INTERNATIONAL JOURNAL OF ELECTRONIC SECURITY AND DIGITAL FORENSICS, 2021, 13 (03) : 268 - 297
  • [39] Correction to: Challenges of server consolidation in virtualized data centers and open research issues: a systematic literature review
    Reza Mohamadi Bahram Abadi
    Amir Masoud Rahmani
    Sasan Hossein Alizadeh
    The Journal of Supercomputing, 2020, 76 : 2928 - 2928
  • [40] Uplink non-orthogonal multiple access in heterogeneous networks: A review of recent advances and open research challenges
    Rehman, Bilal Ur
    Babar, Mohammad Inayatullah
    Ahmad, Arbab Waheed
    Amir, Mohammad
    Habib, Wasim
    Farooq, Muhammad
    Azim, Gamil Abdel
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2022, 18 (10)