Addressing challenges in radiomics research: systematic review and repository of open-access cancer imaging datasets

被引:8
|
作者
Woznicki, Piotr [1 ]
Laqua, Fabian Christopher [1 ]
Al-Haj, Adam [2 ]
Bley, Thorsten [1 ]
Baessler, Bettina [1 ]
机构
[1] Univ Hosp Wurzburg, Dept Diagnost & Intervent Radiol, Wurzburg, Germany
[2] Med Univ Warsaw, Fac Med, Warsaw, Poland
关键词
Radiomics; Radiology; Cancer imaging; Machine learning; Reproducibility of results; BIOMARKERS; TEXTURE; MODEL; HEAD;
D O I
10.1186/s13244-023-01556-w
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
ObjectivesOpen-access cancer imaging datasets have become integral for evaluating novel AI approaches in radiology. However, their use in quantitative analysis with radiomics features presents unique challenges, such as incomplete documentation, low visibility, non-uniform data formats, data inhomogeneity, and complex preprocessing. These issues may cause problems with reproducibility and standardization in radiomics studies.MethodsWe systematically reviewed imaging datasets with public copyright licenses, published up to March 2023 across four large online cancer imaging archives. We included only datasets with tomographic images (CT, MRI, or PET), segmentations, and clinical annotations, specifically identifying those suitable for radiomics research. Reproducible preprocessing and feature extraction were performed for each dataset to enable their easy reuse.ResultsWe discovered 29 datasets with corresponding segmentations and labels in the form of health outcomes, tumor pathology, staging, imaging-based scores, genetic markers, or repeated imaging. We compiled a repository encompassing 10,354 patients and 49,515 scans. Of the 29 datasets, 15 were licensed under Creative Commons licenses, allowing both non-commercial and commercial usage and redistribution, while others featured custom or restricted licenses. Studies spanned from the early 1990s to 2021, with the majority concluding after 2013. Seven different formats were used for the imaging data. Preprocessing and feature extraction were successfully performed for each dataset.ConclusionRadiomicsHub is a comprehensive public repository with radiomics features derived from a systematic review of public cancer imaging datasets. By converting all datasets to a standardized format and ensuring reproducible and traceable processing, RadiomicsHub addresses key reproducibility and standardization challenges in radiomics.Critical relevance statementThis study critically addresses the challenges associated with locating, preprocessing, and extracting quantitative features from open-access datasets, to facilitate more robust and reliable evaluations of radiomics models.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction. Key points- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.- Most datasets can be shared, used, and built upon freely under a Creative Commons license.- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] AI-Powered Diagnosis of Skin Cancer: A Contemporary Review, Open Challenges and Future Research Directions
    Melarkode, Navneet
    Srinivasan, Kathiravan
    Qaisar, Saeed Mian
    Plawiak, Pawel
    CANCERS, 2023, 15 (04)
  • [42] Addressing political, economic, administrative, regulatory, logistical, ethical, and social challenges to clinical research responses to emerging epidemics and pandemics: a systematic review
    Sigfrid, Louise
    Bannister, Peter G.
    Maskell, Katherine
    Regmi, Sadie
    Collinson, Shelui
    Blackmore, Claire
    Ismail, Sharif A.
    Harriss, Eli
    Longuere, Kajsa-Stina
    Gobat, Nina
    Clarke, Mike
    Carson, Gail
    LANCET, 2019, 394 : 86 - 86
  • [43] A Systematic Review of Reliability Studies on Composite Power Systems: A Coherent Taxonomy Motivations, Open Challenges, Recommendations, and New Research Directions
    Abunima, Hamza
    Teh, Jiashen
    Lai, Ching-Ming
    Jabir, Hussein Jumma
    ENERGIES, 2018, 11 (09)
  • [44] Risk-adapted screening in bladder cancer using the open-access Internet-based questionnaire RiskCheck bladder cancer: First evaluation by the health services research foundation IQUO, Germany
    Luedecke, Gerson
    Geiges, Goetz
    JOURNAL OF CLINICAL ONCOLOGY, 2012, 30 (05)
  • [46] RETRACTED: The Rise of Cloud Computing: Data Protection, Privacy, and Open Research Challenges-A Systematic Literature Review (SLR) (Retracted Article)
    Hassan, Junaid
    Shehzad, Danish
    Habib, Usman
    Aftab, Muhammad Umar
    Ahmad, Muhammad
    Kuleev, Ramil
    Mazzara, Manuel
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [47] RETRACTED: Interoperability Requirements for Blockchain-Enabled Electronic Health Records in Healthcare: A Systematic Review and Open Research Challenges (Retracted Article)
    Reegu, Faheem Ahmad
    Abas, Hafiza
    Jabbari, Abdoh
    Akmam, Rudzidatul
    Uddin, Mueen
    Wu, Chih-Ming
    Chen, Chin-Ling
    Khalaf, Osamah Ibrahim
    SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [48] Systematic review of research design and reporting of imaging studies applying convolutional neural networks for radiological cancer diagnosis
    O'Shea, Robert J.
    Sharkey, Amy Rose
    Cook, Gary J. R.
    Goh, Vicky
    EUROPEAN RADIOLOGY, 2021, 31 (10) : 7969 - 7983
  • [49] Systematic review of research design and reporting of imaging studies applying convolutional neural networks for radiological cancer diagnosis
    Robert J. O’Shea
    Amy Rose Sharkey
    Gary J. R. Cook
    Vicky Goh
    European Radiology, 2021, 31 : 7969 - 7983
  • [50] A systematic review reporting quality of radiomics research in neuro-oncology: toward clinical utility and quality improvement using high-dimensional imaging features
    Ji Eun Park
    Ho Sung Kim
    Donghyun Kim
    Seo Young Park
    Jung Youn Kim
    Se Jin Cho
    Jeong Hoon Kim
    BMC Cancer, 20