Addressing challenges in radiomics research: systematic review and repository of open-access cancer imaging datasets

Cited by: 8
Authors
Woznicki, Piotr [1]
Laqua, Fabian Christopher [1]
Al-Haj, Adam [2]
Bley, Thorsten [1]
Baessler, Bettina [1]
Affiliations
[1] Univ Hosp Wurzburg, Dept Diagnost & Intervent Radiol, Wurzburg, Germany
[2] Med Univ Warsaw, Fac Med, Warsaw, Poland
Keywords
Radiomics; Radiology; Cancer imaging; Machine learning; Reproducibility of results; BIOMARKERS; TEXTURE; MODEL; HEAD;
DOI
10.1186/s13244-023-01556-w
Chinese Library Classification
R8 [Special medicine]; R445 [Diagnostic imaging]
Discipline classification codes
1002; 100207; 1009
Abstract
Objectives: Open-access cancer imaging datasets have become integral for evaluating novel AI approaches in radiology. However, their use in quantitative analysis with radiomics features presents unique challenges, such as incomplete documentation, low visibility, non-uniform data formats, data inhomogeneity, and complex preprocessing. These issues may cause problems with reproducibility and standardization in radiomics studies.
Methods: We systematically reviewed imaging datasets with public copyright licenses, published up to March 2023 across four large online cancer imaging archives. We included only datasets with tomographic images (CT, MRI, or PET), segmentations, and clinical annotations, specifically identifying those suitable for radiomics research. Reproducible preprocessing and feature extraction were performed for each dataset to enable their easy reuse.
Results: We discovered 29 datasets with corresponding segmentations and labels in the form of health outcomes, tumor pathology, staging, imaging-based scores, genetic markers, or repeated imaging. We compiled a repository encompassing 10,354 patients and 49,515 scans. Of the 29 datasets, 15 were licensed under Creative Commons licenses, allowing both non-commercial and commercial usage and redistribution, while others featured custom or restricted licenses. Studies spanned from the early 1990s to 2021, with the majority concluding after 2013. Seven different formats were used for the imaging data. Preprocessing and feature extraction were successfully performed for each dataset.
Conclusion: RadiomicsHub is a comprehensive public repository with radiomics features derived from a systematic review of public cancer imaging datasets. By converting all datasets to a standardized format and ensuring reproducible and traceable processing, RadiomicsHub addresses key reproducibility and standardization challenges in radiomics.
Critical relevance statement: This study critically addresses the challenges associated with locating, preprocessing, and extracting quantitative features from open-access datasets, to facilitate more robust and reliable evaluations of radiomics models.
Key points:
- Through a systematic review, we identified 29 cancer imaging datasets suitable for radiomics research.
- A public repository with collection overview and radiomics features, encompassing 10,354 patients and 49,515 scans, was compiled.
- Most datasets can be shared, used, and built upon freely under a Creative Commons license.
- All 29 identified datasets have been converted into a common format to enable reproducible radiomics feature extraction.
Pages: 13