Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta

被引:7
|
作者
Hamidi, Farzaneh [1 ]
Gilani, Neda [1 ,2 ]
Arabi Belaghi, Reza [3 ,4 ,5 ]
Yaghoobi, Hanif [6 ]
Babaei, Esmaeil [6 ,7 ]
Sarbakhsh, Parvin [1 ]
Malakouti, Jamileh [8 ]
机构
[1] Tabriz Univ Med Sci, Fac Hlth, Dept Stat & Epidemiol, Tabriz, Iran
[2] Tabriz Univ Med Sci, Rd Traff Injury Res Ctr, Tabriz, Iran
[3] Uppsala Univ, Dept Math Appl Math & Stat, Uppsala, Sweden
[4] Univ Tabriz, Fac Math Sci, Dept Stat, Tabriz, Iran
[5] Swedish Agr Univ, Dept Energy & Technol, Uppsala, Sweden
[6] Univ Tabriz, Sch Nat Sci, Dept Biol Sci, Tabriz, Iran
[7] Univ Tubingen, Interfac Inst Bioinformat & Med Informat IBMI, Tubingen, Germany
[8] Tabriz Univ Med Sci, Fac Nursing & Midwifery, Dept Midwifery, Tabriz, Iran
来源
关键词
artificial intelligence; Boruta; biomarker; feature selection; Gene Expression Omnibus; ovarian cancer; oncology; MICRORNA SIGNATURES; EXOSOMAL MIR-1290; EXPRESSION; CLASSIFICATION; RESISTANCE; PROGNOSIS; SELECTION; TUMOR; SERUM;
D O I
10.3389/fdgth.2023.1187578
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
IntroductionIn gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding RNA molecule that has been linked to human cancers. Specifying diagnostic biomarkers to determine non-cancer and cancer samples is difficult. MethodsBy using Boruta, a novel random forest-based feature selection in the machine-learning techniques, we aimed to identify biomarkers associated with ovarian cancer using cancerous and non-cancer samples from the Gene Expression Omnibus (GEO) database: GSE106817. In this study, we used two independent GEO data sets as external validation, including GSE113486 and GSE113740. We utilized five state-of-the-art machine-learning algorithms for classification: logistic regression, random forest, decision trees, artificial neural networks, and XGBoost. ResultsFour models discovered in GSE113486 had an AUC of 100%, three in GSE113740 with AUC of over 94%, and four in GSE113486 with AUC of over 94%. We identified 10 miRNAs to distinguish ovarian cancer cases from normal controls: hsa-miR-1290, hsa-miR-1233-5p, hsa-miR-1914-5p, hsa-miR-1469, hsa-miR-4675, hsa-miR-1228-5p, hsa-miR-3184-5p, hsa-miR-6784-5p, hsa-miR-6800-5p, and hsa-miR-5100. Our findings suggest that miRNAs could be used as possible biomarkers for ovarian cancer screening, for possible intervention.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Identification of Circulating Diagnostic Biomarkers for Coronary Microvascular Disease in Postmenopausal Women Using Machine-Learning Techniques
    Eve, Alicia Arredondo
    Tunc, Elif
    Liu, Yu-Jeh
    Agrawal, Saumya
    Yilmaz, Huriye Erbak
    Emren, Sadik Volkan
    Akcay, Filiz Akyildiz
    Mainzer, Luidmila
    Zurauskien, Justina
    Erdogan, Zeynep Madak
    METABOLITES, 2021, 11 (06)
  • [22] Prediction of Cancer Disease using Machine learning Approach
    Shaikh, F. J.
    Rao, D. S.
    MATERIALS TODAY-PROCEEDINGS, 2022, 50 : 40 - 47
  • [23] Machine learning models for predicting prostate cancer recurrence and identifying potential molecular biomarkers
    Antunes, Maria Eliza
    Araujo, Thaise Goncalves
    Till, Tatiana Martins
    Pantaleao, Eliana
    Mancera, Paulo F. A.
    de Oliveira, Marta Helena
    FRONTIERS IN ONCOLOGY, 2025, 15
  • [24] Validation of miRNA signatures for ovarian cancer earlier detection in the pre-diagnosis setting using machine learning approaches
    Stawiski, Konrad
    Fortner, Renee T.
    Pestarino, Luca
    Umu, Sinan U.
    Kaaks, Rudolf
    Rounge, Trine B.
    Elias, Kevin M.
    Fendler, Wojciech
    Langseth, Hilde
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [25] Time to diagnosis analysis using miRNA-based ovarian cancer prediction models
    Webber, James
    Wollborn, Laura
    Mishra, Sudhanshu
    Alimena, Stephanie
    Testino, Bryanna
    Vitonis, Allison
    Cramer, Daniel
    Chowdhury, Dipanjan
    Elias, Kevin
    CANCER RESEARCH, 2024, 84 (06)
  • [26] A Network-based Machine Learning Approach for Identifying Biomarkers of Breast Cancer Survivability
    Pham, Huy Q.
    Guba, Jurko
    Gawanmeh, Mousa
    Porter, Lisa A.
    Ngom, Alioune
    ACM-BCB'19: PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, 2019, : 639 - 644
  • [27] Identifying Mitotic Kinesins as Potential Prognostic Biomarkers in Ovarian Cancer Using Bioinformatic Analyses
    Liu, Hailun
    Chen, Chen
    Fehm, Tanja
    Cheng, Zhongping
    Neubauer, Hans
    DIAGNOSTICS, 2022, 12 (02)
  • [28] Crime analysis and prediction using machine-learning approach in the case of Hossana Police Commission
    Wubineh, Betelhem Zewdu
    SECURITY JOURNAL, 2024, 37 (04) : 1269 - 1284
  • [29] Scale-Prediction/Inhibition Design Using Machine-Learning Techniques and Probabilistic Approach
    Al-Hajri, Nasser M.
    Al-Ghamdi, Abdullah
    Tariq, Zeeshan
    Mahmoud, Mohamed
    SPE PRODUCTION & OPERATIONS, 2020, 35 (04): : 987 - 1009
  • [30] A machine-learning approach to setting optimal thresholds and its application in rolling bearing fault diagnosis
    Tang, Yao-Chi
    Li, Kuo-Hao
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2023, 4 (04):