Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta

被引:7
|
作者
Hamidi, Farzaneh [1 ]
Gilani, Neda [1 ,2 ]
Arabi Belaghi, Reza [3 ,4 ,5 ]
Yaghoobi, Hanif [6 ]
Babaei, Esmaeil [6 ,7 ]
Sarbakhsh, Parvin [1 ]
Malakouti, Jamileh [8 ]
机构
[1] Tabriz Univ Med Sci, Fac Hlth, Dept Stat & Epidemiol, Tabriz, Iran
[2] Tabriz Univ Med Sci, Rd Traff Injury Res Ctr, Tabriz, Iran
[3] Uppsala Univ, Dept Math Appl Math & Stat, Uppsala, Sweden
[4] Univ Tabriz, Fac Math Sci, Dept Stat, Tabriz, Iran
[5] Swedish Agr Univ, Dept Energy & Technol, Uppsala, Sweden
[6] Univ Tabriz, Sch Nat Sci, Dept Biol Sci, Tabriz, Iran
[7] Univ Tubingen, Interfac Inst Bioinformat & Med Informat IBMI, Tubingen, Germany
[8] Tabriz Univ Med Sci, Fac Nursing & Midwifery, Dept Midwifery, Tabriz, Iran
来源
关键词
artificial intelligence; Boruta; biomarker; feature selection; Gene Expression Omnibus; ovarian cancer; oncology; MICRORNA SIGNATURES; EXOSOMAL MIR-1290; EXPRESSION; CLASSIFICATION; RESISTANCE; PROGNOSIS; SELECTION; TUMOR; SERUM;
D O I
10.3389/fdgth.2023.1187578
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
IntroductionIn gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding RNA molecule that has been linked to human cancers. Specifying diagnostic biomarkers to determine non-cancer and cancer samples is difficult. MethodsBy using Boruta, a novel random forest-based feature selection in the machine-learning techniques, we aimed to identify biomarkers associated with ovarian cancer using cancerous and non-cancer samples from the Gene Expression Omnibus (GEO) database: GSE106817. In this study, we used two independent GEO data sets as external validation, including GSE113486 and GSE113740. We utilized five state-of-the-art machine-learning algorithms for classification: logistic regression, random forest, decision trees, artificial neural networks, and XGBoost. ResultsFour models discovered in GSE113486 had an AUC of 100%, three in GSE113740 with AUC of over 94%, and four in GSE113486 with AUC of over 94%. We identified 10 miRNAs to distinguish ovarian cancer cases from normal controls: hsa-miR-1290, hsa-miR-1233-5p, hsa-miR-1914-5p, hsa-miR-1469, hsa-miR-4675, hsa-miR-1228-5p, hsa-miR-3184-5p, hsa-miR-6784-5p, hsa-miR-6800-5p, and hsa-miR-5100. Our findings suggest that miRNAs could be used as possible biomarkers for ovarian cancer screening, for possible intervention.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Explainable machine learning model identified potential biomarkers in liver cancer survival prediction
    Pan, Qi
    Hounye, Alphonse Houssou
    Miao, Kexin
    Su, Liuyan
    Wang, Jiaoju
    Hou, Muzhou
    Xiong, Li
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96
  • [32] Risk Prediction of Pancreatic Cancer in Patients With Recent-onset Hyperglycemia A Machine-learning Approach
    Chen, Wansu
    Butler, Rebecca K.
    Lustigova, Eva
    Chari, Suresh T.
    Maitra, Anirban
    Rinaudo, Jo A.
    Wu, Bechien U.
    JOURNAL OF CLINICAL GASTROENTEROLOGY, 2023, 57 (01) : 103 - 110
  • [33] Identifying Effective Biomarkers for Accurate Pancreatic Cancer Prognosis Using Statistical Machine Learning
    Abu-Khudir, Rasha
    Hafsa, Noor
    Badr, Badr E.
    DIAGNOSTICS, 2023, 13 (19)
  • [34] Proposed approach for breast cancer diagnosis using machine learning
    Saoud, Hajar
    Ghadi, Abderrahim
    Ghailani, Mohamed
    4TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS (SCA' 19), 2019,
  • [35] A survey on multi-omics-based cancer diagnosis using machine learning with the potential application in gastrointestinal cancer
    Wang, Suixue
    Wang, Shuling
    Wang, Zhengxia
    FRONTIERS IN MEDICINE, 2023, 9
  • [36] Machine-learning prediction of cancer survival: a retrospective study using electronic administrative records and a cancer registry
    Gupta, Sunil
    Truyen Tran
    Luo, Wei
    Dinh Phung
    Kennedy, Richard Lee
    Broad, Adam
    Campbell, David
    Kipp, David
    Singh, Madhu
    Khasraw, Mustafa
    Matheson, Leigh
    Ashley, David M.
    Venkatesh, Svetha
    BMJ OPEN, 2014, 4 (03):
  • [37] Identifying potential biomarkers for non-obstructive azoospermia using WGCNA and machine learning algorithms
    Tang, Qizhen
    Su, Quanxin
    Wei, Letian
    Wang, Kenan
    Jiang, Tao
    FRONTIERS IN ENDOCRINOLOGY, 2023, 14
  • [38] Using Machine Learning algorithms for breast cancer risk prediction and diagnosis
    Bharat, Anusha
    Pooja, N.
    Reddy, R. Anishka
    2018 3RD INTERNATIONAL CONFERENCE ON CIRCUITS, CONTROL, COMMUNICATION AND COMPUTING (I4C), 2018,
  • [39] Using Machine Learning Algorithms for Breast Cancer Risk Prediction and Diagnosis
    Asri, Hiba
    Mousannif, Hajar
    Al Moatassime, Hassan
    Noel, Thomas
    7TH INTERNATIONAL CONFERENCE ON AMBIENT SYSTEMS, NETWORKS AND TECHNOLOGIES (ANT 2016) / THE 6TH INTERNATIONAL CONFERENCE ON SUSTAINABLE ENERGY INFORMATION TECHNOLOGY (SEIT-2016) / AFFILIATED WORKSHOPS, 2016, 83 : 1064 - 1069
  • [40] Groundwater fluoride prediction modeling using physicochemical parameters in Punjab, India: a machine-learning approach
    Kerketta, Anjali
    Kapoor, Harmanpreet Singh
    Sahoo, Prafulla Kumar
    FRONTIERS IN SOIL SCIENCE, 2024, 4