Machine-learning diagnostics of breast cancer using piRNA biomarkers

被引:0
|
作者
Zhao, Amy R. [1 ]
Kouznetsova, Valentina L. [2 ,3 ,4 ]
Kesari, Santosh [5 ]
Tsigelny, Igor F. [2 ,3 ,4 ,6 ]
机构
[1] CureSci Inst, Scholars Program, San Diego, CA USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA USA
[3] BIAna Inst, San Diego, CA USA
[4] CureScience Inst, San Diego, CA USA
[5] Pacific Neurosci Inst, Dept Neurooncol, Santa Monica, CA USA
[6] Univ Calif San Diego, Dept Neurosci, La Jolla, CA USA
关键词
Biomarkers; breast cancer; blood-based piRNAs; circulating piRNAs; machine learning; PIWI-INTERACTING RNA; BIOGENESIS; EXPRESSION; HALLMARKS; PROTEINS; ELEMENTS;
D O I
10.1080/1354750X.2025.2461067
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background and objectivesPrior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. Recently, a newly discovered class of small ncRNAs known as PIWI-interacting RNAs (piRNAs) have been found to play a vital role in physiological processes and cancer initiation. This study aims to utilize piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and its application in diagnostics.MethodsWe created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we found a path to convert non-standard piRNA names to standard ones to enable precise identification of these sequences. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a dataset of piRNA to assess the predictive accuracy of the following classifiers: Logistic Regression model, Sequential Minimal Optimization (SMO), Random Forest classifier, and Logistic Model Tree (LMT). Furthermore, we performed Shapley additive explanations (SHAP) Analysis to understand which descriptors were the most relevant to the prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.ResultsThe top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.ConclusionsOur study demonstrates the effectiveness of using ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and further assess the clinical applicability of this approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Machine-learning to stratify diabetic patients using novel cardiac biomarkers and integrative genomics
    Quincy A. Hathaway
    Skyler M. Roth
    Mark V. Pinti
    Daniel C. Sprando
    Amina Kunovac
    Andrya J. Durr
    Chris C. Cook
    Garrett K. Fink
    Tristen B. Cheuvront
    Jasmine H. Grossman
    Ghadah A. Aljahli
    Andrew D. Taylor
    Andrew P. Giromini
    Jessica L. Allen
    John M. Hollander
    Cardiovascular Diabetology, 18
  • [22] Predicting breast cancer metastasis by using serum biomarkers and clinicopathological data with machine learning technologies
    Tseng, Yi-Ju
    Huang, Chuan-En
    Wen, Chiao-Ni
    Lai, Po-Yin
    Wu, Min-Hsien
    Sun, Yu-Chen
    Wang, Hsin-Yao
    Lu, Jang-Jih
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 128 : 79 - 86
  • [23] Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta
    Hamidi, Farzaneh
    Gilani, Neda
    Arabi Belaghi, Reza
    Yaghoobi, Hanif
    Babaei, Esmaeil
    Sarbakhsh, Parvin
    Malakouti, Jamileh
    FRONTIERS IN DIGITAL HEALTH, 2023, 5
  • [24] Classifying Breast Cancer Using machine learning
    不详
    CURRENT SCIENCE, 2020, 119 (05): : 734 - 735
  • [25] Breast Cancer Detection Using Machine Learning
    Sivasangari, A.
    Ajitha, P.
    Bevishjenila
    Vimali, J. S.
    Jose, Jithina
    Gowri, S.
    MOBILE COMPUTING AND SUSTAINABLE INFORMATICS, 2022, 68 : 693 - 702
  • [26] Preventing dataset shift from breaking machine-learning biomarkers
    Dockes, Jerome
    Varoquaux, Gael
    Poline, Jean-Baptiste
    GIGASCIENCE, 2021, 10 (09):
  • [27] Breast Cancer Identification Using Machine Learning
    Jia, Xiao
    Sun, Xiaolin
    Zhang, Xingang
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022
  • [28] Exploration of biomarkers for systemic lupus erythematosus by machine-learning analysis
    Zhao, Xingyun
    Duan, Lishuang
    Cui, Dawei
    Xie, Jue
    BMC IMMUNOLOGY, 2023, 24 (01)
  • [29] Exploration of biomarkers for systemic lupus erythematosus by machine-learning analysis
    Xingyun Zhao
    Lishuang Duan
    Dawei Cui
    Jue Xie
    BMC Immunology, 24
  • [30] Acute toxicity prediction after breast radiotherapy using machine-learning and spectrophotometry
    Cilla, S.
    Romano, C.
    Macchia, G.
    Boccardi, M.
    Pezzulla, D.
    Buwenge, M.
    Di Castelnuovo, A.
    Bracone, F.
    De Curtis, A.
    Cerletti, C.
    Iacoviello, L.
    Donati, M. B.
    Deodato, F.
    Morganti, A. G.
    RADIOTHERAPY AND ONCOLOGY, 2023, 182 : S1880 - S1881