Machine-learning diagnostics of breast cancer using piRNA biomarkers

被引:0
|
作者
Zhao, Amy R. [1 ]
Kouznetsova, Valentina L. [2 ,3 ,4 ]
Kesari, Santosh [5 ]
Tsigelny, Igor F. [2 ,3 ,4 ,6 ]
机构
[1] CureSci Inst, Scholars Program, San Diego, CA USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA USA
[3] BIAna Inst, San Diego, CA USA
[4] CureScience Inst, San Diego, CA USA
[5] Pacific Neurosci Inst, Dept Neurooncol, Santa Monica, CA USA
[6] Univ Calif San Diego, Dept Neurosci, La Jolla, CA USA
关键词
Biomarkers; breast cancer; blood-based piRNAs; circulating piRNAs; machine learning; PIWI-INTERACTING RNA; BIOGENESIS; EXPRESSION; HALLMARKS; PROTEINS; ELEMENTS;
D O I
10.1080/1354750X.2025.2461067
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background and objectivesPrior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. Recently, a newly discovered class of small ncRNAs known as PIWI-interacting RNAs (piRNAs) have been found to play a vital role in physiological processes and cancer initiation. This study aims to utilize piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and its application in diagnostics.MethodsWe created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we found a path to convert non-standard piRNA names to standard ones to enable precise identification of these sequences. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a dataset of piRNA to assess the predictive accuracy of the following classifiers: Logistic Regression model, Sequential Minimal Optimization (SMO), Random Forest classifier, and Logistic Model Tree (LMT). Furthermore, we performed Shapley additive explanations (SHAP) Analysis to understand which descriptors were the most relevant to the prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.ResultsThe top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.ConclusionsOur study demonstrates the effectiveness of using ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and further assess the clinical applicability of this approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] BC-Net: Early Diagnostics of Breast Cancer Using Nested Ensemble Technique of Machine Learning
    Sourabh Kuljeet Singh
    Sachin Shastri
    Vibhakar Kumar
    Automatic Control and Computer Sciences, 2023, 57 : 646 - 659
  • [32] BC-Net: Early Diagnostics of Breast Cancer Using Nested Ensemble Technique of Machine Learning
    Singh, Kuljeet
    Shastri, Sourabh
    Kumar, Sachin
    Mansotra, Vibhakar
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2023, 57 (06) : 646 - 659
  • [33] Machine learning using convolutional neural networks for SERS analysis of biomarkers in medical diagnostics
    Li, Joy Qiaoyi
    Dukes, Priya Vohra
    Lee, Walter
    Sarkis, Michael
    Vo-Dinh, Tuan
    JOURNAL OF RAMAN SPECTROSCOPY, 2022, 53 (12) : 2044 - 2057
  • [34] Staging of colorectal cancer using lipid biomarkers and machine learning
    Sanduru Thamarai Krishnan
    David Winkler
    Darren Creek
    Dovile Anderson
    Chandra Kirana
    Guy J Maddern
    Kevin Fenix
    Ehud Hauben
    David Rudd
    Nicolas Hans Voelcker
    Metabolomics, 19
  • [35] Identification of thyroid cancer biomarkers using WGCNA and machine learning
    Hu, Gaofeng
    Niu, Wenyuan
    Ge, Jiaming
    Xuan, Jie
    Liu, Yanyang
    Li, Mengjia
    Shen, Huize
    Ma, Shang
    Li, Yuanqiang
    Li, Qinglin
    EUROPEAN JOURNAL OF MEDICAL RESEARCH, 2025, 30 (01)
  • [36] Staging of colorectal cancer using lipid biomarkers and machine learning
    Krishnan, Sanduru Thamarai
    Winkler, David
    Creek, Darren
    Anderson, Dovile
    Kirana, Chandra
    Maddern, Guy J.
    Fenix, Kevin
    Hauben, Ehud
    Rudd, David
    Voelcker, Nicolas Hans
    METABOLOMICS, 2023, 19 (10)
  • [37] Discovery pipeline of cancer transcriptomic biomarkers using machine learning
    Kim, Eileen
    CANCER RESEARCH, 2019, 79 (13)
  • [38] Predicting clinical response to everolimus in ER plus breast cancers using machine-learning
    Nath, Aritro
    Cosgrove, Patrick A.
    Chang, Jeffrey T.
    Bild, Andrea H.
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [39] Groundwater Prediction Using Machine-Learning Tools
    Hussein, Eslam A.
    Thron, Christopher
    Ghaziasgar, Mehrdad
    Bagula, Antoine
    Vaccari, Mattia
    ALGORITHMS, 2020, 13 (11)
  • [40] New Directions in Diagnostics for Aortic Aneurysms: Biomarkers and Machine Learning
    Alexander, Kyle C.
    Ikonomidis, John S.
    Akerman, Adam W.
    JOURNAL OF CLINICAL MEDICINE, 2024, 13 (03)