Machine-learning diagnostics of breast cancer using piRNA biomarkers

Authors
Zhao, Amy R. [1]
Kouznetsova, Valentina L. [2, 3, 4]
Kesari, Santosh [5]
Tsigelny, Igor F. [2, 3, 4, 6]
Affiliations
[1] CureSci Inst, Scholars Program, San Diego, CA USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA USA
[3] BIAna Inst, San Diego, CA USA
[4] CureScience Inst, San Diego, CA USA
[5] Pacific Neurosci Inst, Dept Neurooncol, Santa Monica, CA USA
[6] Univ Calif San Diego, Dept Neurosci, La Jolla, CA USA
Keywords
Biomarkers; breast cancer; blood-based piRNAs; circulating piRNAs; machine learning; PIWI-INTERACTING RNA; BIOGENESIS; EXPRESSION; HALLMARKS; PROTEINS; ELEMENTS;
DOI
10.1080/1354750X.2025.2461067
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Background and objectives: Prior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. Recently, a newly discovered class of sncRNAs known as PIWI-interacting RNAs (piRNAs) has been found to play a vital role in physiological processes and cancer initiation. This study aims to use piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and to apply them in diagnostics.
Methods: We created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we found a way to convert non-standard piRNA names to standard ones so that these sequences could be identified precisely. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a dataset of piRNAs to assess the predictive accuracy of the following classifiers: Logistic Regression, Sequential Minimal Optimization (SMO), Random Forest, and Logistic Model Tree (LMT). Furthermore, we performed Shapley additive explanations (SHAP) analysis to determine which descriptors were most relevant to prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.
Results: The top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.
Conclusions: Our study demonstrates the effectiveness of ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and to further assess the clinical applicability of this approach.
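The abstract does not list the sequence descriptors the authors used. As a rough illustration of what "descriptors extracted from the piRNA sequences" could look like, the following minimal sketch computes length, nucleotide composition, GC content, and dinucleotide frequencies for a single sequence. The function name `sequence_descriptors`, the feature names, and the choice of features are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter
from itertools import product

NUCLEOTIDES = "ACGU"


def sequence_descriptors(seq: str) -> dict[str, float]:
    """Compute simple composition descriptors for one RNA sequence.

    Hypothetical feature set for illustration only; the paper's actual
    descriptors are not given in the abstract.
    """
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    if not seq or set(seq) - set(NUCLEOTIDES):
        raise ValueError(f"empty sequence or unexpected characters: {seq!r}")

    n = len(seq)
    counts = Counter(seq)

    desc = {"length": float(n)}
    # Single-nucleotide frequencies
    for nt in NUCLEOTIDES:
        desc[f"freq_{nt}"] = counts[nt] / n
    desc["gc_content"] = (counts["G"] + counts["C"]) / n

    # Dinucleotide frequencies over overlapping windows of width 2
    pairs = Counter(seq[i:i + 2] for i in range(n - 1))
    total_pairs = max(n - 1, 1)  # avoid division by zero for length-1 input
    for a, b in product(NUCLEOTIDES, repeat=2):
        desc[f"freq_{a}{b}"] = pairs[a + b] / total_pairs

    return desc
```

Calling `sequence_descriptors` on each sequence in a dataset yields one fixed-length feature row per piRNA, which could then be exported (for example as CSV or ARFF) for classifiers such as those the abstract names in WEKA.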
Pages: 11
Related papers
50 in total
  • [1] piRNA in Machine-Learning-Based Diagnostics of Colorectal Cancer
    Li, Sienna
    Kouznetsova, Valentina L.
    Kesari, Santosh
    Tsigelny, Igor F.
    MOLECULES, 2024, 29 (18)
  • [2] Studying Combined Breast Cancer biomarkers using Machine Learning techniques
    Saleh, Dina T.
    Attia, Amir
    Shaker, Olfat
    2016 IEEE 14TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI), 2016 : 247 - 251
  • [3] Radiomic Machine-Learning Classifiers for Prognostic Biomarkers of Head and Neck Cancer
    Parmar, Chintan
    Grossmann, Patrick
    Rietveld, Derek
    Rietbergen, Michelle M.
    Lambin, Philippe
    Aerts, Hugo J. W. L.
    FRONTIERS IN ONCOLOGY, 2015, 5
  • [4] Decoding the Role of Epigenetics in Breast Cancer Using Formal Modeling and Machine-Learning Methods
    Asim, Ayesha
    Kiani, Yusra Sajid
    Saeed, Muhammad Tariq
    Jabeen, Ishrat
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [5] Evaluation of biomarkers of exposure and effects of mercury using machine-learning methods
    Gibicar, Darija
    Kocev, Dragi
    Zenko, Bernard
    Horvat, Milena
    Dzeroski, Saso
    Fajon, Vesna
    Mazzola, Barbara
    TOXICOLOGY LETTERS, 2006, 164 : S13 - S14
  • [6] Ranking Breast Cancer Drugs and Biomarkers Identification Using Machine Learning and Pharmacogenomics
    Mehmood, Aamir
    Nawab, Sadia
    Jin, Yifan
    Hassan, Hesham
    Kaushik, Aman Chandra
    Wei, Dong-Qing
    ACS PHARMACOLOGY & TRANSLATIONAL SCIENCE, 2023 : 399 - 409
  • [7] A study on application of machine-learning on DBI soot diagnostics
    Liu, Dan
    Xuan, Tiemin
    He, Zhixia
    Yao, Mingfa
    Payri, Raul
    FUEL, 2023, 346
  • [8] Alzheimer's Disease Diagnostics Using miRNA Biomarkers and Machine Learning
    Xu, Amy
    Kouznetsova, Valentina L.
    Tsigelny, Igor F.
    JOURNAL OF ALZHEIMERS DISEASE, 2022, 86 (02) : 841 - 859
  • [9] Diagnostics of Thyroid Cancer Using Machine Learning and Metabolomics
    Kuang, Alyssa
    Kouznetsova, Valentina L.
    Kesari, Santosh
    Tsigelny, Igor F.
    METABOLITES, 2024, 14 (01)
  • [10] Validation of miRNAs as Breast Cancer Biomarkers with a Machine Learning Approach
    Rehman, Oneeb
    Zhuang, Hanqi
    Ali, Ali Muhamed
    Ibrahim, Ali
    Li, Zhongwei
    CANCERS, 2019, 11 (03)