Machine-learning diagnostics of breast cancer using piRNA biomarkers

Authors
Zhao, Amy R. [1]
Kouznetsova, Valentina L. [2,3,4]
Kesari, Santosh [5]
Tsigelny, Igor F. [2,3,4,6]
Affiliations
[1] CureSci Inst, Scholars Program, San Diego, CA USA
[2] Univ Calif San Diego, San Diego Supercomp Ctr, La Jolla, CA USA
[3] BIAna Inst, San Diego, CA USA
[4] CureScience Inst, San Diego, CA USA
[5] Pacific Neurosci Inst, Dept Neurooncol, Santa Monica, CA USA
[6] Univ Calif San Diego, Dept Neurosci, La Jolla, CA USA
Keywords
Biomarkers; breast cancer; blood-based piRNAs; circulating piRNAs; machine learning; PIWI-INTERACTING RNA; BIOGENESIS; EXPRESSION; HALLMARKS; PROTEINS; ELEMENTS;
DOI
10.1080/1354750X.2025.2461067
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background and objectives: Prior studies have shown that small non-coding RNAs (sncRNAs) are associated with cancer occurrence or development. Recently, a newly discovered class of sncRNAs known as PIWI-interacting RNAs (piRNAs) has been found to play a vital role in physiological processes and cancer initiation. This study aims to use piRNAs as innovative, noninvasive diagnostic biomarkers for breast cancer. Our objective is to develop computational methods that leverage piRNA attributes for breast cancer prediction and to apply them in diagnostics.

Methods: We created a set of piRNA sequence descriptors using information extracted from the piRNA sequences. To ensure accuracy, we devised a procedure to convert non-standard piRNA names to standard ones, enabling precise identification of these sequences. Using these descriptors, we applied machine-learning (ML) techniques in WEKA (Waikato Environment for Knowledge Analysis) to a piRNA dataset to assess the predictive accuracy of the following classifiers: Logistic Regression, Sequential Minimal Optimization (SMO), Random Forest, and Logistic Model Tree (LMT). Furthermore, we performed Shapley additive explanations (SHAP) analysis to understand which descriptors were most relevant to the prediction accuracy. The ML models were then validated on an independent dataset to evaluate their effectiveness in predicting breast cancer.

Results: The top three performing classifiers in WEKA were Logistic Regression, SMO, and LMT. The Logistic Regression model achieved an accuracy of 90.7% in predicting breast cancer, while SMO and LMT attained 89.7% and 85.65%, respectively.

Conclusions: Our study demonstrates the effectiveness of using ML-based piRNA classifiers in diagnosing breast cancer and contributes to the growing body of evidence supporting piRNAs as biomarkers in cancer diagnosis. However, additional research is needed to validate these findings and further assess the clinical applicability of this approach.
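
The abstract does not list the sequence descriptors that were used. As a rough illustration only, the sketch below computes a few descriptors commonly derived from short RNA sequences (length, nucleotide and dinucleotide frequencies, GC content). The function name `sequence_descriptors`, the choice of descriptors, and the example sequence are all assumptions made for this sketch, not the authors' actual feature set.

```python
from collections import Counter
from itertools import product

NUCLEOTIDES = "ACGU"


def sequence_descriptors(seq: str) -> dict[str, float]:
    """Compute illustrative (hypothetical) descriptors for one RNA sequence.

    Returns length, per-nucleotide frequencies, GC content, and
    frequencies of all 16 dinucleotides.
    """
    # Normalize case and accept DNA-style input by mapping T to U.
    seq = seq.upper().replace("T", "U")
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")

    mono = Counter(seq)
    # Overlapping dinucleotides: positions (0,1), (1,2), ...
    di = Counter(seq[i:i + 2] for i in range(n - 1))
    n_pairs = max(n - 1, 1)  # avoid division by zero for length-1 input

    desc = {"length": float(n)}
    for base in NUCLEOTIDES:
        desc[f"freq_{base}"] = mono[base] / n
    desc["gc_content"] = (mono["G"] + mono["C"]) / n
    for a, b in product(NUCLEOTIDES, repeat=2):
        desc[f"freq_{a}{b}"] = di[a + b] / n_pairs
    return desc


if __name__ == "__main__":
    # Arbitrary made-up sequence, not a real piRNA.
    features = sequence_descriptors("UGACGUAGCUAGGAUCCAUGAGCUAGU")
    for name, value in features.items():
        print(f"{name}\t{value:.3f}")
```

Each sequence becomes one fixed-length numeric row, which is the form that classifiers such as those named in the Methods expect as input.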
Pages: 11