Identification of Common Gene Signatures in Microarray and RNA-Sequencing Data Using Network-Based Regularization

被引:0
|
作者
Diegues, Ines [1 ]
Vinga, Susana [1 ]
Lopes, Marta B. [2 ,3 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, R Alves Redol 9, P-1000029 Lisbon, Portugal
[2] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, P-2829516 Caparica, Portugal
[3] UNL, FCT, CMA, P-2829516 Caparica, Portugal
关键词
Microarray; RNA-sequencing; Machine learning; Biomarkers; Network-based regularization; EXPRESSION OMNIBUS; CANCER; ASSOCIATION; SELECTION;
D O I
10.1007/978-3-030-45385-5_2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray and RNA-sequencing (RNA-seq) gene expression data alongside machine learning algorithms are promising in the discovery of new cancer biomarkers. However, even though they are similar in purpose, there are some fundamental differences between the two techniques. We propose a methodology for cross-platform integration, and biomarker discovery based on network-based regularization via the Twin Networks Recovery (twiner) penalty, as a strategy to enhance the selection of breast cancer gene signatures that have similar correlation patterns in both platforms. In a classification setting based on sparse logistic regression (LR) taking as classes tumor from both RNA-seq and microarray, and normal tissue samples, twiner achieved precision-recall accuracies of 99.71% and 99.57% in the training and test set, respectively. Moreover, the survival analysis results validated the biological relevance of the signatures identified by twiner. Therefore, by leveraging from the existing amount of data for microarray and RNA-seq, a single biological conclusion can be reached, independent of each technology.
引用
收藏
页码:15 / 26
页数:12
相关论文
共 50 条
  • [31] The clinicopathological significance of UBE2C in breast cancer: a study based on immunohistochemistry, microarray and RNA-sequencing data
    Chao-hua Mo
    Li Gao
    Xiao-fei Zhu
    Kang-lai Wei
    Jing-jing Zeng
    Gang Chen
    Zhen-bo Feng
    Cancer Cell International, 17
  • [32] The clinicopathological significance of UBE2C in breast cancer: a study based on immunohistochemistry, microarray and RNA-sequencing data
    Mo, Chao-hua
    Gao, Li
    Zhu, Xiao-fei
    Wei, Kang-lai
    Zeng, Jing-jing
    Chen, Gang
    Feng, Zhen-bo
    CANCER CELL INTERNATIONAL, 2017, 17
  • [33] Protocol for analysis of RNA-sequencing and proteome profiling data for subgroup identification and comparison
    Yang, Kevin C.
    Gorski, Sharon M.
    STAR PROTOCOLS, 2022, 3 (02):
  • [34] Graph attention network for link prediction of gene regulations from single-cell RNA-sequencing data
    Chen, Guangyi
    Liu, Zhi-Ping
    BIOINFORMATICS, 2022, 38 (19) : 4522 - 4529
  • [35] Identification of Key Genes and Potential New Biomarkers for Ovarian Aging: A Study Based on RNA-Sequencing Data
    Ma, Lingwei
    Lu, Huan
    Chen, Runhua
    Wu, Meng
    Jin, Yan
    Zhang, Jinjin
    Wang, Shixuan
    FRONTIERS IN GENETICS, 2020, 11
  • [36] A comparison of survival analysis methods for cancer gene expression RNA-Sequencing data
    Raman, Pichai
    Zimmerman, Samuel
    Rathi, Komal S.
    de Torrente, Laurence
    Sarmady, Mahdi
    Wu, Chao
    Leipzig, Jeremy
    Taylor, Deanne M.
    Tozeren, Aydin
    Mar, Jessica C.
    CANCER GENETICS, 2019, 235 : 1 - 12
  • [37] Defining pervasive transcription units using chromatin RNA-sequencing data
    Guo, Ziwei
    Liu, Xinhong
    Chen, Mo
    STAR PROTOCOLS, 2022, 3 (02):
  • [38] Phylogenomic analysis of Fundulidae (Teleostei: Cyprinodotiformes) using RNA-sequencing data
    Rodgers, Rachel
    Roach, Jennifer L.
    Reid, Noah M.
    Whitehead, Andrew
    Duvernell, David D.
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2018, 121 : 150 - 157
  • [39] Normalization of gene counts affects principal components-based exploratory analysis of RNA-sequencing data
    van Lingen, Henk J.
    Suarez-Diez, Maria
    Saccenti, Edoardo
    BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS, 2024, 1867 (04):
  • [40] Network-based inference of cancer progression from microarray data
    Park, Yongjin
    Shackney, Stanley
    Schwartz, Russell
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2008, 4983 : 268 - +