Identification of Common Gene Signatures in Microarray and RNA-Sequencing Data Using Network-Based Regularization

被引:0
|
作者
Diegues, Ines [1 ]
Vinga, Susana [1 ]
Lopes, Marta B. [2 ,3 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, R Alves Redol 9, P-1000029 Lisbon, Portugal
[2] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, P-2829516 Caparica, Portugal
[3] UNL, FCT, CMA, P-2829516 Caparica, Portugal
关键词
Microarray; RNA-sequencing; Machine learning; Biomarkers; Network-based regularization; EXPRESSION OMNIBUS; CANCER; ASSOCIATION; SELECTION;
D O I
10.1007/978-3-030-45385-5_2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray and RNA-sequencing (RNA-seq) gene expression data alongside machine learning algorithms are promising in the discovery of new cancer biomarkers. However, even though they are similar in purpose, there are some fundamental differences between the two techniques. We propose a methodology for cross-platform integration, and biomarker discovery based on network-based regularization via the Twin Networks Recovery (twiner) penalty, as a strategy to enhance the selection of breast cancer gene signatures that have similar correlation patterns in both platforms. In a classification setting based on sparse logistic regression (LR) taking as classes tumor from both RNA-seq and microarray, and normal tissue samples, twiner achieved precision-recall accuracies of 99.71% and 99.57% in the training and test set, respectively. Moreover, the survival analysis results validated the biological relevance of the signatures identified by twiner. Therefore, by leveraging from the existing amount of data for microarray and RNA-seq, a single biological conclusion can be reached, independent of each technology.
引用
收藏
页码:15 / 26
页数:12
相关论文
共 50 条
  • [1] Regularization network-based gene selection for microarray data analysis
    Zhou, Xin
    Mao, K. Z.
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2006, 16 (05) : 341 - 352
  • [2] Identification of common and dissimilar biomarkers for different cancer types from gene expressions of RNA-sequencing data
    Venkataramana, Lokeswari
    Jacob, Shomona Gracia
    Saraswathi, S.
    Prasad, D. Venkata Vara
    GENE REPORTS, 2020, 19
  • [3] Identification of breast cancer-related circRNAs by analysis of microarray and RNA-sequencing data An observational study
    Zhao, Chun-Hua
    Qu, Le
    Zhang, Hui
    Qu, Rui
    MEDICINE, 2019, 98 (46) : e18042
  • [4] Transfer of clinically relevant gene expression signatures in breast cancer: from Affymetrix microarray to Illumina RNA-Sequencing technology
    Debora Fumagalli
    Alexis Blanchet-Cohen
    David Brown
    Christine Desmedt
    David Gacquer
    Stefan Michiels
    Françoise Rothé
    Samira Majjaj
    Roberto Salgado
    Denis Larsimont
    Michail Ignatiadis
    Marion Maetens
    Martine Piccart
    Vincent Detours
    Christos Sotiriou
    Benjamin Haibe-Kains
    BMC Genomics, 15
  • [5] Transfer of clinically relevant gene expression signatures in breast cancer: from Affymetrix microarray to Illumina RNA-Sequencing technology
    Fumagalli, Debora
    Blanchet-Cohen, Alexis
    Brown, David
    Desmedt, Christine
    Gacquer, David
    Michiels, Stefan
    Rothe, Francoise
    Majjaj, Samira
    Salgado, Roberto
    Larsimont, Denis
    Ignatiadis, Michail
    Maetens, Marion
    Piccart, Martine
    Detours, Vincent
    Sotiriou, Christos
    Haibe-Kains, Benjamin
    BMC GENOMICS, 2014, 15
  • [6] Identification of Dysregulated microRNAs in Glioma Using RNA-sequencing
    Chang Liu
    Ying-ying Ge
    Xiao-xun Xie
    Bin Luo
    Ning Shen
    Xing-sheng Liao
    Shui-qing Bi
    Tao Xu
    Shao-wen Xiao
    Qing-mei Zhang
    Current Medical Science, 2021, 41 : 356 - 367
  • [7] Identification of Dysregulated microRNAs in Glioma Using RNA-sequencing
    Liu, Chang
    Ge, Ying-ying
    Xie, Xiao-Xun
    Luo, Bin
    Shen, Ning
    Liao, Xing-Sheng
    Bi, Shui-Qing
    Xu, Tao
    Xiao, Shao-wen
    Zhang, Qing-mei
    CURRENT MEDICAL SCIENCE, 2021, 41 (02) : 356 - 367
  • [8] PASTA: splice junction identification from RNA-Sequencing data
    Tang, Shaojun
    Riva, Alberto
    BMC BIOINFORMATICS, 2013, 14
  • [9] PASTA: splice junction identification from RNA-Sequencing data
    Shaojun Tang
    Alberto Riva
    BMC Bioinformatics, 14
  • [10] Using data mining to discover signatures in network-based intrusion detection
    Han, H
    Lu, XL
    Ren, LY
    2002 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-4, PROCEEDINGS, 2002, : 13 - 17