Identification of Common Gene Signatures in Microarray and RNA-Sequencing Data Using Network-Based Regularization

被引:0
|
作者
Diegues, Ines [1 ]
Vinga, Susana [1 ]
Lopes, Marta B. [2 ,3 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, R Alves Redol 9, P-1000029 Lisbon, Portugal
[2] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, P-2829516 Caparica, Portugal
[3] UNL, FCT, CMA, P-2829516 Caparica, Portugal
关键词
Microarray; RNA-sequencing; Machine learning; Biomarkers; Network-based regularization; EXPRESSION OMNIBUS; CANCER; ASSOCIATION; SELECTION;
D O I
10.1007/978-3-030-45385-5_2
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray and RNA-sequencing (RNA-seq) gene expression data alongside machine learning algorithms are promising in the discovery of new cancer biomarkers. However, even though they are similar in purpose, there are some fundamental differences between the two techniques. We propose a methodology for cross-platform integration, and biomarker discovery based on network-based regularization via the Twin Networks Recovery (twiner) penalty, as a strategy to enhance the selection of breast cancer gene signatures that have similar correlation patterns in both platforms. In a classification setting based on sparse logistic regression (LR) taking as classes tumor from both RNA-seq and microarray, and normal tissue samples, twiner achieved precision-recall accuracies of 99.71% and 99.57% in the training and test set, respectively. Moreover, the survival analysis results validated the biological relevance of the signatures identified by twiner. Therefore, by leveraging from the existing amount of data for microarray and RNA-seq, a single biological conclusion can be reached, independent of each technology.
引用
收藏
页码:15 / 26
页数:12
相关论文
共 50 条
  • [41] Network-Based Inference of Cancer Progression from Microarray Data
    Park, Yongjin
    Shackney, Stanley
    Schwartz, Russell
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (02) : 200 - 212
  • [42] OMEN: network-based driver gene identification using mutual exclusivity
    Van Daele, Dries
    Weytjens, Bram
    De Raedt, Luc
    Marchal, Kathleen
    BIOINFORMATICS, 2022, 38 (12) : 3245 - 3251
  • [43] Clinical roles of the aberrantly expressed lncRNAs in lung squamous cell carcinoma: a study based on RNA-sequencing and microarray data mining
    Chen, Wen-Jie
    Tang, Rui-Xue
    He, Rong-Quan
    Li, Dong-Yao
    Liang, Liang
    Zeng, Jiang-Hui
    Hu, Xiao-Hua
    Ma, Jie
    Li, Shi-Kang
    Chen, Gang
    ONCOTARGET, 2017, 8 (37) : 61282 - 61304
  • [44] Identification of Long Non-Coding RNA Profiles and Potential Therapeutic Agents for Fibrolamellar Carcinoma Based on RNA-Sequencing Data
    Kim, Janghyun
    Kim, Young
    Lee, Bora
    GENES, 2023, 14 (09)
  • [45] Identification of the key genes implicated in the transformation of OLP to OSCC using RNA-sequencing
    Yang, Qiaozhen
    Guo, Bin
    Sun, Hongying
    Zhang, Jie
    Liu, Shangfeng
    Hexige, Saiyin
    Yu, Xuedi
    Wang, Xiaxia
    ONCOLOGY REPORTS, 2017, 37 (04) : 2355 - 2365
  • [46] Identification of Novel Transcribed Regions in Zebrafish (Danio rerio) Using RNA-Sequencing
    Wang, Jingwen
    Vesterlund, Liselotte
    Kere, Juha
    Jiao, Hong
    PLOS ONE, 2016, 11 (07):
  • [47] Identification of altered biological processes in heterogeneous RNA-sequencing data by discretization of expression profiles
    Lauria, Andrea
    Peirone, Serena
    Del Giudice, Marco
    Priante, Francesca
    Rajan, Prabhakar
    Caselle, Michele
    Oliviero, Salvatore
    Cereda, Matteo
    NUCLEIC ACIDS RESEARCH, 2020, 48 (04) : 1730 - 1747
  • [48] CELL SUBCLASS IDENTIFICATION IN SINGLE-CELL RNA-SEQUENCING DATA USING ORTHOGONAL NONNEGATIVE MATRIX FACTORIZATION
    Wang, Shuai
    Wu, Peng
    Zhou, Manqi
    Chang, Tsung-Hui
    Wu, Song
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 876 - 880
  • [49] RNA-sequencing based identification of crucial genes for esophageal squamous cell carcinoma
    Fu, Jian-Hua
    Wang, Li-Quan
    Li, Tao
    Ma, Guo-Jun
    JOURNAL OF CANCER RESEARCH AND THERAPEUTICS, 2015, 11 (02) : 420 - 425
  • [50] tsRNAsearch: a pipeline for the identification of tRNA and ncRNA fragments from small RNA-sequencing data
    Donovan, Paul D.
    McHale, Natalie M.
    Veno, Morten T.
    Prehn, Jochen H. M.
    BIOINFORMATICS, 2021, 37 (23) : 4424 - 4430