A graph-based multi-sample test for identifying pathways associated with cancer progression

Cited by: 2
Authors
Zhang, Qingyang [1]
Mahdi, Ghadeer [1, 2]
Tinker, Jian [1 ]
Chen, Hao [3 ]
Affiliations
[1] Univ Arkansas, Dept Math Sci, Fayetteville, AR 72701 USA
[2] Baghdad Univ, Coll Educ, Dept Math, Baghdad, Iraq
[3] Univ Calif Davis, Dept Stat, Davis, CA 95616 USA
Keywords
Edge-count test; Tumorigenesis; Serous ovarian cancer; Pathway analysis; The Cancer Genome Atlas; GENE-EXPRESSION PROFILES; BREAST-CANCER; 2-SAMPLE TEST; MULTIVARIATE; MELANOMA;
DOI
10.1016/j.compbiolchem.2020.107285
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Cancer is generally not the result of an abnormality in a single gene but a consequence of changes in many genes. It is therefore of great importance to understand the roles of different oncogenic and tumor suppressor pathways in tumorigenesis. In recent years, many computational models have been developed to study the genetic alterations of different pathways in the evolutionary process of cancer. However, most of these methods are knowledge-based enrichment analyses that cannot readily accommodate user-defined pathways or gene sets. In this paper, we develop a nonparametric, data-driven approach to testing for dynamic changes in pathways over the course of cancer progression. Our method is based on an expansion and refinement of the pathway being studied, followed by a graph-based multivariate test, which is very easy to implement in practice. The new test is applied to data from The Cancer Genome Atlas to study the (epi)genetic alterations of 186 KEGG pathways in the development of serous ovarian cancer. To make use of the comprehensive data, we incorporate three data types in the analysis, representing gene expression level, copy number and DNA methylation level. Our analysis suggests a list of nine pathways that are closely associated with serous ovarian cancer progression, including the cell cycle, ERBB, JAK-STAT signaling and p53 signaling pathways. Using pairwise tests, we find that most of the identified pathways contribute only to a particular transition step. For instance, the cell cycle and ERBB pathways play key roles in the early-stage transition, while the ECM receptor and apoptosis pathways contribute to the progression from stage III to stage IV. The proposed computational pipeline is powerful in detecting important pathways and gene sets that drive cancers at particular stages. It offers new insights into the molecular mechanisms of cancer initiation and progression.
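The abstract does not state the test statistic, so the following is only a minimal sketch of one common graph-based multi-sample idea, not the authors' pipeline. It builds a minimum spanning tree over the pooled samples, counts the edges that join samples from different groups (here, cancer stages), and calibrates that count by permuting the group labels. The function names (`mst_edges`, `between_group_edges`, `edge_count_test`), the Euclidean distance, and the toy data are all assumptions made for illustration. The pathway expansion and refinement step described above is not shown.

```python
import math
import random


def mst_edges(points):
    """Prim's algorithm on the complete Euclidean graph; returns n-1 edges (i, j)."""
    n = len(points)
    in_tree = [False] * n
    best_dist = [math.inf] * n   # shortest known distance from the tree to each node
    best_from = [-1] * n         # tree node that achieves that distance
    best_dist[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the node outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_dist[i])
        in_tree[u] = True
        if best_from[u] >= 0:
            edges.append((best_from[u], u))
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    best_from[v] = u
    return edges


def between_group_edges(edges, labels):
    """Number of tree edges whose two endpoints carry different group labels."""
    return sum(1 for i, j in edges if labels[i] != labels[j])


def edge_count_test(points, labels, n_perm=999, seed=0):
    """Permutation test: few between-group edges suggest the groups differ."""
    rng = random.Random(seed)
    edges = mst_edges(points)    # the tree depends only on the pooled data
    observed = between_group_edges(edges, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if between_group_edges(edges, shuffled) <= observed:
            hits += 1
    # Add-one correction so the p-value is never exactly zero.
    p_value = (hits + 1) / (n_perm + 1)
    return observed, p_value


# Toy data (synthetic, not TCGA): three "stages" with 10 samples each,
# centred far apart so the stage distributions clearly differ.
rng = random.Random(1)
points, labels = [], []
for stage in range(3):
    for _ in range(10):
        points.append((100 * stage + rng.gauss(0, 1), 100 * stage + rng.gauss(0, 1)))
        labels.append(stage)

observed, p_value = edge_count_test(points, labels)
print(observed, p_value)
```

In this sketch, a pairwise comparison of two stages would amount to calling `edge_count_test` on only the samples from those two stages. Each sample would in practice be a vector of pathway-level measurements rather than a 2-D point.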
Pages: 10