Utilizing somatic mutation data from numerous studies for cancer research: proof of concept and applications

被引:13
|
作者
Amar, D. [1 ]
Izraeli, S. [2 ,3 ]
Shamir, R. [1 ]
机构
[1] Tel Aviv Univ, Blavatnik Sch Comp Sci, Tel Aviv Univ Campus, IL-69978 Tel Aviv, Israel
[2] Sheba Med Ctr, Safra Childrens Hosp, Dept Pediat Hematol Oncol, Ramat Gan, Israel
[3] Tel Aviv Univ, Sackler Sch Med, Tel Aviv, Israel
基金
以色列科学基金会;
关键词
HEDGEHOG SIGNALING PATHWAY; GENE-EXPRESSION PROFILES; INTEGRATED ANALYSIS; IDENTIFICATION; CYTOSCAPE; ONTOLOGY; BIOLOGY; SMAD4; KRAS;
D O I
10.1038/onc.2016.489
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large cancer projects measure somatic mutations in thousands of samples, gradually assembling a catalog of recurring mutations in cancer. Many methods analyze these data jointly with auxiliary information with the aim of identifying subtype-specific results. Here, we show that somatic gene mutations alone can reliably and specifically predict cancer subtypes. Interpretation of the classifiers provides useful insights for several biomedical applications. We analyze the COSMIC database, which collects somatic mutations from The Cancer Genome Atlas (TCGA) as well as from many smaller scale studies. We use multi-label classification techniques and the Disease Ontology hierarchy in order to identify cancer subtype-specific biomarkers. Cancer subtype classifiers based on TCGA and the smaller studies have comparable performance, and the smaller studies add a substantial value in terms of validation, coverage of additional subtypes, and improved classification. The gene sets of the classifiers are used for threefold contribution. First, we refine the associations of genes to cancer subtypes and identify novel compelling candidate driver genes. Second, using our classifiers we successfully predict the primary site of metastatic samples. Third, we provide novel hypotheses regarding detection of subtype-specific synthetic lethality interactions. From the cancer research community perspective, our results suggest that curation efforts, such as COSMIC, have great added and complementary value even in the era of large international cancer projects.
引用
收藏
页码:3375 / 3383
页数:9
相关论文
共 50 条
  • [21] Somatic Mutation Analyses in Studies of the Clonal Evolution and Diagnostic Targets of Prostate Cancer
    Mikhaylenko, Dmitry S.
    Efremov, Gennady D.
    Strelnikov, Vladimir V.
    Zaletaev, Dmitry V.
    Alekseev, Boris Y.
    CURRENT GENOMICS, 2017, 18 (03) : 236 - 243
  • [22] Design and analysis issues in genome-wide somatic mutation studies of cancer
    Parmigiani, Giovanni
    Boca, Simina
    Lin, Jimmy
    Kinzler, Kenneth W.
    Velculescu, VicLor
    Vogelstein, Bert
    GENOMICS, 2009, 93 (01) : 17 - 21
  • [23] A short tutorial in analyzing NGS data of cancer genomes for somatic mutation calling
    Vuong, Huy
    Zhao, Zhongming
    BMC BIOINFORMATICS, 2013, 14
  • [24] FDK'S SMT25 RESULTED FROM DESIGN CONCEPT AND NUMEROUS STUDIES.
    Kaneda, Takeyoshi
    JEE. Journal of electronic engineering, 1988, 25 (253): : 38 - 40
  • [25] A short tutorial in analyzing NGS data of cancer genomes for somatic mutation calling
    Huy Vuong
    Zhongming Zhao
    BMC Bioinformatics, 14
  • [26] MUFFINN: cancer gene discovery via network analysis of somatic mutation data
    Cho, Ara
    Shim, Jung Eun
    Kim, Eiru
    Supek, Fran
    Lehner, Ben
    Lee, Insuk
    GENOME BIOLOGY, 2016, 17
  • [27] MUFFINN: cancer gene discovery via network analysis of somatic mutation data
    Ara Cho
    Jung Eun Shim
    Eiru Kim
    Fran Supek
    Ben Lehner
    Insuk Lee
    Genome Biology, 17
  • [28] Observations Utilizing Korea Ocean Research Stations and their Applications for Process Studies
    Ha, Kyung-Ja
    Nam, Sunghyun
    Jeong, In-Yong
    Moon, Il-Ju
    Lee, Meehye
    Yun, Junghee
    Jang, Chan Joo
    Kim, Yong Sun
    Byun, Do-Seong
    Heo, Ki-Young
    Shim, Jae-Seol
    BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2019, 100 (10) : 2061 - 2075
  • [29] Standardized Data Sharing in a Paediatric Oncology Research Network - A Proof-of-Concept Study
    Hochedlinger, Nina
    Nitzlnader, Michael
    Falgenhauer, Markus
    Welte, Stefan
    Hayn, Dieter
    Koumakis, Lefteris
    Potamias, George
    Tsiknakis, Manolis
    Saraceno, Davide
    Rinaldi, Eugenia
    Ladenstein, Ruth
    Schreier, Guenter
    EHEALTH2015 - HEALTH INFORMATICS MEETS EHEALTH: INNOVATIVE HEALTH PERSPECTIVES: PERSONALIZED HEALTH, 2015, 212 : 27 - 34
  • [30] Classification of fNIRS data with LDA and SVM : a proof-of-concept for application in infant studies
    Gemignani, Jessica
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 824 - 827