Multiple signatures of a disease in potential biomarker space: Getting the signatures consensus and identification of novel biomarkers

被引:4
|
作者
Ow, Ghim Siong [1 ]
Kuznetsov, Vladimir A. [1 ,2 ]
机构
[1] ASTAR, Bioinformat Inst, Singapore, Singapore
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
来源
BMC GENOMICS | 2015年 / 16卷
关键词
GENE-EXPRESSION DATA; CANCER; ENRICHMENT; PROGRAMS; SAMPLES; TOOL;
D O I
10.1186/1471-2164-16-S7-S2
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The lack of consensus among reported gene signature subsets (GSSs) in multi-gene biomarker discovery studies is often a concern for researchers and clinicians. Subsequently, it discourages larger scale prospective studies, prevents the translation of such knowledge into a practical clinical setting and ultimately hinders the progress of the field of biomarker-based disease classification, prognosis and prediction. Methods: We define all "gene identificators" (gIDs) as constituents of the entire potential disease biomarker space. For each gID in a GSS of interest ("tested GSS"/tGSS), our method counts the empirical frequency of gID co-occurrences/overlaps in other reference GSSs (rGSSs) and compares it with the expected frequency generated via implementation of a randomized sampling procedure. Comparison of the empirical frequency distribution (EFD) with the expected background frequency distribution (BFD) allows dichotomization of statistically novel (SN) and common (SC) gIDs within the tGSS. Results: We identify SN or SC biomarkers for tGSSs obtained from previous studies of high-grade serous ovarian cancer (HG-SOC) and breast cancer (BC). For each tGSS, the EFD of gID co-occurrences/overlaps with other rGSSs is characterized by scale and context-dependent Pareto-like frequency distribution function. Our results indicate that while independently there is little overlap between our tGSS with individual rGSSs, comparison of the EFD with BFD suggests that beyond a confidence threshold, tested gIDs become more common in rGSSs than expected. This validates the use of our tGSS as individual or combined prognostic factors. Our method identifies SN and SC genes of a 36-gene prognostic signature that stratify HG-SOC patients into subgroups with low, intermediate or high-risk of the disease outcome. Using 70 BC rGSSs, the method also predicted SN and SC BC prognostic genes from the tested obesity and IGF1 pathway GSSs. Conclusions: Our method provides a strategy that identify/predict within a tGSS of interest, gID subsets that are either SN or SC when compared to other rGSSs. Practically, our results suggest that there is a stronger association of the IGF1 signature genes with the 70 BC rGSSs, than for the obesity-associated signature. Furthermore, both SC and SN genes, in both signatures could be considered as perspective prognostic biomarkers of BCs that stratify the patients onto low or high risks
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Multiple signatures of a disease in potential biomarker space: Getting the signatures consensus and identification of novel biomarkers
    Ghim Siong Ow
    Vladimir A Kuznetsov
    BMC Genomics, 16
  • [2] Identification of Extracellular Matrix Signatures as Novel Potential Prognostic Biomarkers in Lung Adenocarcinoma
    Zeng, Zhen
    Zuo, Yuanli
    Jin, Yang
    Peng, Yong
    Zhu, Xiaofeng
    FRONTIERS IN GENETICS, 2022, 13
  • [3] Identification of gene expression signatures as potential novel biomarkers in malignant melanoma.
    Figueroa, Stephanie
    Tiwari, Raj
    Geliebter, Jan
    CANCER RESEARCH, 2021, 81 (13)
  • [4] Identification of Six Novel Prognostic Gene Signatures as Potential Biomarkers in Small Cell Lung Cancer
    Feng, Shicheng
    Zhang, Xiuxiu
    Gu, Xuyu
    Zhou, Min
    Chen, Yan
    Wang, Cailian
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2023, 26 (05) : 938 - 949
  • [5] Identification of circulating microRNA signatures as potential biomarkers in the serum of elk infected with chronic wasting disease
    Jessy A. Slota
    Sarah J. Medina
    Megan Klassen
    Damian Gorski
    Christine M. Mesa
    Catherine Robertson
    Gordon Mitchell
    Michael B. Coulthart
    Sandra Pritzkow
    Claudio Soto
    Stephanie A. Booth
    Scientific Reports, 9
  • [6] Identification of circulating microRNA signatures as potential biomarkers in the serum of elk infected with chronic wasting disease
    Slota, Jessy A.
    Medina, Sarah J.
    Klassen, Megan
    Gorski, Damian
    Mesa, Christine M.
    Robertson, Catherine
    Mitchell, Gordon
    Coulthart, Michael B.
    Pritzkow, Sandra
    Soto, Claudio
    Booth, Stephanie A.
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [7] Identification of Biomarkers and Signatures in Protein Data
    Nordling, Torbjorn E. M.
    Padhan, Narendra
    Nelander, Sven
    Claesson-Welsh, Lena
    2015 IEEE 11TH INTERNATIONAL CONFERENCE ON E-SCIENCE, 2015, : 411 - 419
  • [8] Disease signatures: Biomarkers/indicators of neurodegeneration
    Zetterberg, Henrik
    Baehr, Mathias
    MOLECULAR AND CELLULAR NEUROSCIENCE, 2019, 97 : 1 - 2
  • [9] Towards the Identification of Disease Signatures
    Venetis, Tassos
    Ailamaki, Anastasia
    Heinis, Thomas
    Karpathiotakis, Manos
    Kherif, Ferath
    Mitelpunkt, Alexis
    Vassalos, Vasilis
    BRAIN INFORMATICS AND HEALTH (BIH 2015), 2015, 9250 : 145 - 155
  • [10] Identification of Multiple Invalid Signatures in Pairing-Based Batched Signatures
    Matt, Brian J.
    PUBLIC KEY CRYPTOGRAPHY-PKC 2009, PROCEEDINGS, 2009, 5443 : 337 - 356