Continuous representations of time-series gene expression data

被引:170
|
作者
Bar-Joseph, Z
Gerber, GK
Gifford, DK
Jaakkola, TS
Simon, I
机构
[1] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[2] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
[3] Whitehead Inst Biomed Res, Cambridge, MA 02142 USA
关键词
time series expression data; missing value estimation; clustering; alignment;
D O I
10.1089/10665270360688057
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.
引用
收藏
页码:341 / 356
页数:16
相关论文
共 50 条
  • [1] Gene Selection in Time-Series Gene Expression Data
    Adhikari, Prem Raj
    Upadhyaya, Bimal Babu
    Meng, Chen
    Hollmen, Jaakko
    PATTERN RECOGNITION IN BIOINFORMATICS, 2011, 7036 : 145 - +
  • [2] Clustering Time-Series Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    Ngom, Alioune
    TRANSACTIONS ON COMPUTATIONAL SYSTEMS BIOLOGY X, 2008, 5410 : 100 - 123
  • [3] Interactive Network Visualization of Gene Expression Time-Series Data
    Cruz, Antonio
    Arrais, Joel P.
    Machado, Penousal
    2018 22ND INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV), 2018, : 574 - 580
  • [4] Clustering of unevenly sampled gene expression time-series data
    Möller-Levet, CS
    Klawonn, F
    Cho, KH
    Yin, H
    Wolkenhauer, O
    FUZZY SETS AND SYSTEMS, 2005, 152 (01) : 49 - 66
  • [5] A dynamic time order network for time-series gene expression data analysis
    Zhang, Pengyue
    Mourad, Raphael
    Xiang, Yang
    Huang, Kun
    Huang, Tim
    Nephew, Kenneth
    Liu, Yunlong
    Li, Lang
    BMC SYSTEMS BIOLOGY, 2012, 6
  • [6] Data Modeling With Polynomial Representations and Autoregressive Time-Series Representations, and Their Connections
    Nandi, Asoke K.
    IEEE ACCESS, 2020, 8 : 110412 - 110424
  • [7] Stochastic dynamic modeling of short gene expression time-series data
    Wang, Zidong
    Yang, Fuwen
    Ho, Daniel W. C.
    Swift, Stephen
    Tucker, Allan
    Liu, Xiaohui
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2008, 7 (01) : 44 - 55
  • [8] Extrapolating heterogeneous time-series gene expression data using Sagittarius
    Woicik, Addie
    Zhang, Mingxin
    Chan, Janelle
    Ma, Jianzhu
    Wang, Sheng
    NATURE MACHINE INTELLIGENCE, 2023, 5 (07) : 699 - +
  • [9] A New Biclustering Algorithm for Time-Series Gene Expression Data Analysis
    Xue, Yun
    Liao, Zhengling
    Li, Meihang
    Luo, Jie
    Hu, Xiaohui
    Luo, Guiyin
    Chen, Wen-Sheng
    2014 TENTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2014, : 268 - 272
  • [10] Extrapolating heterogeneous time-series gene expression data using Sagittarius
    Addie Woicik
    Mingxin Zhang
    Janelle Chan
    Jianzhu Ma
    Sheng Wang
    Nature Machine Intelligence, 2023, 5 : 699 - 713