Continuous representations of time-series gene expression data

Cited by: 170
Authors
Bar-Joseph, Z
Gerber, GK
Gifford, DK
Jaakkola, TS
Simon, I
Affiliations
[1] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[2] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
[3] Whitehead Inst Biomed Res, Cambridge, MA 02142 USA
Keywords
time series expression data; missing value estimation; clustering; alignment
DOI
10.1089/10665270360688057
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data, and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene-specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.
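The idea of treating an expression profile as a continuous cubic spline can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract describes splines whose coefficients are shared across genes of the same class, whereas the code below fits a plain interpolating natural cubic spline to a single profile. The function names `fit_natural_cubic_spline` and `evaluate_spline`, and the sample times and values in the usage lines, are assumptions made up for illustration.

```python
from bisect import bisect_right


def fit_natural_cubic_spline(t, y):
    """Fit an interpolating natural cubic spline to one profile.

    t: strictly increasing sampling times (may be nonuniform).
    y: expression values observed at those times.
    Returns per-segment coefficients (a, b, c, d) such that on
    [t[j], t[j+1]] the curve is a + b*dx + c*dx**2 + d*dx**3, dx = x - t[j].
    """
    n = len(t) - 1
    if n < 1 or len(y) != len(t):
        raise ValueError("need at least two points and matching lengths")
    h = [t[i + 1] - t[i] for i in range(n)]
    if any(step <= 0 for step in h):
        raise ValueError("times must be strictly increasing")

    # Right-hand side of the tridiagonal system for the c coefficients.
    alpha = [0.0] * (n + 1)
    for i in range(1, n):
        alpha[i] = 3 * (y[i + 1] - y[i]) / h[i] - 3 * (y[i] - y[i - 1]) / h[i - 1]

    # Forward sweep; natural boundary conditions fix c[0] = c[n] = 0.
    l = [1.0] * (n + 1)
    mu = [0.0] * (n + 1)
    z = [0.0] * (n + 1)
    for i in range(1, n):
        l[i] = 2 * (t[i + 1] - t[i - 1]) - h[i - 1] * mu[i - 1]
        mu[i] = h[i] / l[i]
        z[i] = (alpha[i] - h[i - 1] * z[i - 1]) / l[i]

    # Back substitution for c, then b and d from continuity conditions.
    c = [0.0] * (n + 1)
    b = [0.0] * n
    d = [0.0] * n
    for j in range(n - 1, -1, -1):
        c[j] = z[j] - mu[j] * c[j + 1]
        b[j] = (y[j + 1] - y[j]) / h[j] - h[j] * (c[j + 1] + 2 * c[j]) / 3
        d[j] = (c[j + 1] - c[j]) / (3 * h[j])
    return list(y[:n]), b, c[:n], d


def evaluate_spline(t, coeffs, x):
    """Evaluate the fitted spline at time x (end segments are extended)."""
    a, b, c, d = coeffs
    j = min(max(bisect_right(t, x) - 1, 0), len(a) - 1)
    dx = x - t[j]
    return a[j] + dx * (b[j] + dx * (c[j] + dx * d[j]))


# Hypothetical, nonuniformly sampled profile (made-up values).
times = [0.0, 7.0, 14.0, 21.0, 35.0, 49.0, 70.0]
values = [0.1, 0.8, 1.2, 0.9, 0.2, -0.4, 0.0]
coeffs = fit_natural_cubic_spline(times, values)
estimate_at_28 = evaluate_spline(times, coeffs, 28.0)  # unobserved time point
```

Because the fitted curve is defined at every time, an unobserved point such as t = 28 can be read off directly, and two profiles sampled at different times can be compared on a common grid. An interpolating spline passes through every observation, noise included, which is one reason a model that pools information across genes in a class, as the abstract describes, would be expected to behave differently.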
Pages: 341-356
Page count: 16
Related papers
50 in total
  • [21] Prioritizing biological pathways by recognizing context in time-series gene expression data
    Jusang Lee
    Kyuri Jo
    Sunwon Lee
    Jaewoo Kang
    Sun Kim
    BMC Bioinformatics, 17
  • [22] HMM Training using Correlation Coefficients of Time-Series Gene Expression Data
    Li, Jiangeng
    Guo, Qinglei
    He, Yiheng
    PROCEEDINGS OF THE 2012 24TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2012, : 3719 - 3723
  • [23] Using gene expression programming to infer gene regulatory networks from time-series data
    Zhang, Yongqing
    Pu, Yifei
    Zhang, Haisen
    Su, Yabo
    Zhang, Lifang
    Zhou, Jiliu
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2013, 47 : 198 - 206
  • [24] Inferring Time-Delayed Causal Gene Network Using Time-Series Expression Data
    Lo, Leung-Yau
    Leung, Kwong-Sak
    Lee, Kin-Hong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (05) : 1169 - 1182
  • [25] TIME-SERIES OF CONTINUOUS PROPORTIONS
    GRUNWALD, GK
    RAFTERY, AE
    GUTTORP, P
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1993, 55 (01) : 103 - 116
  • [26] Microarray Time-Series Data Clustering via Multiple Alignment of Gene Expression Profiles
    Subhani, Numanul
    Ngom, Alioune
    Rueda, Luis
    Burden, Conrad
    PATTERN RECOGNITION IN BIOINFORMATICS, PROCEEDINGS, 2009, 5780 : 377 - +
  • [27] Multi-objective evolutionary triclustering with constraints of time-series gene expression data
    Chen, Lei
    Liu, Hai-Lin
    Tang, Weiseng
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2019, 26 (04) : 399 - 410
  • [28] Discovery of bidirectional contiguous column coherent bicluster in time-series gene expression data
    Xue, Yun
    Ma, Zhihao
    Xu, Huixin
    Lu, Zhihao
    Hu, Xiaohui
    Pang, Chaoyi
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 413 - 426
  • [29] A contiguous column coherent evolution biclustering algorithm for time-series gene expression data
    Xue, Yun
    Zhang, Meizhen
    Liao, Zhengling
    Li, Meihang
    Luo, Jie
    Hu, Xiaohui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (03) : 441 - 453
  • [30] Studying and modelling dynamic biological processes using time-series gene expression data
    Ziv Bar-Joseph
    Anthony Gitter
    Itamar Simon
    Nature Reviews Genetics, 2012, 13 : 552 - 564