Biclustering by sparse canonical correlation analysis

Cited by: 5
Authors
Pimentel, Harold [1]
Hu, Zhiyue [2]
Huang, Haiyan [2]
Affiliations
[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
Keywords
biclustering; SCCA; gene clusters
DOI
10.1007/s40484-017-0127-0
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Developing appropriate computational tools to distill biological insights from large-scale gene expression data has been an important part of systems biology. Considering that gene relationships may change or only exist in a subset of collected samples, biclustering that involves clustering both genes and samples has become increasingly important, especially when the samples are pooled from a wide range of experimental conditions.
Methods: In this paper, we introduce a new biclustering algorithm to find subsets of genomic expression features (EFs) (e.g., genes, isoforms, exon inclusion) that show strong "group interactions" under certain subsets of samples. Group interactions are defined by strong partial correlations, or equivalently, conditional dependencies between EFs after removing the influences of a set of other functionally related EFs. Our new biclustering method, named SCCA-BC, extends an existing method for group interaction inference, which is based on sparse canonical correlation analysis (SCCA) coupled with repeated random partitioning of the gene expression data set.
Results: SCCA-BC gives sensible results on real data sets and outperforms most existing methods in simulations. Software is available at https://github.com/pimentel/scca-bc.
Conclusions: SCCA-BC seems to work in numerous conditions and the results seem promising for future extensions. SCCA-BC has the ability to find different types of bicluster patterns, and it is especially advantageous in identifying a bicluster whose elements share the same progressive and multivariate normal distribution with a dense covariance matrix.
Pages: 56-67
Page count: 12
Related papers
50 in total
  • [31] Sparse Representation based Discriminative Canonical Correlation Analysis for Face Recognition
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    Lan, Long
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 51 - 56
  • [32] A fault detection method based on sparse dynamic canonical correlation analysis
    Hu, Xuguang
    Wu, Ping
    Pan, Haipeng
    He, Yuchen
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2024, 102 (03): : 1188 - 1202
  • [33] An iterative penalized least squares approach to sparse canonical correlation analysis
    Mai, Qing
    Zhang, Xin
    BIOMETRICS, 2019, 75 (03) : 734 - 744
  • [34] Branch-and-bound algorithm for optimal sparse canonical correlation analysis
    Watanabe, Akihisa
    Tamura, Ryuta
    Takano, Yuichi
    Miyashiro, Ryuhei
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 217
  • [35] Sparse Bayesian multiway canonical correlation analysis for EEG pattern recognition
    Zhang, Yu
    Zhou, Guoxu
    Jin, Jing
    Zhang, Yangsong
    Wang, Xingyu
    Cichocki, Andrzej
    NEUROCOMPUTING, 2017, 225 : 103 - 110
  • [36] Sparse canonical correlation analysis algorithm with alternating direction method of multipliers
    Gu, Xiaolan
    Wang, Qiusheng
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2020, 49 (09) : 2372 - 2388
  • [37] Sparse additive discriminant canonical correlation analysis for multiple features fusion
    Wang, Zhan
    Wang, Lizhi
    Huang, Hua
    NEUROCOMPUTING, 2021, 463 : 185 - 197
  • [38] Sparse multiway canonical correlation analysis for multimodal stroke recovery data
    Das, Subham
    West, Franklin D.
    Park, Cheolwoo
    BIOMETRICAL JOURNAL, 2024, 66 (02)
  • [39] Sparse tensor canonical correlation analysis for micro-expression recognition
    Wang, Su-Jing
    Yan, Wen-Jing
    Sun, Tingkai
    Zhao, Guoying
    Fu, Xiaolan
    NEUROCOMPUTING, 2016, 214 : 218 - 232
  • [40] Structure-constrained sparse canonical correlation analysis with an application to microbiome data analysis
    Chen, Jun
    Bushman, Frederic D.
    Lewis, James D.
    Wu, Gary D.
    Li, Hongzhe
    BIOSTATISTICS, 2013, 14 (02) : 244 - 258