A constrained matrix-variate Gaussian process for transposable data

被引:1
|
作者
Koyejo, Oluwasanmi [1 ]
Lee, Cheng [2 ]
Ghosh, Joydeep [3 ]
机构
[1] Univ Texas Austin, Imaging Res Ctr, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept Biomed Engn, Austin, TX 78712 USA
[3] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
Constrained Bayesian inference; Gaussian process; Transposable data; Nuclear norm; Low rank; GENOME-WIDE ASSOCIATION; DISEASE GENES; REGULARIZATION; PRIORITIZATION; KERNELS;
D O I
10.1007/s10994-014-5444-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transposable data represents interactions among two sets of entities, and are typically represented as a matrix containing the known interaction values. Additional side information may consist of feature vectors specific to entities corresponding to the rows and/or columns of such a matrix. Further information may also be available in the form of interactions or hierarchies among entities along the same mode (axis). We propose a novel approach for modeling transposable data with missing interactions given additional side information. The interactions are modeled as noisy observations from a latent noise free matrix generated from a matrix-variate Gaussian process. The construction of row and column covariances using side information provides a flexible mechanism for specifying a-priori knowledge of the row and column correlations in the data. Further, the use of such a prior combined with the side information enables predictions for new rows and columns not observed in the training data. In this work, we combine the matrix-variate Gaussian process model with low rank constraints. The constrained Gaussian process approach is applied to the prediction of hidden associations between genes and diseases using a small set of observed associations as well as prior covariances induced by gene-gene interaction networks and disease ontologies. The proposed approach is also applied to recommender systems data which involves predicting the item ratings of users using known associations as well as prior covariances induced by social networks. We present experimental results that highlight the performance of constrained matrix-variate Gaussian process as compared to state of the art approaches in each domain.
引用
收藏
页码:103 / 127
页数:25
相关论文
共 50 条
  • [21] Constrained Factor Models for High-Dimensional Matrix-Variate Time Series
    Chen, Elynn Y.
    Tsay, Ruey S.
    Chen, Rong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (530) : 775 - 793
  • [22] A Matrix-Variate t Model for Networks
    Billio, Monica
    Casarin, Roberto
    Costola, Michele
    Iacopini, Matteo
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
  • [23] Matrix-variate variational auto-encoder with applications to image process
    Li, Jinghua
    Yan, Huixia
    Gao, Junbin
    Kong, Dehui
    Wang, Lichun
    Wang, Shaofan
    Yin, Baocai
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 67
  • [24] MATRIX-VARIATE GAUSS HYPERGEOMETRIC DISTRIBUTION
    Gupta, Arjun K.
    Nagar, Daya K.
    JOURNAL OF THE AUSTRALIAN MATHEMATICAL SOCIETY, 2012, 92 (03) : 335 - 355
  • [25] Dynamic Matrix-Variate Graphical Models
    Carvalho, Carlos M.
    West, Mike
    BAYESIAN ANALYSIS, 2007, 2 (01): : 69 - 97
  • [26] GENERALIZED MATRIX-VARIATE HYPERGEOMETRIC DISTRIBUTION
    VANDERME.GJ
    ROUX, JJJ
    SOUTH AFRICAN STATISTICAL JOURNAL, 1974, 8 (01) : 49 - 58
  • [27] A matrix-variate dirichlet process to model earthquake hypocentre temporal patterns
    A. Ray, Meredith
    Bowman, Dale
    Csontos, Ryan
    Van Arsdale, Roy B.
    Zhang, Hongmei
    STATISTICAL MODELLING, 2022, 22 (04) : 245 - 272
  • [28] A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models
    Scharl, Theresa
    Gruen, Bettina
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [29] A clustering procedure for three-way RNA sequencing data using data transformations and matrix-variate Gaussian mixture models
    Theresa Scharl
    Bettina Grün
    BMC Bioinformatics, 25
  • [30] Matrix-variate Kummer-Beta distribution
    Nagar, DK
    Gupta, AK
    JOURNAL OF THE AUSTRALIAN MATHEMATICAL SOCIETY, 2002, 73 : 11 - 25