A constrained matrix-variate Gaussian process for transposable data

被引:1
|
作者
Koyejo, Oluwasanmi [1 ]
Lee, Cheng [2 ]
Ghosh, Joydeep [3 ]
机构
[1] Univ Texas Austin, Imaging Res Ctr, Austin, TX 78712 USA
[2] Univ Texas Austin, Dept Biomed Engn, Austin, TX 78712 USA
[3] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX 78712 USA
关键词
Constrained Bayesian inference; Gaussian process; Transposable data; Nuclear norm; Low rank; GENOME-WIDE ASSOCIATION; DISEASE GENES; REGULARIZATION; PRIORITIZATION; KERNELS;
D O I
10.1007/s10994-014-5444-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transposable data represents interactions among two sets of entities, and are typically represented as a matrix containing the known interaction values. Additional side information may consist of feature vectors specific to entities corresponding to the rows and/or columns of such a matrix. Further information may also be available in the form of interactions or hierarchies among entities along the same mode (axis). We propose a novel approach for modeling transposable data with missing interactions given additional side information. The interactions are modeled as noisy observations from a latent noise free matrix generated from a matrix-variate Gaussian process. The construction of row and column covariances using side information provides a flexible mechanism for specifying a-priori knowledge of the row and column correlations in the data. Further, the use of such a prior combined with the side information enables predictions for new rows and columns not observed in the training data. In this work, we combine the matrix-variate Gaussian process model with low rank constraints. The constrained Gaussian process approach is applied to the prediction of hidden associations between genes and diseases using a small set of observed associations as well as prior covariances induced by gene-gene interaction networks and disease ontologies. The proposed approach is also applied to recommender systems data which involves predicting the item ratings of users using known associations as well as prior covariances induced by social networks. We present experimental results that highlight the performance of constrained matrix-variate Gaussian process as compared to state of the art approaches in each domain.
引用
收藏
页码:103 / 127
页数:25
相关论文
共 50 条
  • [1] A constrained matrix-variate Gaussian process for transposable data
    Oluwasanmi Koyejo
    Cheng Lee
    Joydeep Ghosh
    Machine Learning, 2014, 97 : 103 - 127
  • [2] GRAPH ESTIMATION FOR MATRIX-VARIATE GAUSSIAN DATA
    Chen, Xi
    Liu, Weidong
    STATISTICA SINICA, 2019, 29 (01) : 479 - 504
  • [3] Modal clustering of matrix-variate data
    Ferraccioli, Federico
    Menardi, Giovanna
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2023, 17 (02) : 323 - 345
  • [4] Modal clustering of matrix-variate data
    Federico Ferraccioli
    Giovanna Menardi
    Advances in Data Analysis and Classification, 2023, 17 : 323 - 345
  • [5] Matrix-Variate Dirichlet Process Priors with Applications
    Zhang, Zhihua
    Wang, Dakan
    Dai, Guang
    Jordan, Michael I.
    BAYESIAN ANALYSIS, 2014, 9 (02): : 259 - 285
  • [6] Some properties of matrix-variate Laplace transforms and matrix-variate Whittaker functions
    Mathai, AM
    Pederzoli, G
    LINEAR ALGEBRA AND ITS APPLICATIONS, 1997, 253 : 209 - 226
  • [7] Bayesian nonparametric estimation of Milky Way parameters using matrix-variate data, in a new Gaussian Process based method
    Chakrabarty, Dalia
    Biswas, Munmun
    Bhattacharya, Sourabh
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 1378 - 1403
  • [8] DNNLasso: Scalable Graph Learning for Matrix-Variate Data
    Lin, Meixia
    Zhang, Yangjing
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [9] Core shrinkage covariance estimation for matrix-variate data
    Hoff, Peter
    McCormack, Andrew
    Zhang, Anru R.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2024, 85 (05) : 1659 - 1679
  • [10] On the Matrix-Variate Beta Distribution
    Zine, Raoudha
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (09) : 1569 - 1582