Constrained Covariance Matrices With a Biologically Realistic Structure: Comparison of Methods for Generating High-Dimensional Gaussian Graphical Models

被引:5
|
作者
Emmert-Streib, Frank [1 ,2 ]
Tripathi, Shailesh [1 ,3 ,4 ]
Dehmer, Matthias [3 ,5 ]
机构
[1] Tampere Univ, Fac Informat Technol & Commun Sci, Predict Soc & Data Analyt Lab, Tampere, Finland
[2] Inst Biosci & Med Technol, Tampere, Finland
[3] Univ Appl Sci Upper Austria, Inst Intelligent Prod, Fac Management, Wels, Austria
[4] UMIT, Dept Mech & Biomed Comp Sci, Hall In Tirol, Austria
[5] Nankai Univ, Coll Comp & Control Engn, Tianjin, Peoples R China
关键词
Gaussian graphical models; network science; machine learning; data science; genomics; gene regulatory networks; statistics;
D O I
10.3389/fams.2019.00017
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
High-dimensional data from molecular biology possess an intricate correlation structure that is imposed by the molecular interactions between genes and their products forming various different types of gene networks. This fact is particularly well-known for gene expression data, because there is a sufficient number of large-scale data sets available that are amenable for a sensible statistical analysis confirming this assertion. The purpose of this paper is two fold. First, we investigate three methods for generating constrained covariance matrices with a biologically realistic structure. Such covariance matrices are playing a pivotal role in designing novel statistical methods for high-dimensional biological data, because they allow to define Gaussian graphical models (GGM) for the simulation of realistic data; including their correlation structure. We study local and global characteristics of these covariance matrices, and derived concentration/partial correlation matrices. Second, we connect these results, obtained from a probabilistic perspective, to statistical results of studies aiming to estimate gene regulatory networks frombiological data. This connection allows to shed light on the well-known heterogeneity of statistical estimation methods for inferring gene regulatory networks and provides an explanation for the difficulties inferring molecular interactions between highly connected genes.
引用
收藏
页数:15
相关论文
共 44 条
  • [31] High-dimensional generalized linear models incorporating graphical structure among predictors
    Zhou, Shengbin
    Zhou, Jingke
    Zhang, Bo
    ELECTRONIC JOURNAL OF STATISTICS, 2019, 13 (02): : 3161 - 3194
  • [32] High-dimensional rank-based graphical models for non-Gaussian functional data
    Solea, Eftychia
    Al Hajj, Rayan
    STATISTICS, 2023, : 388 - 422
  • [33] Tests for high-dimensional generalized linear models under general covariance structure
    Yang, Weichao
    Guo, Xu
    Zhu, Lixing
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 199
  • [34] Learning High-dimensional Gaussian Graphical Models under Total Positivity without Adjustment of Tuning Parameters
    Wang, Yuhao
    Roy, Uma
    Uhler, Caroline
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108
  • [35] High-Dimensional Sparse Graph Estimation by Integrating DTW-D Into Bayesian Gaussian Graphical Models
    Li, Ying
    Xu, Xiaojun
    Li, Jianbo
    IEEE ACCESS, 2018, 6 : 34279 - 34287
  • [36] Simultaneous change point inference and structure recovery for high dimensional gaussian graphical models
    Liu, Bin
    Zhang, Xinsheng
    Liu, Yufeng
    Journal of Machine Learning Research, 2021, 22
  • [37] The comparison of the estimators of banded toeplitz covariance structure under the high-dimensional multivariate model
    John, Mateusz
    Mieldzioc, Adam
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2020, 49 (03) : 734 - 752
  • [38] A comparison of computational methods for high-dimensional hierarchical nonlinear models for repeated measurements
    Segawa, E
    duToit, SHC
    Bock, RD
    AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE STATISTICAL COMPUTING SECTION, 1996, : 130 - 135
  • [39] High-Dimensional Consistencies of KOO Methods for the Selection of Variables in Multivariate Linear Regression Models with Covariance Structures
    Fujikoshi, Yasunori
    Sakurai, Tetsuro
    MATHEMATICS, 2023, 11 (03)
  • [40] Iterative Reconstruction of High-Dimensional Gaussian Graphical Models Based on a New Method to Estimate Partial Correlations under Constraints
    Guillemot, Vincent
    Bender, Andreas
    Boulesteix, Anne-Laure
    PLOS ONE, 2013, 8 (04):