Constrained Covariance Matrices With a Biologically Realistic Structure: Comparison of Methods for Generating High-Dimensional Gaussian Graphical Models

被引:5
|
作者
Emmert-Streib, Frank [1 ,2 ]
Tripathi, Shailesh [1 ,3 ,4 ]
Dehmer, Matthias [3 ,5 ]
机构
[1] Tampere Univ, Fac Informat Technol & Commun Sci, Predict Soc & Data Analyt Lab, Tampere, Finland
[2] Inst Biosci & Med Technol, Tampere, Finland
[3] Univ Appl Sci Upper Austria, Inst Intelligent Prod, Fac Management, Wels, Austria
[4] UMIT, Dept Mech & Biomed Comp Sci, Hall In Tirol, Austria
[5] Nankai Univ, Coll Comp & Control Engn, Tianjin, Peoples R China
关键词
Gaussian graphical models; network science; machine learning; data science; genomics; gene regulatory networks; statistics;
D O I
10.3389/fams.2019.00017
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
High-dimensional data from molecular biology possess an intricate correlation structure that is imposed by the molecular interactions between genes and their products forming various different types of gene networks. This fact is particularly well-known for gene expression data, because there is a sufficient number of large-scale data sets available that are amenable for a sensible statistical analysis confirming this assertion. The purpose of this paper is two fold. First, we investigate three methods for generating constrained covariance matrices with a biologically realistic structure. Such covariance matrices are playing a pivotal role in designing novel statistical methods for high-dimensional biological data, because they allow to define Gaussian graphical models (GGM) for the simulation of realistic data; including their correlation structure. We study local and global characteristics of these covariance matrices, and derived concentration/partial correlation matrices. Second, we connect these results, obtained from a probabilistic perspective, to statistical results of studies aiming to estimate gene regulatory networks frombiological data. This connection allows to shed light on the well-known heterogeneity of statistical estimation methods for inferring gene regulatory networks and provides an explanation for the difficulties inferring molecular interactions between highly connected genes.
引用
收藏
页数:15
相关论文
共 44 条
  • [1] High-dimensional Covariance Estimation Based On Gaussian Graphical Models
    Zhou, Shuheng
    Ruetimann, Philipp
    Xu, Min
    Buehlmann, Peter
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 2975 - 3026
  • [2] Block-Diagonal Covariance Selection for High-Dimensional Gaussian Graphical Models
    Devijver, Emilie
    Gallopin, Melina
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (521) : 306 - 314
  • [3] High-Dimensional Gaussian Graphical Regression Models with Covariates
    Zhang, Jingfei
    Li, Yi
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (543) : 2088 - 2100
  • [4] HIGH-DIMENSIONAL SEMIPARAMETRIC GAUSSIAN COPULA GRAPHICAL MODELS
    Liu, Han
    Han, Fang
    Yuan, Ming
    Lafferty, John
    Wasserman, Larry
    ANNALS OF STATISTICS, 2012, 40 (04): : 2293 - 2326
  • [5] Uniform inference in high-dimensional Gaussian graphical models
    Klaassen, S.
    Kueck, J.
    Spindler, M.
    Chernozhukov, V
    BIOMETRIKA, 2023, 110 (01) : 51 - 68
  • [6] High-Dimensional Dynamic Covariance Matrices With Homogeneous Structure
    Ke, Yuan
    Lian, Heng
    Zhang, Wenyang
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 40 (01) : 96 - 110
  • [7] CHANGE DETECTION IN THE COVARIANCE STRUCTURE OF HIGH-DIMENSIONAL GAUSSIAN LOW-RANK MODELS
    Beisson, R.
    Vallet, P.
    Giremus, A.
    Ginolhac, G.
    2021 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2021, : 421 - 425
  • [8] Quantifying Nonlocal Informativeness in High-Dimensional, Loopy Gaussian Graphical Models
    Levine, Daniel
    How, Jonathan P.
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 487 - 495
  • [9] High-dimensional Gaussian graphical models on network-linked data
    Li, Tianxi
    Qian, Cheng
    Levina, Elizaveta
    Zhu, Ji
    1600, Microtome Publishing (21):
  • [10] Fast and Separable Estimation in High-Dimensional Tensor Gaussian Graphical Models
    Min, Keqian
    Mai, Qing
    Zhang, Xin
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (01) : 294 - 300