REGULARIZED RANK-BASED ESTIMATION OF HIGH-DIMENSIONAL NONPARANORMAL GRAPHICAL MODELS

被引:162
|
作者
Xue, Lingzhou [1 ]
Zou, Hui [1 ]
机构
[1] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
来源
ANNALS OF STATISTICS | 2012年 / 40卷 / 05期
基金
美国国家科学基金会;
关键词
CLIME; Dantzig selector; graphical lasso; nonparanormal graphical model; rate of convergence; variable transformation; COVARIANCE ESTIMATION; VARIABLE SELECTION; DANTZIG SELECTOR; LASSO; BIOSYNTHESIS; LIKELIHOOD; PATHWAYS;
D O I
10.1214/12-AOS1041
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A sparse precision matrix can be directly translated into a sparse Gaussian graphical model under the assumption that the data follow a joint normal distribution. This neat property makes high-dimensional precision matrix estimation very appealing in many applications. However, in practice we often face nonnormal data, and variable transformation is often used to achieve normality. In this paper we consider the nonparanormal model that assumes that the variables follow a joint normal distribution after a set of unknown monotone transformations. The nonparanormal model is much more flexible than the normal model while retaining the good interpretability of the latter in that each zero entry in the sparse precision matrix of the nonparanormal model corresponds to a pair of conditionally independent variables. In this paper we show that the nonparanormal graphical model can be efficiently estimated by using a rank-based estimation scheme which does not require estimating these unknown transformation functions. In particular, we study the rank-based graphical lasso, the rank-based neighborhood Dantzig selector and the rank-based CLIME. We establish their theoretical properties in the setting where the dimension is nearly exponentially large relative to the sample size. It is shown that the proposed rank-based estimators work as well as their oracle counterparts defined with the oracle data. Furthermore, the theory motivates us to consider the adaptive version of the rank-based neighborhood Dantzig selector and the rank-based CLIME that are shown to enjoy graphical model selection consistency without assuming the irrepresentable condition for the oracle and rank-based graphical lasso. Simulated and real data are used to demonstrate the finite performance of the rank-based estimators.
引用
收藏
页码:2541 / 2571
页数:31
相关论文
共 50 条
  • [21] Rank-based lasso - Efficient methods for high-dimensional robust model selection
    Rejchel, Wojciech
    Bogdan, Malgorzata
    1600, Microtome Publishing (21):
  • [22] Rank-based Lasso - efficient methods for high-dimensional robust model selection
    Rejchel, Wojciech
    Bogdan, Malgorzata
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [23] Monitoring high-dimensional heteroscedastic processes using rank-based EWMA methods
    Wang, Zezhong
    Goedhart, Rob
    Zwetsloot, Inez Maria
    COMPUTERS & INDUSTRIAL ENGINEERING, 2023, 184
  • [24] Rfit: Rank-based Estimation for Linear Models
    Kloke, John D.
    McKean, Joseph W.
    R JOURNAL, 2012, 4 (02): : 57 - 64
  • [25] High-Dimensional Mixed Graphical Models
    Cheng, Jie
    Li, Tianxi
    Levina, Elizaveta
    Zhu, Ji
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2017, 26 (02) : 367 - 378
  • [26] Rank-based correlation matrix estimation for high dimensional microbiome data
    Wang, Jiyang
    Liang, Wanfeng
    Li, Lijie
    Zou, Feng
    STATISTICS, 2024, 58 (05) : 1169 - 1196
  • [27] Joint estimation of multiple high-dimensional Gaussian copula graphical models
    He, Yong
    Zhang, Xinsheng
    Ji, Jiadong
    Liu, Bin
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2017, 59 (03) : 289 - 310
  • [28] High-dimensional joint estimation of multiple directed Gaussian graphical models
    Wang, Yuhao
    Segarra, Santiago
    Uhler, Caroline
    ELECTRONIC JOURNAL OF STATISTICS, 2020, 14 (01): : 2439 - 2483
  • [29] Fast and Separable Estimation in High-Dimensional Tensor Gaussian Graphical Models
    Min, Keqian
    Mai, Qing
    Zhang, Xin
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (01) : 294 - 300
  • [30] Rank-based sequential feature selection for high-dimensional accelerated failure time models with main and interaction effects
    Yu, Ke
    Luo, Shan
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2024, 197