Skeleton estimation of directed acyclic graphs using partial least squares from correlated data

被引:5
|
作者
Wang, Xiaokang [1 ]
Lu, Shan [2 ]
Zhou, Rui [3 ]
Wang, Huiwen [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing, Peoples R China
[2] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[3] Huaxia Bank, Risk Management Dept, Beijing, Peoples R China
[4] Beihang Univ, Sch Econ & Management, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Directed acyclic graph; partial least squares; hierarchical clustering; sparse learning; REGRESSION; NETWORKS; LASSO; PLS;
D O I
10.1016/j.patcog.2023.109460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Directed acyclic graphs (DAGs) are directed graphical models that are well known for discovering causal relationships between variables in a high-dimensional setting. When the DAG is not identifiable due to the lack of interventional data, the skeleton can be estimated using observational data, which is formed by removing the direction of the edges in a DAG. In real data analyses, variables are often highly corre-lated due to some form of clustered sampling, and ignoring this correlation will inflate the standard er-rors of the parameter estimates in the regression-based DAG structure learning framework. In this work, we propose a two-stage DAG skeleton estimation approach for highly correlated data. First, we propose a novel neighborhood selection method based on sparse partial least squares (PLS) regression, and a cluster -weighted adaptive penalty is imposed on the PLS weight vectors to exploit the local information. In the second stage, the DAG skeleton is estimated by evaluating a set of conditional independence hypotheses. Simulation studies are presented to demonstrate the effectiveness of the proposed method. The algorithm is also tested on publicly available datasets, and we show that our algorithm obtains higher sensitivity with comparable false discovery rates for high-dimensional data under different network structures.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] MARKOVIAN ACYCLIC DIRECTED MIXED GRAPHS FOR DISCRETE DATA
    Evans, Robin J.
    Richardson, Thomas S.
    ANNALS OF STATISTICS, 2014, 42 (04): : 1452 - 1482
  • [22] Partial least squares for discrimination in fMRI data
    Andersen, Anders H.
    Rayens, William S.
    Liu, Yushu
    Smith, Charles D.
    MAGNETIC RESONANCE IMAGING, 2012, 20 (03) : 446 - 452
  • [23] Transformed partial least squares for multivariate data
    Zhu, Li-Xing
    Zhu, Li-Ping
    Li, Xin
    STATISTICA SINICA, 2007, 17 (04) : 1657 - 1675
  • [24] Partial Least Squares Regression for Binary Data
    Vicente-Gonzalez, Laura
    Frutos-Bernal, Elisa
    Vicente-Villardon, Jose Luis
    MATHEMATICS, 2025, 13 (03)
  • [25] Using Partial Least Squares Regression to Analyze Cellular Response Data
    Kreeger, Pamela K.
    SCIENCE SIGNALING, 2013, 6 (271)
  • [26] MULTIVARIATE FUNCTIONAL PARTIAL LEAST SQUARES FOR CLASSIFICATION USING LONGITUDINAL DATA
    Dembowska, Sonia
    Frangi, Alex
    Houwing-Duistermaat, Jeanine
    Liu, Haiyan
    THEORETICAL BIOLOGY FORUM, 2021, 114 (01) : 75 - 88
  • [27] Kernel Partial Least Squares for Stationary Data
    Singer, Marco
    Krivobokova, Tatyana
    Munk, Axel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2017, 18
  • [28] On the decoding of intracranial data using sparse orthonormalized partial least squares
    van Gerven, Marcel A. J.
    Chao, Zenas C.
    Heskes, Tom
    JOURNAL OF NEURAL ENGINEERING, 2012, 9 (02)
  • [29] Multivariate data modeling using modified kernel partial least squares
    Gao Yingbin
    Kong Xiangyu
    Hu Changhua
    Zhang Zhengxin
    Li Hongzeng
    Hou Li'an
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2015, 94 : 466 - 474
  • [30] THE EXTENDED-LEAST-SQUARES TREATMENT OF CORRELATED DATA
    COHEN, ER
    TUNINSKY, VS
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 1995, 44 (02) : 475 - 478