Skeleton estimation of directed acyclic graphs using partial least squares from correlated data

被引:5
|
作者
Wang, Xiaokang [1 ]
Lu, Shan [2 ]
Zhou, Rui [3 ]
Wang, Huiwen [4 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Econ & Management, Beijing, Peoples R China
[2] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[3] Huaxia Bank, Risk Management Dept, Beijing, Peoples R China
[4] Beihang Univ, Sch Econ & Management, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Directed acyclic graph; partial least squares; hierarchical clustering; sparse learning; REGRESSION; NETWORKS; LASSO; PLS;
D O I
10.1016/j.patcog.2023.109460
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Directed acyclic graphs (DAGs) are directed graphical models that are well known for discovering causal relationships between variables in a high-dimensional setting. When the DAG is not identifiable due to the lack of interventional data, the skeleton can be estimated using observational data, which is formed by removing the direction of the edges in a DAG. In real data analyses, variables are often highly corre-lated due to some form of clustered sampling, and ignoring this correlation will inflate the standard er-rors of the parameter estimates in the regression-based DAG structure learning framework. In this work, we propose a two-stage DAG skeleton estimation approach for highly correlated data. First, we propose a novel neighborhood selection method based on sparse partial least squares (PLS) regression, and a cluster -weighted adaptive penalty is imposed on the PLS weight vectors to exploit the local information. In the second stage, the DAG skeleton is estimated by evaluating a set of conditional independence hypotheses. Simulation studies are presented to demonstrate the effectiveness of the proposed method. The algorithm is also tested on publicly available datasets, and we show that our algorithm obtains higher sensitivity with comparable false discovery rates for high-dimensional data under different network structures.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Penalized estimation of directed acyclic graphs from discrete data
    Jiaying Gu
    Fei Fu
    Qing Zhou
    Statistics and Computing, 2019, 29 : 161 - 176
  • [2] Penalized estimation of directed acyclic graphs from discrete data
    Gu, Jiaying
    Fu, Fei
    Zhou, Qing
    STATISTICS AND COMPUTING, 2019, 29 (01) : 161 - 176
  • [3] Joint skeleton estimation of multiple directed acyclic graphs for heterogeneous population
    Liu, Jianyu
    Sun, Wei
    Liu, Yufeng
    BIOMETRICS, 2019, 75 (01) : 36 - 47
  • [4] Estimation of Sparse Directed Acyclic Graphs for Multivariate Counts Data
    Han, Sung Won
    Zhong, Hua
    BIOMETRICS, 2016, 72 (03) : 791 - 803
  • [5] Finding least common ancestors in directed acyclic graphs
    Bender, MA
    Pemmasani, G
    Skiena, S
    Sumazin, P
    PROCEEDINGS OF THE TWELFTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2001, : 845 - 854
  • [6] Exact estimation of multiple directed acyclic graphs
    Oates, Chris J.
    Smith, Jim Q.
    Mukherjee, Sach
    Cussens, James
    STATISTICS AND COMPUTING, 2016, 26 (04) : 797 - 811
  • [7] Exact estimation of multiple directed acyclic graphs
    Chris J. Oates
    Jim Q. Smith
    Sach Mukherjee
    James Cussens
    Statistics and Computing, 2016, 26 : 797 - 811
  • [8] ESTIMATION OF GAUSSIAN DIRECTED ACYCLIC GRAPHS USING PARTIAL ORDERING INFORMATION WITH APPLICATIONS TO DREAM3 NETWORKS AND DAIRY CATTLE DATA
    Rahman, Syed
    Khare, Kshitij
    Michailidis, George
    Martinez, Carlos
    Carulla, Juan
    ANNALS OF APPLIED STATISTICS, 2023, 17 (02): : 929 - 960
  • [9] Unique Least Common Ancestors and Clusters in Directed Acyclic Graphs
    Shanavas, Ameera Vaheeda
    Changat, Manoj
    Hellmuth, Marc
    Stadler, Peter F.
    ALGORITHMS AND DISCRETE APPLIED MATHEMATICS, CALDAM 2024, 2024, 14508 : 148 - 161
  • [10] Missing values estimation in microarray data with partial least squares regression
    Yang, Kun
    Li, Jianzhong
    Wang, Chaokun
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 662 - 669