Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data

Cited by: 46
Authors
Wang, Chan [1]
Hu, Jiyuan [1]
Blaser, Martin J. [2]
Li, Huilin [1]
Affiliations
[1] NYU, Dept Populat Hlth, Div Biostat, Sch Med, New York, NY 10016 USA
[2] Rutgers State Univ, Ctr Adv Biotechnol & Med, Dept Med & Microbiol, Piscataway, NJ 08854 USA
Funding
National Institutes of Health (USA)
Keywords
VARIABLE SELECTION; REGRESSION; MODEL; DIET
DOI
10.1093/bioinformatics/btz565
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Recent microbiome association studies have revealed important associations between the microbiome and disease or health status. Such findings encourage scientists to dive deeper to uncover the causal role of the microbiome in the underlying biological mechanism, and have led to the application of statistical models to quantify causal microbiome effects and to identify the specific microbial agents. However, there are no existing causal mediation methods specifically designed to handle high-dimensional and compositional microbiome data.
Results: We propose a rigorous Sparse Microbial Causal Mediation Model (SparseMCMM) specifically designed for high-dimensional and compositional microbiome data in a typical three-factor (treatment, microbiome and outcome) causal study design. In particular, a linear log-contrast regression model and a Dirichlet regression model are proposed to estimate the causal direct effect of treatment and the causal mediation effects of the microbiome at both the community and individual taxon levels. Regularization techniques are used to perform variable selection in the proposed model framework to identify signature causal microbes. Two hypothesis tests on the overall mediation effect are proposed, and their statistical significance is estimated by permutation procedures. Extensive simulated scenarios show that SparseMCMM has excellent performance in estimation and hypothesis testing. Finally, we showcase the utility of the proposed SparseMCMM method in a study in which the murine microbiome was manipulated, providing a clear and sensible causal path among antibiotic treatment, microbiome composition and mouse weight.
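The linear log-contrast regression mentioned in the abstract can be illustrated with a minimal sketch. This is not the SparseMCMM implementation: it omits the regularization, the Dirichlet regression for the microbiome, and the permutation tests, and it only shows how a zero-sum constraint on the taxon coefficients can be handled for compositional predictors. The function name `fit_log_contrast`, the simulated data and all parameter values are assumptions made for illustration.

```python
import numpy as np

def fit_log_contrast(y, T, M):
    """Least-squares fit of y = a0 + aT*T + log(M) @ aM, with sum(aM) = 0.

    Hypothetical helper for illustration only; no sparsity penalty is applied.
    """
    logM = np.log(M)
    # Under the zero-sum constraint, log(M) @ aM equals a regression on
    # log-ratios against the last taxon (additive log-ratio transform).
    alr = logM[:, :-1] - logM[:, [-1]]
    X = np.column_stack([np.ones_like(y), T, alr])
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    # Recover the last coefficient from the constraint.
    aM = np.append(coef[2:], -coef[2:].sum())
    return coef[0], coef[1], aM

# Simulated data (assumed values): a binary treatment shifts the
# composition, and the composition affects the outcome.
rng = np.random.default_rng(0)
n, p = 200, 5
T = rng.integers(0, 2, size=n).astype(float)
conc = np.full((n, p), 2.0)
conc[:, 0] += 3.0 * T  # treatment raises the first taxon's concentration
M = np.vstack([rng.dirichlet(c) for c in conc])  # rows sum to 1
true_aM = np.array([1.0, -1.0, 0.0, 0.0, 0.0])  # sums to zero
y = 0.5 + 1.0 * T + np.log(M) @ true_aM + rng.normal(scale=0.1, size=n)

a0, aT, aM = fit_log_contrast(y, T, M)
print("direct treatment coefficient:", round(aT, 3))
print("taxon coefficients:", np.round(aM, 3))
```

The zero-sum constraint makes the fit invariant to the arbitrary scaling of compositional data, which is why the coefficients are estimated on log-ratios rather than on raw proportions.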
Pages: 347-355
Page count: 9