Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data

被引:46
|
作者
Wang, Chan [1 ]
Hu, Jiyuan [1 ]
Blaser, Martin J. [2 ]
Li, Huilin [1 ]
机构
[1] NYU, Dept Populat Hlth, Div Biostat, Sch Med, New York, NY 10016 USA
[2] Rutgers State Univ, Ctr Adv Biotechnol & Med, Dept Med & Microbiol, Piscataway, NJ 08854 USA
基金
美国国家卫生研究院;
关键词
VARIABLE SELECTION; REGRESSION; MODEL; DIET;
D O I
10.1093/bioinformatics/btz565
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Recent microbiome association studies have revealed important associations between microbiome and disease/health status. Such findings encourage scientists to dive deeper to uncover the causal role of microbiome in the underlying biological mechanism, and have led to applying statistical models to quantify causal microbiome effects and to identify the specific microbial agents. However, there are no existing causal mediation methods specifically designed to handle high dimensional and compositional microbiome data. Results: We propose a rigorous Sparse Microbial Causal Mediation Model (SparseMCMM) specifically designed for the high dimensional and compositional microbiome data in a typical three-factor (treatment, microbiome and outcome) causal study design. In particular, linear log-contrast regression model and Dirichlet regression model are proposed to estimate the causal direct effect of treatment and the causal mediation effects of microbiome at both the community and individual taxon levels. Regularization techniques are used to perform the variable selection in the proposed model framework to identify signature causal microbes. Two hypothesis tests on the overall mediation effect are proposed and their statistical significance is estimated by permutation procedures. Extensive simulated scenarios show that SparseMCMM has excellent performance in estimation and hypothesis testing. Finally, we showcase the utility of the proposed SparseMCMM method in a study which the murine microbiome has been manipulated by providing a clear and sensible causal path among antibiotic treatment, microbiome composition and mouse weight.
引用
收藏
页码:347 / 355
页数:9
相关论文
共 50 条
  • [21] Testing ARCH effect of high-dimensional time series data
    Li, Xuejiao
    Wei, Shufang
    Yang, Yaxing
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [22] High-dimensional log-error-in-variable regression with applications to microbial compositional data analysis
    Shi, Pixu
    Zhou, Yuchen
    Zhang, Anru R.
    BIOMETRIKA, 2022, 109 (02) : 405 - 420
  • [23] Power-enhanced two-sample mean tests for high-dimensional microbiome compositional data
    Li, Danning
    Xue, Lingzhou
    Yang, Haoyi
    Yu, Xiufan
    BIOMETRICS, 2025, 81 (02)
  • [24] Estimating dependency and significance for high-dimensional data
    Siracusa, MR
    Tieu, K
    Ihler, AT
    Fisher, JW
    Willsky, AS
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 1085 - 1088
  • [25] High-dimensional multivariate mediation with application to neuroimaging data
    Chen, Oliver Y.
    Crainiceanu, Ciprian
    Ogburn, Elizabeth L.
    Caffo, Brian S.
    Wager, Tor D.
    Lindquist, Martin A.
    BIOSTATISTICS, 2018, 19 (02) : 121 - 136
  • [26] Mediation analysis for survival data with high-dimensional mediators
    Zhang, Haixiang
    Zheng, Yinan
    Hou, Lifang
    Zheng, Cheng
    Liu, Lei
    BIOINFORMATICS, 2021, 37 (21) : 3815 - 3821
  • [27] On testing mean of high dimensional compositional data
    Jiang, Qianqian
    Li, Wenbo
    Li, Zeng
    STATISTICS & PROBABILITY LETTERS, 2025, 222
  • [28] A Multiple-Testing Procedure for High-Dimensional Mediation Hypotheses
    Dai, James Y.
    Stanford, Janet L.
    LeBlanc, Michael
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (537) : 198 - 213
  • [29] PROGRAM EVALUATION AND CAUSAL INFERENCE WITH HIGH-DIMENSIONAL DATA
    Belloni, A.
    Chernozhukov, V.
    Fernandez-Val, I.
    Hansen, C.
    ECONOMETRICA, 2017, 85 (01) : 233 - 298
  • [30] MENDELIAN RANDOMIZATION TEST OF CAUSAL EFFECT USING HIGH-DIMENSIONAL SUMMARY DATA
    Deng, Lu
    Wheeler, William
    Yu, Kai
    STATISTICA SINICA, 2023, 33 : 1365 - 1387