Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning

Authors
Tran, Linh N. [1, 2]
Sun, Connie K. [2 ]
Struck, Travis J. [2 ]
Sajan, Mathews [2 ]
Gutenkunst, Ryan N. [2 ]
Affiliations
[1] Univ Arizona, Genet Grad Interdisciplinary Program, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Mol & Cellular Biol, Tucson, AZ 85721 USA
Funding
US National Institutes of Health
Keywords
population genomics; demographic history inference; machine learning; POPULATION-GENETICS; SPECTRUM;
DOI
10.1093/molbev/msae077
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
Inferring the past demographic history of natural populations from genomic data is a central concern in many studies across research fields. Our group previously developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters, then trains a set of Mean Variance Estimation neural networks on the simulated AFS. The trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from an input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization, while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
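The abstract says the Mean Variance Estimation networks yield both parameter estimates and confidence intervals. As a rough illustration of that idea only, the sketch below shows how a predicted mean and variance are commonly scored with a Gaussian negative log-likelihood and turned into an interval. The abstract does not specify donni's loss or interval construction, so the function names, the log-variance parameterization, and all numbers here are assumptions made for illustration.

```python
import numpy as np


def gaussian_nll(y, mean, log_var):
    """Per-sample Gaussian negative log-likelihood.

    Assumption: this is the generic loss used for mean-variance
    estimation, not necessarily the one donni uses.
    """
    var = np.exp(log_var)
    return 0.5 * (np.log(2 * np.pi) + log_var + (y - mean) ** 2 / var)


def interval(mean, log_var, z=1.96):
    """Symmetric interval mean +/- z * sigma (z = 1.96 is ~95% if Gaussian)."""
    sigma = np.exp(0.5 * log_var)
    return mean - z * sigma, mean + z * sigma


# Hypothetical network outputs for three test AFS (made-up numbers).
true_param = np.array([0.5, 1.0, 2.0])
pred_mean = np.array([0.6, 0.9, 2.3])
pred_log_var = np.log(np.array([0.04, 0.01, 0.25]))

# Average loss over the three examples, and per-example interval coverage.
loss = gaussian_nll(true_param, pred_mean, pred_log_var).mean()
lower, upper = interval(pred_mean, pred_log_var)
covered = (lower <= true_param) & (true_param <= upper)
```

Predicting the log-variance rather than the variance keeps the variance positive without a constraint, which is a common design choice in this kind of sketch.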
Pages: 11