Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning

被引:0
|
作者
Tran, Linh N. [1 ,2 ]
Sun, Connie K. [2 ]
Struck, Travis J. [2 ]
Sajan, Mathews [2 ]
Gutenkunst, Ryan N. [2 ]
机构
[1] Univ Arizona, Genet Grad Interdisciplinary Program, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Mol & Cellular Biol, Tucson, AZ 85721 USA
基金
美国国家卫生研究院;
关键词
population genomics; demographic history inference; machine learning; POPULATION-GENETICS; SPECTRUM;
D O I
10.1093/molbev/msae077
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] An Efficient and Zero-Knowledge Classical Machine Learning Inference Pipeline
    Wang, Haodi
    Bie, Rongfang
    Hoang, Thang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2025, 22 (02) : 1347 - 1364
  • [32] DynaFuse: Dynamic Fusion for Resource Efficient Multimodal Machine Learning Inference
    Alikhani, Hamidreza
    Kanduri, Anil
    Liljeberg, Pasi
    Rahmani, Amir M.
    Dutt, Nikil
    IEEE EMBEDDED SYSTEMS LETTERS, 2023, 15 (04) : 222 - 225
  • [33] Predicting nutritional status for women of childbearing age from their economic, health, and demographic features: A supervised machine learning approach
    Khudri, Md. Mohsan
    Rhee, Kang Keun
    Hasan, Mohammad Shabbir
    Ahsan, Karar Zunaid
    PLOS ONE, 2023, 18 (05):
  • [34] Efficient Supervised Machine Learning Network for Non-Intrusive Load Monitoring
    Hadi, Muhammad Usman
    Suhaimi, Nik Hazmi Nik
    Basit, Abdul
    TECHNOLOGIES, 2022, 10 (04)
  • [35] An efficient flow-based botnet detection using supervised machine learning
    Stevanovic, Matija
    Pedersen, Jens Myrup
    2014 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2014, : 797 - 801
  • [36] Learning From Demonstrations: A Computationally Efficient Inverse Reinforcement Learning Approach With Simplified Implementation
    Lin, Yanbin
    Ni, Zhen
    Zhong, Xiangnan
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2025,
  • [37] Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations
    Vikas Bansal
    Ondrej Libiger
    BMC Bioinformatics, 16
  • [38] Fast individual ancestry inference from DNA sequence data leveraging allele frequencies for multiple populations
    Bansal, Vikas
    Libiger, Ondrej
    BMC BIOINFORMATICS, 2015, 16
  • [39] Inference of Locus-Specific Population Mixtures from Linked Genome-Wide Allele Frequencies
    Reyna-Blanco, Carlos S.
    Caduff, Madleina
    Galimberti, Marco
    Leuenberger, Christoph
    Wegmann, Daniel
    MOLECULAR BIOLOGY AND EVOLUTION, 2024, 41 (07)
  • [40] Joint inference of adaptive and demographic history from temporal population genomic data
    Pavinato, Vitor A. C.
    De Mita, Stephane
    Marin, Jean-Michel
    de Navascues, Miguel
    PEER COMMUNITY JOURNAL, 2022, 2