Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning

被引:0
|
作者
Tran, Linh N. [1 ,2 ]
Sun, Connie K. [2 ]
Struck, Travis J. [2 ]
Sajan, Mathews [2 ]
Gutenkunst, Ryan N. [2 ]
机构
[1] Univ Arizona, Genet Grad Interdisciplinary Program, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Mol & Cellular Biol, Tucson, AZ 85721 USA
基金
美国国家卫生研究院;
关键词
population genomics; demographic history inference; machine learning; POPULATION-GENETICS; SPECTRUM;
D O I
10.1093/molbev/msae077
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Supervised Learning in Physical Networks: From Machine Learning to Learning Machines
    Stern, Menachem
    Hexner, Daniel
    Rocks, Jason W.
    Liu, Andrea J.
    PHYSICAL REVIEW X, 2021, 11 (02)
  • [22] Performance evaluation of supervised machine learning techniques for efficient detection of emotions from online content
    Asghar M.Z.
    Subhan F.
    Imran M.
    Kundi F.M.
    Khan A.
    Shamshirband S.
    Mosavi A.
    Csiba P.
    Varkonyi Koczy A.R.
    Computers, Materials and Continua, 2020, 63 (03): : 1093 - 1118
  • [23] Keep Learning: Self-supervised Meta-learning for Learning from Inference
    Kedia, Akhil
    Chinthakindi, Sai Chetan
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 63 - 77
  • [24] Machine-learning-based computationally efficient particle size distribution retrieval from bulk optical properties
    Jia, Ruhui
    Zhang, Xiaohao
    Cui, Fenping
    Chen, Gongye
    Li, Haomiao
    Peng, Haochen
    Cao, Zhaolou
    Pei, Shixin
    APPLIED OPTICS, 2020, 59 (24) : 7284 - 7291
  • [25] Objective Supervised Machine Learning-Based Classification and Inference of Biological Neuronal Networks
    Barros, Michael Taynnan
    Siljak, Harun
    Mullen, Peter
    Papadias, Constantinos
    Hyttinen, Jari
    Marchetti, Nicola
    MOLECULES, 2022, 27 (19):
  • [26] Machine learning for helicopter accident analysis using supervised classification: Inference, prediction, and implications
    Xu, Zhaoyi
    Saleh, Joseph Homer
    Subagia, Rachmat
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2020, 204
  • [27] Using Deep Learning to Estimate Clonality from Variant Allele Frequencies
    Anaya, Jordan
    Baras, Alexander
    Sohn, Andrew
    LABORATORY INVESTIGATION, 2023, 103 (03) : S1283 - S1284
  • [28] A MACHINE LEARNING APPROACH FOR COMPUTATIONALLY AND ENERGY EFFICIENT SPEECH ENHANCEMENT IN BINAURAL HEARING AIDS
    Ayllon, David
    Gil-Pita, Roberto
    Rosa-Zurera, Manuel
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6515 - 6519
  • [29] Computationally Efficient Wideband Spectrum Sensing through Cumulative Distribution Function and Machine Learning
    Nikonowicz, Jakub
    Jessa, Mieczyslaw
    Matuszewski, Lukasz
    JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2024, 20 (01) : 38 - 46
  • [30] An efficient catalyst screening strategy combining machine learning and causal inference
    Song, Chenyu
    Shi, Yintao
    Li, Meng
    Wu, Lin
    Xiong, Xiaorong
    Liu, Jianyun
    Xia, Dongsheng
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 377