ModelRevelator: Fast phylogenetic model estimation via deep learning

被引:9
|
作者
Burgstaller-Muehlbacher, Sebastian [1 ,2 ]
Crotty, Stephen M. [3 ,4 ]
Schmidt, Heiko A. [1 ,2 ]
Reden, Franziska [1 ,2 ]
Drucks, Tamara [1 ,2 ,6 ]
von Haeseler, Arndt [1 ,2 ,5 ]
机构
[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria
[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia
[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria
[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria
关键词
Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE;
D O I
10.1016/j.ympev.2023.107905
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Selecting the best model of sequence evolution for a multiple-sequence-alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods.We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha.We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Age and Gender Estimation via Deep Dictionary Learning Regression
    Singhal, Vanika
    Majumdar, Angshul
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [22] Learning the Unobservable: High-Resolution State Estimation via Deep Learning
    Mestav, Kursat Rasim
    Tong, Lang
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 171 - 176
  • [23] A Robust Deep Learning Model for Terrain Slope Estimation
    Alorf, Abdulaziz
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 1231 - 1245
  • [24] A Deep Learning Model for Estimation of Patients with Undiagnosed Diabetes
    Ryu, Kwang Sun
    Lee, Sang Won
    Batbaatar, Erdenebileg
    Lee, Jae Wook
    Choi, Kui Son
    Cha, Hyo Soung
    APPLIED SCIENCES-BASEL, 2020, 10 (01):
  • [25] FAST TRACKING VIA CONTEXT DEPTH MODEL LEARNING
    Chen, Zhaoyun
    Luo, Lei
    Wen, Mei
    Zhang, Chunyuan
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 4215 - 4218
  • [26] Learning and Fast Adaptation for Grid Emergency Control via Deep Meta Reinforcement Learning
    Huang, Renke
    Chen, Yujiao
    Yin, Tianzhixi
    Huang, Qiuhua
    Tan, Jie
    Yu, Wenhao
    Li, Xinya
    Li, Ang
    Du, Yan
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2022, 37 (06) : 4168 - 4178
  • [27] Fast Learning of Deep Neural Networks via Singular Value Decomposition
    Cai, Chenghao
    Ke, Dengfeng
    Xu, Yanyan
    Su, Kaile
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 820 - 826
  • [28] Fast and Efficient DNN Deployment via Deep Gaussian Transfer Learning
    Sun, Qi
    Bai, Chen
    Chen, Tinghuan
    Geng, Hao
    Zhang, Xinyun
    Bai, Yang
    Yu, Bei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 5360 - 5370
  • [29] MULTI-PITCH ESTIMATION VIA FAST GROUP SPARSE LEARNING
    Kronvall, Ted
    Elvander, Filip
    Adalbjornsson, Stefan Ingi
    Jakobsson, Andreas
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1093 - 1097
  • [30] Deep Learning Based Phylogenetic Analysis
    Das, Bihter
    Toroman, Suat
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2020, : 323 - 326