ModelRevelator: Fast phylogenetic model estimation via deep learning

Cited by: 9
Authors
Burgstaller-Muehlbacher, Sebastian [1, 2]
Crotty, Stephen M. [3, 4]
Schmidt, Heiko A. [1, 2]
Reden, Franziska [1, 2]
Drucks, Tamara [1, 2, 6]
von Haeseler, Arndt [1, 2, 5]
Affiliations
[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria
[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia
[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria
[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria
Keywords
Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE
DOI
10.1016/j.ympev.2023.107905
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Selecting the best model of sequence evolution for a multiple sequence alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation, which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods. We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha. We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.
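For readers unfamiliar with the likelihood-based baseline that the abstract contrasts against, the following is a minimal sketch of information-criterion model selection, not ModelRevelator itself and not the authors' code. It assumes the six candidate models are JC, K2P, F81, HKY, TN93 and GTR (the abstract names only the two endpoints), and the log-likelihood values, site count and branch count are hypothetical placeholders. In practice each log-likelihood would come from a full ML tree optimisation, which is the expensive step the neural networks are meant to avoid.

```python
import math

# Free substitution-model parameters, excluding branch lengths.
# The four intermediate model names are an assumption; the abstract
# names only Jukes-Cantor (JC) and General Time Reversible (GTR).
FREE_PARAMS = {"JC": 0, "K2P": 1, "F81": 3, "HKY": 4, "TN93": 5, "GTR": 8}


def aic(log_lik, k):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2 * k - 2 * log_lik


def bic(log_lik, k, n_sites):
    """Bayesian information criterion: k ln n - 2 ln L."""
    return k * math.log(n_sites) - 2 * log_lik


def select_model(log_liks, n_sites, n_branches, criterion="bic"):
    """Return (best_model, scores); the lowest score wins."""
    scores = {}
    for name, log_lik in log_liks.items():
        # Branch lengths are estimated too, so they count as parameters.
        k = FREE_PARAMS[name] + n_branches
        if criterion == "aic":
            scores[name] = aic(log_lik, k)
        else:
            scores[name] = bic(log_lik, k, n_sites)
    best = min(scores, key=scores.get)
    return best, scores


# Hypothetical maximised log-likelihoods for an 8-taxon, 1000-site alignment
# (an unrooted 8-taxon tree has 2 * 8 - 3 = 13 branches).
example_log_liks = {
    "JC": -5230.4, "K2P": -5105.9, "F81": -5198.2,
    "HKY": -5071.3, "TN93": -5070.8, "GTR": -5068.9,
}
best, scores = select_model(example_log_liks, n_sites=1000, n_branches=13)
print(best)
```

With these placeholder numbers the richer models fit slightly better, but the penalty term outweighs the gain, so a mid-complexity model is preferred.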
Pages: 16
Related papers
50 in total
  • [1] Fast Horizon Estimation via Deep Hashing
    Luo, Wenbing
    Zhu, Yi
    Li, Hanxi
    Wang, Mingwen
    8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, : 84 - 87
  • [2] Seismic Volumetric Dip Estimation via Multichannel Deep Learning Model
    Lou, Yihuai
    Li, Shizhen
    Li, Shengjun
    Liu, Naihao
    Zhang, Bo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [3] Fast Beamforming Design via Deep Learning
    Huang, Hao
    Peng, Yang
    Yang, Jie
    Xia, Wenchao
    Gui, Guan
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (01) : 1065 - 1069
  • [4] Fast Scene Layout Estimation via Deep Hashing
    Zhu, Yi
    Luo, Wenbing
    Li, Hanxi
    Wang, Mingwen
    THIRD INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2018, 10828
  • [5] Fusang: a framework for phylogenetic tree inference via deep learning
    Wang, Zhicheng
    Sun, Jinnan
    Gao, Yuan
    Xue, Yongwei
    Zhang, Yubo
    Li, Kuan
    Zhang, Wei
    Zhang, Chi
    Zu, Jian
    Zhang, Li
    NUCLEIC ACIDS RESEARCH, 2023, 51 (20) : 10909 - 10923
  • [6] Fast Posterior Estimation of Cardiac Electrophysiological Model Parameters via Bayesian Active Learning
    Zaman, Md Shakil
    Dhamala, Jwala
    Bajracharya, Pradeep
    Sapp, John L.
    Horacek, B. Milan
    Wu, Katherine C.
    Trayanova, Natalia A.
    Wang, Linwei
    FRONTIERS IN PHYSIOLOGY, 2021, 12
  • [7] Fast structured illumination microscopy via deep learning
    Ling, Chang
    Zhang, Chonglei
    Wang, Mingqun
    Meng, Fanfei
    Du, Luping
    Yuan, Xiaocong
    PHOTONICS RESEARCH, 2020, 8 (08) : 1350 - 1359
  • [8] Fast Uncertainty Estimation for Deep Learning Based Optical Flow
    Lee, Serin
    Capuano, Vincenzo
    Harvard, Alexei
    Chung, Soon-Jo
    2020 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2020, : 10138 - 10144