ModelRevelator: Fast phylogenetic model estimation via deep learning

被引:9
|
作者
Burgstaller-Muehlbacher, Sebastian [1 ,2 ]
Crotty, Stephen M. [3 ,4 ]
Schmidt, Heiko A. [1 ,2 ]
Reden, Franziska [1 ,2 ]
Drucks, Tamara [1 ,2 ,6 ]
von Haeseler, Arndt [1 ,2 ,5 ]
机构
[1] Univ Vienna, Max Perutz Labs, Ctr Integrat Bioinformat Vienna, A-1030 Vienna, Austria
[2] Med Univ Vienna, Vienna Bioctr VBC 5, A-1030 Vienna, Austria
[3] Univ Adelaide, Sch Math Sci, Adelaide, SA 5005, Australia
[4] Univ Adelaide, ARC Ctr Excellence Math & Stat Frontiers, Adelaide, SA 5005, Australia
[5] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Waehringer Str 29, A-1090 Vienna, Austria
[6] TU Wien, Res Unit Machine Learning, A-1040 Vienna, Austria
关键词
Phylogenetic model estimation; Deep learning; Artificial intelligence; Phylogenetics; Phylogenomics; DNA-SEQUENCES; SELECTION; SUBSTITUTIONS; SIMULATION; JMODELTEST; EVOLUTION; PROTEIN; SITES; RATES; TREE;
D O I
10.1016/j.ympev.2023.107905
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Selecting the best model of sequence evolution for a multiple-sequence-alignment (MSA) constitutes the first step of phylogenetic tree reconstruction. Common approaches for inferring nucleotide models typically apply maximum likelihood (ML) methods, with discrimination between models determined by one of several information criteria. This requires tree reconstruction and optimisation which can be computationally expensive. We demonstrate that neural networks can be used to perform model selection, without the need to reconstruct trees, optimise parameters, or calculate likelihoods.We introduce ModelRevelator, a model selection tool underpinned by two deep neural networks. The first neural network, NNmodelfind, recommends one of six commonly used models of sequence evolution, ranging in complexity from Jukes and Cantor to General Time Reversible. The second, NNalphafind, recommends whether or not a Gamma-distributed rate heterogeneous model should be incorporated, and if so, provides an estimate of the shape parameter, alpha. Users can simply input an MSA into ModelRevelator, and swiftly receive output recommending the evolutionary model, inclusive of the presence or absence of rate heterogeneity, and an estimate of alpha.We show that ModelRevelator performs comparably with likelihood-based methods and the recently published machine learning method ModelTeller over a wide range of parameter settings, with significant potential savings in computational effort. Further, we show that this performance is not restricted to the alignments on which the networks were trained, but is maintained even on unseen empirical data. We expect that ModelRevelator will provide a valuable alternative for phylogeneticists, especially where traditional methods of model selection are computationally prohibitive.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] An ensemble deep learning model for fast classification of Twitter spam
    Dhar, Suparna
    Bose, Indranil
    INFORMATION & MANAGEMENT, 2024, 61 (08)
  • [32] Deep learning for fast channel estimation in millimeter-wave MIMO systems
    Lyu, Siting
    Li, Xiaohui
    Fan, Tao
    Liu, Jiawen
    Shi, Mingli
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (06) : 1088 - 1095
  • [33] Deep learning for fast channel estimation in millimeter-wave MIMO systems
    LYU Siting
    LI Xiaohui
    FAN Tao
    LIU Jiawen
    SHI Mingli
    Journal of Systems Engineering and Electronics, 2022, 33 (06) : 1088 - 1095
  • [34] Fast M2 estimation for fiber beams through deep learning
    An, Yi
    Li, Jun
    Huang, Liangjin
    Leng, Jinyong
    Yang, Lijia
    Zhou, Pu
    2019 CONFERENCE ON LASERS AND ELECTRO-OPTICS (CLEO), 2019,
  • [35] Deep learning reconstruction with uncertainty estimation for γ photon interaction in fast scintillator detectors
    Daniel, G.
    Yahiaoui, M. -B.
    Comtat, C.
    Jan, S.
    Kochebina, O.
    Martinez, J. -M.
    Sergeyeva, V.
    Sharyy, V.
    Sung, C. -H.
    Yvon, D.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [36] Deep Learning for Channel Coding via Neural Mutual Information Estimation
    Fritschek, Rick
    Schaefer, Rafael F.
    Wunder, Gerhard
    2019 IEEE 20TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS (SPAWC 2019), 2019,
  • [37] Design of sparse arrays via deep learning for enhanced DOA estimation
    Steven Wandale
    Koichi Ichige
    EURASIP Journal on Advances in Signal Processing, 2021
  • [38] DEEP EXPOSURE FUSION WITH DEGHOSTING VIA HOMOGRAPHY ESTIMATION AND ATTENTION LEARNING
    Chen, Sheng-Yeh
    Chuang, Yung-Yu
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1464 - 1468
  • [39] A deep learning approach for sepsis monitoring via severity score estimation
    Asuroglu, Tunc
    Ogul, Hasan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 198
  • [40] Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning
    Mestav, Kursat Rasim
    Luengo-Rozas, Jaime
    Tong, Lang
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2019, 34 (06) : 4910 - 4920