Influence of substitution model selection on protein phylogenetic tree reconstruction

被引:7
|
作者
Del Amparo, Roberto [1 ,2 ]
Arenas, Miguel [1 ,2 ,3 ]
机构
[1] Univ Vigo, CINBIO, Vigo 36310, Spain
[2] Univ Vigo, Dept Biochem Genet & Immunol, Vigo 36310, Spain
[3] Galicia Sur Hlth Res Inst IIS Galicia Sur, Vigo 36310, Spain
关键词
Substitution models of protein evolution; Substitution model selection; Molecular evolution; Phylogenetic tree reconstruction; Protein evolution; Phylogenetics; BEST-FIT MODELS; LIKELIHOOD; EVOLUTION; PERFORMANCE; SEQUENCES; DIVERGENCE; TOPOLOGY; PROTTEST; MUTATION; RATES;
D O I
10.1016/j.gene.2023.147336
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Probabilistic phylogenetic tree reconstruction is traditionally performed under a best-fitting substitution model of molecular evolution previously selected according to diverse statistical criteria. Interestingly, some recent studies proposed that this procedure is unnecessary for phylogenetic tree reconstruction leading to a debate in the field. In contrast to DNA sequences, phylogenetic tree reconstruction from protein sequences is traditionally based on empirical exchangeability matrices that can differ among taxonomic groups and protein families. Considering this aspect, here we investigated the influence of selecting a substitution model of protein evolution on phylogenetic tree reconstruction by the analyses of real and simulated data. We found that phylogenetic tree reconstructions based on a selected best-fitting substitution model of protein evolution are the most accurate, in terms of topology and branch lengths, compared with those derived from substitution models with amino acid replacement matrices far from the selected best-fitting model, especially when the data has large genetic di-versity. Indeed, we found that substitution models with similar amino acid replacement matrices produce similar reconstructed phylogenetic trees, suggesting the use of substitution models as similar as possible to a selected best-fitting model when the latter cannot be used. Therefore, we recommend the use of the traditional protocol of selection among substitution models of evolution for protein phylogenetic tree reconstruction.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Pattern-Based Phylogenetic Distance Estimation and Tree Reconstruction
    Hoehl, Michael
    Rigoutsos, Isidore
    Ragan, Mark A.
    EVOLUTIONARY BIOINFORMATICS, 2006, 2 : 359 - 375
  • [42] PTreeRec: Phylogenetic Tree Reconstruction based on genome BLAST distance
    Deng, Riqiang
    Huang, Mingsong
    Wang, Jinwen
    Huang, Yuansen
    Yang, Jie
    Feng, Jinghua
    Wang, Xunzhang
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2006, 30 (04) : 300 - 302
  • [43] A short proof that phylogenetic tree reconstruction by maximum likelihood is hard
    Roch, S
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2006, 3 (01) : 92 - 94
  • [44] An FPGA Hardware Implementation Approach for a Phylogenetic Tree Reconstruction Algorithm with Incremental Tree Optimization
    Block, Henry
    Maruyama, Tsutomu
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [45] Phylogenetic tree selection by the adjusted k-means approach
    Wang, Hsiuying
    Hung, Shan-Lin
    JOURNAL OF APPLIED STATISTICS, 2012, 39 (03) : 643 - 655
  • [46] A new phylogenetic tree model for fuzzy characters
    Auyeung, A
    ITCC 2005: International Conference on Information Technology: Coding and Computing, Vol 1, 2005, : 2 - 7
  • [47] A Phyletic Model for Evaluating Phylogenetic Tree Estimation
    Enosawa, Ryosuke
    Mutoh, Atsuko
    Inuzuka, Nobuhiro
    2015 IEEE 4TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE), 2015, : 404 - 407
  • [48] PartitionFinder: Combined Selection of Partitioning Schemes and Substitution Models for Phylogenetic Analyses
    Lanfear, Robert
    Calcott, Brett
    Ho, Simon Y. W.
    Guindon, Stephane
    MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (06) : 1695 - 1701
  • [49] PROGRESSIVE ALIGNMENT AND PHYLOGENETIC TREE CONSTRUCTION OF PROTEIN SEQUENCES
    FENG, DF
    DOOLITTLE, RF
    METHODS IN ENZYMOLOGY, 1990, 183 : 375 - 387
  • [50] Protein Molecular Function Prediction Based on the Phylogenetic Tree
    Jian, Lu
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, 2012, 304 : 185 - 190