A multi-population memetic algorithm for the 3-D protein structure prediction problem

被引:12
|
作者
Correa, Leonardo de Lima [1 ]
Dorn, Marcio [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Informat, Av Bento Goncalves 9500, BR-91501970 Porto Alegre, RS, Brazil
关键词
Optimization; Metaheuristics; Evolutionary and knowledge-based algorithms; Structural bioinformatics; BEE COLONY ALGORITHM; EFFICIENT ALGORITHM; SECONDARY STRUCTURE; OPTIMIZATION; CLASSIFICATION; RESOLUTION; DIVERSITY; SEQUENCE;
D O I
10.1016/j.swevo.2020.100677
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a knowledge-based memetic algorithm to tackle the three-dimensional protein structure prediction problem without the explicit use of experimentally determined protein structures' templates. Our algorithm proposal was divided into two main prediction steps: (i) solutions sampling and initialization; and (ii) structural models' optimization coming from the previous stage. The first step generates and classifies several structural models for a given target protein, through the Angle Probability List strategy, to identify distinct structural patterns and consider reasonable solutions in the memetic algorithm initialization. The Angle Probability List takes advantage of structural knowledge stored in the Protein Data Bank to reduce the size and, consequently, the conformational search space complexity. The second step of the method consists in the optimization of the structures generated in the first stage by the proposed memetic algorithm. It uses a tree-based population where each node can be seen as an independent subpopulation that interacts with each other over global search operations, aiming at knowledge sharing, population diversity, and better exploration of the multimodal search space. The method also encompasses ad hoc global search operators, whose objective is to increase the method exploration ability focusing on specific characteristics of the protein structure prediction problem, combined with the artificial bee colony algorithm used as an exploitation technique applied to each node of the tree. The proposed algorithm was tested on a set of 24 amino acid sequences, as well as compared to the reference method in the protein structure prediction area, the method of Rosetta. The obtained results show the ability of our method to predict three-dimensional protein structures with similar folding to the experimentally determined ones, regarding the structural metrics Root-Mean-Square Deviation and Global Distance Total Score Test. We also show that our method was able to reach comparable results to Rosetta, and in some cases, it outperformed Rosetta, corroborating the effectiveness of our proposal.
引用
收藏
页数:36
相关论文
共 50 条
  • [1] A Memetic Algorithm for 3D Protein Structure Prediction Problem
    Correa, Leonardo
    Borguesan, Bruno
    Farfan, Camilo
    Inostroza-Ponta, Mario
    Dorn, Marcio
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (03) : 690 - 704
  • [2] Multi-objective multi-population biased random-key genetic algorithm for the 3-D container loading problem
    Zheng, Jia-Nian
    Chien, Chen-Fu
    Gen, Mitsuo
    COMPUTERS & INDUSTRIAL ENGINEERING, 2015, 89 : 80 - 87
  • [3] A memetic algorithm for protein structure prediction in a 3D-Lattice HP model
    Bazzoli, A
    Tettamanzi, AGB
    APPLICATIONS OF EVOLUTIONARY COMPUTING, 2004, 3005 : 1 - 10
  • [4] Clustered Memetic Algorithm for Protein Structure Prediction
    Islam, Kamrul
    Chetty, Madhu
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [5] A multi-population algorithm for multi-objective knapsack problem
    Ben Mansour, Imen
    Basseur, Matthieu
    Saubion, Frederic
    APPLIED SOFT COMPUTING, 2018, 70 : 814 - 825
  • [6] Novel Memetic Algorithm for Protein Structure Prediction
    Islam, Md Kamrul
    Chetty, Madhu
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 412 - 421
  • [7] A knowledge-based artificial bee colony algorithm for the 3-D protein structure prediction problem
    Correa, Leonardo de Lima
    Dorn, Marcio
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 949 - 956
  • [8] Evaluation of a combined energy fitness function for a distributed memetic algorithm to tackle the 3D protein structure prediction problem
    Escobar, Ivan
    Hidalgo, Nicolas
    Inostroza-Ponta, Mario
    Marin, Mauricio
    Rosas, Erika
    Dorn, Marcio
    PROCEEDINGS OF THE 2016 35TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2016,
  • [9] An Improved Multi-objective Evolutionary Memetic Algorithm based on Multi-population and Its Application
    Xiao Zhongliang
    FOURTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2012), 2012, 8334
  • [10] A Hybrid Genetic Algorithm for the 3-D Protein Structure Prediction Problem using a Path-Relinking Strategy
    Dorn, Marcio
    Buriol, Luciana S.
    Lamb, Luis C.
    2011 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2011, : 2709 - 2716