A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming

Cited by: 54
Authors
Zhang, Hao [1, 2]
Zhang, Chunhe [1, 2]
Li, Zhi [3]
Li, Cong [1, 2]
Wei, Xu [1, 2]
Zhang, Borui [4]
Liu, Yuanning [1, 2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, Minist Educ, Changchun, Jilin, Peoples R China
[2] Jilin Univ, Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
[3] Changchun Univ Sci & Technol, Coll Comp Sci & Technol, Changchun, Jilin, Peoples R China
[4] Columbia Independent Sch, Columbia, MO USA
Source
FRONTIERS IN GENETICS | 2019 / Vol. 10
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
convolutional neural network; dynamic programming; RNA secondary structure; base pairing probability; energy balance status; REVEALS;
DOI
10.3389/fgene.2019.00467
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007; 090102;
Abstract
In recent years, RNA secondary structure information has played an important role in research on RNA and gene function. Although some RNA secondary structures can be determined experimentally, in most cases efficient and accurate computational methods are still needed to predict them. Current RNA secondary structure prediction methods are mainly based on the minimum free energy algorithm, which iteratively searches for the optimal folding state of RNA in vivo subject to minimum energy or other constraints. However, because the biological environment is complex, a real RNA structure settles into a state of balanced biological potential energy rather than the optimal folding state with minimum energy. For a short RNA sequence, this equilibrium energy state is close to the minimum free energy state, so the minimum free energy algorithm predicts its secondary structure with relatively high accuracy. In a longer RNA sequence, however, repeated folding drives the biopotential energy balance far from the minimum free energy state. This deviation arises from the complexity of the structure and leads to a serious decline in the accuracy of secondary structure prediction. In this paper, we propose a novel RNA secondary structure prediction algorithm that combines a convolutional neural network model with a dynamic programming method to improve accuracy using large-scale RNA sequence and structure data. We analyze current experimental RNA sequence and structure data to construct a deep convolutional network model, and then extract implicit features that support effective classification from the large-scale data in order to predict the pairing probability of each base in an RNA sequence. An enhanced dynamic programming method is then applied to the predicted base pairing probabilities to obtain the optimal RNA secondary structure.
Results indicate that the proposed method outperforms common RNA secondary structure prediction algorithms on three benchmark RNA families. Based on the characteristics of deep learning algorithms, it can be inferred that the proposed method has a 30% higher prediction success rate than other algorithms, and that such a method will be needed as the amount of real RNA structure data increases in the future.
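The second stage described in the abstract, turning predicted pairing probabilities into a single structure by dynamic programming, can be illustrated with a minimal sketch. The abstract does not specify the authors' "enhanced" method, so the code below substitutes a plain Nussinov-style recursion that maximizes the sum of pairing probabilities. The function name `fold`, the matrix `prob` (assumed here to hold one probability per base pair, standing in for the network output), and the `min_loop` parameter are all hypothetical and not taken from the paper.

```python
from typing import List, Tuple

# Canonical Watson-Crick pairs plus the G-U wobble pair.
ALLOWED_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"),
                 ("C", "G"), ("G", "U"), ("U", "G")}


def fold(seq: str, prob: List[List[float]], min_loop: int = 3) -> Tuple[float, str]:
    """Nussinov-style DP (illustrative stand-in, not the paper's method).

    prob[i][j] is an assumed pairing probability for bases i and j.
    Returns the best total score and a dot-bracket structure.
    """
    n = len(seq)
    if n == 0:
        return 0.0, ""
    # score[i][j]: best sum of pairing probabilities within seq[i..j].
    score = [[0.0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = max(score[i + 1][j], score[i][j - 1])  # i or j unpaired
            if (seq[i], seq[j]) in ALLOWED_PAIRS:         # i pairs with j
                best = max(best, score[i + 1][j - 1] + prob[i][j])
            for k in range(i + 1, j):                     # split into two parts
                best = max(best, score[i][k] + score[k + 1][j])
            score[i][j] = best

    # Traceback: re-evaluate the same expressions to find which case won.
    structure = ["."] * n
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= min_loop:
            continue
        if score[i][j] == score[i + 1][j]:
            stack.append((i + 1, j))
        elif score[i][j] == score[i][j - 1]:
            stack.append((i, j - 1))
        elif ((seq[i], seq[j]) in ALLOWED_PAIRS
              and score[i][j] == score[i + 1][j - 1] + prob[i][j]):
            structure[i], structure[j] = "(", ")"
            stack.append((i + 1, j - 1))
        else:
            for k in range(i + 1, j):
                if score[i][j] == score[i][k] + score[k + 1][j]:
                    stack.append((i, k))
                    stack.append((k + 1, j))
                    break
    return score[0][n - 1], "".join(structure)


if __name__ == "__main__":
    seq = "GGGAAACCC"
    # Toy probabilities: only the three stem pairs are likely.
    prob = [[0.0] * len(seq) for _ in seq]
    for i, j in [(0, 8), (1, 7), (2, 6)]:
        prob[i][j] = 0.9
    print(fold(seq, prob)[1])  # prints (((...)))
```

Like the classical recursion it is based on, this sketch cannot represent pseudoknots and runs in cubic time in the sequence length. Any refinement the authors apply in their enhanced method is outside what the abstract states.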
Pages: 12