Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks

Cited by: 45
Authors
Guo, Yanbu [1]
Wang, Bingyi [2]
Li, Weihua [1]
Yang, Bei [3]
Affiliations
[1] Yunnan Univ, Sch Informat Sci & Engn, 2 North Cuihu Rd, Kunming 650091, Yunnan, Peoples R China
[2] Chinese Acad Forestry, Res Inst Resource Insects, Kunming 650224, Yunnan, Peoples R China
[3] Second Peoples Hosp Yunnan Prov, Cardiol Dept, 176 Qingnian Rd, Kunming 650021, Yunnan, Peoples R China
Funding
National Science Foundation (USA)
Keywords
Bioinformatics; protein secondary structure prediction (PSSP); convolutional neural networks (CNNs); recurrent neural networks (RNNs); long short-term memory (LSTM); gated recurrent units (GRUs)
DOI
10.1142/S021972001850021X
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Protein secondary structure prediction (PSSP) is an important research field in bioinformatics. The features of a protein sequence can be represented as a matrix with two dimensions: the amino-acid residue (time-step) dimension and the feature vector dimension. Common approaches to predicting secondary structure focus only on the amino-acid residue dimension. However, the feature vector dimension may also contain useful information for PSSP. To integrate the information on both dimensions of the matrix, we propose a hybrid deep learning framework, the two-dimensional convolutional bidirectional recurrent neural network (2C-BRNN), for improving the accuracy of 8-class secondary structure prediction. The framework first extracts discriminative local interactions between amino-acid residues with two-dimensional convolutional neural networks (2DCNNs), and then captures long-range interactions between amino-acid residues with bidirectional gated recurrent units (BGRUs) or bidirectional long short-term memory (BLSTM). Specifically, the 2C-BRNN framework consists of four models: 2DConv-BGRUs, 2DCNN-BGRUs, 2DConv-BLSTM and 2DCNN-BLSTM. The 2DConv- models contain only two-dimensional (2D) convolution operations, whereas the 2DCNN- models contain both 2D convolution and pooling operations. Experiments are conducted on four public datasets. The results show that the proposed 2DConv-BLSTM model performs significantly better than the benchmark models. The experiments also demonstrate that the proposed models can extract more meaningful features from the protein feature matrix, and that the feature vector dimension is useful for PSSP. The code and datasets for the proposed methods are available at https://github.com/guoyanb/JBCB2018/.
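The data flow described in the abstract (a 2D convolution over the residue-by-feature matrix, followed by a bidirectional recurrent layer and a per-residue 8-class output) can be sketched roughly as below. This is a minimal, untrained illustration in plain Python, not the authors' implementation, which is in the linked repository. It assumes a single 3x3 filter, random weights, no biases and no pooling (loosely matching the 2DConv-BGRUs variant); all function names, sizes and parameters are hypothetical.

```python
import math
import random

def rand_matrix(rng, rows, cols):
    """Small random weights; a real model would learn these."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(w, v):
    return [sum(a * b for a, b in zip(row, v)) for row in w]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def softmax(v):
    m = max(v)
    e = [math.exp(x - m) for x in v]
    s = sum(e)
    return [x / s for x in e]

def conv2d_same(matrix, kernel):
    """Zero-padded 2D convolution + ReLU; slides over residues AND features."""
    rows, cols = len(matrix), len(matrix[0])
    kh, kw = len(kernel), len(kernel[0])
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = 0.0
            for a in range(kh):
                for b in range(kw):
                    r, c = i + a - kh // 2, j + b - kw // 2
                    if 0 <= r < rows and 0 <= c < cols:
                        acc += matrix[r][c] * kernel[a][b]
            out[i][j] = max(acc, 0.0)
    return out

def init_gru(rng, in_dim, hidden):
    """Input (W_*) and recurrent (U_*) weights for the update, reset and candidate gates."""
    p = {}
    for gate in ("z", "r", "h"):
        p["W_" + gate] = rand_matrix(rng, hidden, in_dim)
        p["U_" + gate] = rand_matrix(rng, hidden, hidden)
    return p

def gru_step(x, h, p):
    """One GRU update (biases omitted for brevity)."""
    z = [sigmoid(a + b) for a, b in zip(matvec(p["W_z"], x), matvec(p["U_z"], h))]
    r = [sigmoid(a + b) for a, b in zip(matvec(p["W_r"], x), matvec(p["U_r"], h))]
    rh = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a + b) for a, b in zip(matvec(p["W_h"], x), matvec(p["U_h"], rh))]
    return [(1 - zi) * hi + zi * ci for zi, hi, ci in zip(z, h, cand)]

def run_gru(seq, p, hidden):
    h, states = [0.0] * hidden, []
    for x in seq:
        h = gru_step(x, h, p)
        states.append(h)
    return states

def bidirectional_gru(seq, fwd, bwd, hidden):
    """Concatenate forward and backward hidden states for each residue."""
    f = run_gru(seq, fwd, hidden)
    b = run_gru(seq[::-1], bwd, hidden)[::-1]
    return [fi + bi for fi, bi in zip(f, b)]

# Hypothetical toy sizes: 6 residues, 5 features per residue, 8 output classes.
rng = random.Random(0)
n_residues, n_features, hidden, n_classes = 6, 5, 4, 8
features = rand_matrix(rng, n_residues, n_features)   # residue x feature matrix
kernel = rand_matrix(rng, 3, 3)
fwd = init_gru(rng, n_features, hidden)
bwd = init_gru(rng, n_features, hidden)
w_out = rand_matrix(rng, n_classes, 2 * hidden)

local = conv2d_same(features, kernel)                  # local interactions
states = bidirectional_gru(local, fwd, bwd, hidden)    # long-range context
probs = [softmax(matvec(w_out, s)) for s in states]    # 8-class scores per residue
print(len(probs), len(probs[0]))  # 6 8
```

The point of the 3x3 kernel is that each output value mixes neighbouring residues and neighbouring feature columns, which is the sense in which both dimensions of the matrix are used; a 1D convolution would slide along the residue axis only.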
Pages: 19