TAWFN: a deep learning framework for protein function prediction

Cited by: 0
Authors
Meng, Lu [1]
Wang, Xiaoran [1]
Affiliations
[1] Northeastern Univ, Coll Informat Sci & Engn, 3-11 Wenhua Rd, Shenyang 110000, Liaoning, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
SEQUENCE; GENERATION;
DOI
10.1093/bioinformatics/btae571
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Proteins play pivotal roles in biological systems, and precise prediction of their functions is indispensable for practical applications. Despite the surge in protein sequence data facilitated by high-throughput techniques, determining the exact functions of proteins still demands considerable time and resources. Currently, numerous methods rely on protein sequences for prediction, while methods targeting protein structures are scarce and often employ convolutional neural networks (CNNs) or graph convolutional networks (GCNs) individually.
Results: To address these challenges, our approach starts from protein structures and combines a CNN and a GCN into a unified framework, the two-model adaptive weight fusion network (TAWFN), for protein function prediction. First, amino acid contact maps and sequences are extracted from the protein structure. Then, the sequence is used to generate one-hot encoded features and deep semantic features. These features, along with the constructed graph, are fed into the adaptive graph convolutional network (AGCN) module and the multi-layer convolutional neural network (MCNN) module as needed, producing preliminary classification results. Finally, the preliminary results are fed into the adaptive weight computation network, which calculates adaptive weights to fuse the initial predictions from both networks, yielding the final prediction. To evaluate the method, experiments were conducted on the PDBset and AFset datasets. For the molecular function, biological process, and cellular component tasks, TAWFN achieved area under the precision-recall curve (AUPR) values of 0.718, 0.385, and 0.488, respectively, with corresponding Fmax scores of 0.762, 0.628, and 0.693 and Smin scores of 0.326, 0.483, and 0.454. The experimental results demonstrate that TAWFN exhibits promising performance, outperforming existing methods.
Availability and implementation: The TAWFN source code can be found at https://github.com/ss0830/TAWFN.
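The data flow described in the abstract (one-hot sequence features, a contact map derived from the structure, and a weighted fusion of two preliminary predictions) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 10 Å contact cutoff, the use of one coordinate per residue, and the softmax over two fixed scores (standing in for the output of the adaptive weight computation network) are all assumptions made for illustration only. The actual code is in the repository linked above.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def one_hot(sequence):
    """One 20-element 0/1 row per residue; unknown residues give an all-zero row."""
    index = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    rows = []
    for aa in sequence:
        row = [0] * len(AMINO_ACIDS)
        if aa in index:
            row[index[aa]] = 1
        rows.append(row)
    return rows


def contact_map(coords, cutoff=10.0):
    """Binary contact map: 1 where two residues lie within `cutoff` of each other.

    `coords` holds one (x, y, z) point per residue; the cutoff value is an
    assumption, not taken from the paper.
    """
    n = len(coords)
    return [
        [1 if math.dist(coords[i], coords[j]) <= cutoff else 0 for j in range(n)]
        for i in range(n)
    ]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(p_gcn, p_cnn, scores):
    """Blend two per-term probability vectors with weights softmax(scores).

    `scores` is a hypothetical pair of raw scores; in a trained model these
    would come from a learned weighting network rather than being fixed.
    """
    w_gcn, w_cnn = softmax(scores)
    return [w_gcn * a + w_cnn * b for a, b in zip(p_gcn, p_cnn)]


# Example: equal scores give equal weights, so the result is the plain average.
fused = fuse([0.8, 0.2], [0.4, 0.6], scores=[0.0, 0.0])
```

Because the weights come from a softmax, they are positive and sum to one, so each fused value stays between the two preliminary predictions for that term.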
Pages: 9