A novel prediction method for protein DNA-binding residues based on neighboring residue correlations

Cited by: 1
Authors
Song, Jiazhi [1, 2, 3]
Liu, Guixia [1, 3]
Jiang, Jingqing [2]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Inner Mongolia Minzu Univ, Coll Comp Sci & Technol, Tongliao, Inner Mongolia, Peoples R China
[3] Jilin Univ, Coll Comp Sci & Technol, Dept Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Jilin, Peoples R China
Keywords
Bioinformatics; protein; machine learning; binding sites; sequence information; INTEGRATING SEQUENCE; DOMAIN; SITES;
DOI
10.1080/13102818.2022.2122871
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Accurately identifying protein DNA-binding residues is important for understanding the protein-DNA recognition mechanism and for protein function annotation. Many computational methods have been proposed to predict protein-DNA binding residues from protein sequence information. However, protein-DNA binding datasets are severely imbalanced, and the under-sampling technique applied by most previous methods does not achieve satisfactory performance on such data. In this study, an adjustment algorithm is proposed to offset the biased prediction results of the classifier. The algorithm uses the binding probability between the target residue and its neighboring residues to recover true binding residues that were wrongly predicted as non-binding. The proposed prediction method with the adjustment algorithm achieves an area under the ROC curve (AUC) of 0.926 and 0.866 on two benchmark datasets and 0.882 on the independent testing set, which demonstrates that the method can efficiently predict the specific residues involved in protein-DNA interactions.
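The abstract does not spell out the adjustment rule, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a classifier has already produced one binding probability per residue, and it promotes a residue predicted as non-binding to binding when its sequence neighbors have a high average binding probability. The function name `adjust_predictions` and the parameters `window`, `neighbor_threshold`, and `rescue_floor` are hypothetical and chosen for illustration.

```python
from typing import List


def adjust_predictions(
    probs: List[float],
    threshold: float = 0.5,           # decision cutoff of the base classifier (assumed)
    window: int = 2,                  # neighbors considered on each side (assumed)
    neighbor_threshold: float = 0.5,  # mean neighbor probability needed to rescue (assumed)
    rescue_floor: float = 0.3,        # residues scoring below this are never rescued (assumed)
) -> List[int]:
    """Return 0/1 labels after a hypothetical neighbor-based adjustment."""
    n = len(probs)
    # Initial labels straight from the classifier probabilities.
    labels = [1 if p >= threshold else 0 for p in probs]
    adjusted = labels[:]
    for i, p in enumerate(probs):
        # Only reconsider residues predicted non-binding with a borderline score.
        if labels[i] == 1 or p < rescue_floor:
            continue
        lo, hi = max(0, i - window), min(n, i + window + 1)
        # Use the original probabilities so that rescues do not cascade.
        neighbors = [probs[j] for j in range(lo, hi) if j != i]
        if neighbors and sum(neighbors) / len(neighbors) >= neighbor_threshold:
            adjusted[i] = 1
    return adjusted


if __name__ == "__main__":
    example = [0.9, 0.8, 0.4, 0.7, 0.9, 0.1, 0.05, 0.35, 0.1]
    print(adjust_predictions(example))
```

In this sketch the residue at index 2 (probability 0.4) is relabeled as binding because its four neighbors score highly, while the residue at index 7 (probability 0.35) stays non-binding because its neighbors do not. The thresholds would need to be tuned on validation data; the values here are placeholders.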
Pages: 865-877
Number of pages: 13