MIonSite: Ligand-specific prediction of metal ion-binding sites via enhanced AdaBoost algorithm with protein sequence information

被引:18
|
作者
Qiao, Liang [1 ]
Xie, Dongqing [1 ]
机构
[1] Guangzhou Univ, Sch Math & Informat Sci, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Metal ion-binding site prediction; Ligand-specific; Imbalance leaming; Sequence-based; SECONDARY STRUCTURE PREDICTION; RESIDUES; DATABASE;
D O I
10.1016/j.ab.2018.11.009
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurately targeting metal ion-binding sites solely from protein sequences is valuable for both basic experimental biology and drug discovery studies. Although considerable progress has been made, metal ion-binding site prediction is still a challenging problem due to the small size and high versatility of the metal ions. In this paper, we develop a ligand-specific predictor called MIonSite for predicting metal ion-binding sites from protein sequences. MIonSite first employs protein evolutionary information, predicted secondary structure, predicted solvent accessibility, and conservation information calculated by Jensen-Shannon Divergence score to extract the discriminative feature of each residue. An enhanced AdaBoost algorithm is then designed to cope with the serious imbalance problem buried in the metal ion-binding site prediction, where the number of non-binding sites is far more than that of metal ion-binding sites. A new gold-standard benchmark dataset, consisting of training and independent validation subsets of Zn2+, Ca2+, Mg2+, Mn2+, Fe3+, Cu2+, Fe2+, Co2+, Na+, K+, Cd2+, and Ni2+, is constructed to evaluate the proposed MIonSite with other existing predictors. Experimental results demonstrate that the proposed MIonSite achieves high prediction performance and outperforms other state-of-the-art sequence-based predictors. The standalone program of MIonSite and corresponding datasets can be freely downloaded at https://github.com/LiangQiaoGu/MIonSite.git for academic use.
引用
收藏
页码:75 / 88
页数:14
相关论文
共 21 条
  • [11] SmoPSI: Analysis and Prediction of Small Molecule Binding Sites Based on Protein Sequence Information
    Wang, Wei
    Li, Keliang
    Lv, Hehe
    Zhang, Hongjun
    Wang, Shixun
    Huang, Junwei
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2019, 2019
  • [12] Alignment-free metal ion-binding site prediction from protein sequence through pretrained language model and multi-task learning
    Yuan, Qianmu
    Chen, Sheng
    Wang, Yu
    Zhao, Huiying
    Yang, Yuedong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
  • [13] Protein-Specific Prediction of RNA-Binding Sites Based on Information Entropy
    Ji, Yue
    Bai, Lu
    Li, Menglong
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [14] Identification of ligand-specific G protein-coupled receptor states and prediction of downstream efficacy via data-driven modeling
    Fleetwood, Oliver
    Carlsson, Jens
    Delemotte, Lucie
    ELIFE, 2021, 10 : 1 - 46
  • [15] M-Ionic: prediction of metal-ion-binding sites from sequence using residue embeddings
    Shenoy, Aditi
    Kalakoti, Yogesh
    Sundar, Durai
    Elofsson, Arne
    BIOINFORMATICS, 2024, 40 (01)
  • [16] Enhanced compound-protein binding affinity prediction by representing protein multimodal information via a coevolutionary strategy
    Guo, Binjie
    Zheng, Hanyu
    Jiang, Haohan
    Li, Xiaodan
    Guan, Naiyu
    Zuo, Yanming
    Zhang, Yicheng
    Yang, Hengfu
    Wang, Xuhua
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [17] Locating ligand binding sites in G-protein coupled receptors using combined information from docking and sequence conservation
    Vidad, Ashley Ryan
    Macaspac, Stephen
    Ng, Ho Leung
    PEERJ, 2021, 9
  • [18] Prediction of metal ion ligand binding residues by adding disorder value and propensity factors based on deep learning algorithm
    Hao, Sixi
    Hu, Xiuzhen
    Feng, Zhenxing
    Sun, Kai
    You, Xiaoxiao
    Wang, Ziyang
    Yang, Caiyun
    FRONTIERS IN GENETICS, 2022, 13
  • [19] Pseq2Sites: Enhancing protein sequence-based ligand binding-site prediction accuracy via the deep convolutional network and attention mechanism
    Seo, Sangmin
    Choi, Jonghwan
    Choi, Seungyeon
    Lee, Jieun
    Park, Chihyun
    Park, Sanghyun
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [20] Metal Ion Binding at the Catalytic Site Induces Widely Distributed Changes in a Sequence Specific Protein-DNA Complex
    Sinha, Kaustubh
    Sangani, Sahil S.
    Kehr, Andrew D.
    Rule, Gordon S.
    Jen-Jacobson, Linda
    BIOCHEMISTRY, 2016, 55 (44) : 6115 - 6132