A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine

被引:10
|
作者
Jain, Dharm Skandh [1 ,3 ]
Gupte, Sanket Rajan [1 ]
Aduri, Raviprasad [2 ]
机构
[1] Birla Inst Technol & Sci Pilani, Dept Comp Sci & Informat Syst, KK Birla Goa Campus, South Goa, Goa, India
[2] Birla Inst Technol & Sci Pilani, Dept Biol Sci, KK Birla Goa Campus, South Goa 403726, Goa, India
[3] Warsaw Univ Technol, Fac Elect & Informat Technol, Warsaw, Poland
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
关键词
LONG NONCODING RNA; BINDING PROTEINS; CLIP;
D O I
10.1038/s41598-018-27814-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA as well as TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine
    Dharm Skandh Jain
    Sanket Rajan Gupte
    Raviprasad Aduri
    Scientific Reports, 8
  • [2] Recent Advances in Machine Learning Based Prediction of RNA-Protein Interactions
    Sagar, Amit
    Xue, Bin
    PROTEIN AND PEPTIDE LETTERS, 2019, 26 (08): : 601 - 619
  • [3] RNA-PROTEIN INTERACTIONS
    FRANKEL, AD
    MATTAJ, IW
    RIO, DC
    CELL, 1991, 67 (06) : 1041 - 1046
  • [4] RNA-protein interactions
    Hall, KB
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (03) : 283 - 288
  • [5] RNA-PROTEIN INTERACTIONS
    WICKENS, MP
    DAHLBERG, JE
    CELL, 1987, 51 (03) : 339 - 342
  • [6] Predicting RNA-Protein Interactions Using Only Sequence Information
    Muppirala, Usha K.
    Honavar, Vasant G.
    Dobbs, Drena
    BMC BIOINFORMATICS, 2011, 12
  • [7] The Plethora of RNA-Protein Interactions Model a Basis for RNA Therapies
    Dansereau, Stephen J.
    Cui, Hua
    Dartawan, Ricky P.
    Sheng, Jia
    GENES, 2025, 16 (01)
  • [8] Predicting RNA-Protein Interactions Using Only Sequence Information
    Usha K Muppirala
    Vasant G Honavar
    Drena Dobbs
    BMC Bioinformatics, 12
  • [9] A boosting ensemble learning based hybrid light gradient boosting machine and extreme gradient boosting model for predicting house prices
    Sibindi, Racheal
    Mwangi, Ronald Waweru
    Waititu, Anthony Gichuhi
    ENGINEERING REPORTS, 2023, 5 (04)
  • [10] RNA-protein interactions.
    Williamson, JR
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1997, 214 : 163 - PHYS