A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine

Cited by: 10
Authors
Jain, Dharm Skandh [1 ,3 ]
Gupte, Sanket Rajan [1 ]
Aduri, Raviprasad [2 ]
Affiliations
[1] Birla Inst Technol & Sci Pilani, Dept Comp Sci & Informat Syst, KK Birla Goa Campus, South Goa, Goa, India
[2] Birla Inst Technol & Sci Pilani, Dept Biol Sci, KK Birla Goa Campus, South Goa 403726, Goa, India
[3] Warsaw Univ Technol, Fac Elect & Informat Technol, Warsaw, Poland
Source
SCIENTIFIC REPORTS | 2018 / Vol. 8
Keywords
LONG NONCODING RNA; BINDING PROTEINS; CLIP;
DOI
10.1038/s41598-018-27814-2
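Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code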
07 ; 0710 ; 09 ;
Abstract
RNA-protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA-protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA and TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.
Pages: 10
Related papers
50 in total
  • [21] RNA-PROTEIN INTERACTIONS IN MODEL SYSTEMS FOR HIV TARTAT RECOGNITION
    PUGLISI, JD
    TAN, RY
    CALNAN, BJ
    FRANKEL, AD
    WILLIAMSON, JR
    BIOPHYSICAL JOURNAL, 1993, 64 (02) : A1 - A1
  • [22] Prediction of RNA-protein interactions using a nucleotide language model
    Yamada, Keisuke
    Hamada, Michiaki
    Arighi, Cecilia
    BIOINFORMATICS ADVANCES, 2022, 2 (01):
  • [23] Methods to study the RNA-protein interactions
    V. V. Popova
    M. M. Kurshakova
    D. V. Kopytova
    Molecular Biology, 2015, 49 : 418 - 426
  • [24] String-Based Models for Predicting RNA-Protein Interaction
    Adjeroh, Donald
    Allaga, Maen
    Tan, Jun
    Lin, Jie
    Jiang, Yue
    Abbasi, Ahmed
    Zhou, Xiaobo
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 661 - 666
  • [25] Methods to study RNA-protein interactions
    Ramanathan, Muthukumar
    Porter, Douglas F.
    Khavari, Paul A.
    NATURE METHODS, 2019, 16 (03) : 225 - 234
  • [26] Kinetics of RNA-protein interactions in cells
    Sharma, Deepak
    Licatalosi, Donny D.
    Jankowsky, Eckhard
    TRENDS IN BIOCHEMICAL SCIENCES, 2021, 46 (10) : 861 - 862
  • [27] RNA-PROTEIN INTERACTIONS RaPID hookup
    Miura, Grant
    NATURE CHEMICAL BIOLOGY, 2018, 14 (04) : 327 - 327
  • [28] Transient DNA/RNA-protein interactions
    Blanco, Francisco J.
    Montoya, Guillermo
    FEBS JOURNAL, 2011, 278 (10) : 1643 - 1650
  • [29] A Global View of RNA-Protein Interactions
    Unknown
    CELL, 2012, 149 (07) : 1415 - 1415
  • [30] RNA-protein interactions in spherical viruses
    Bink, HHJ
    Pleij, CWA
    ARCHIVES OF VIROLOGY, 2002, 147 (12) : 2261 - 2279