A Data Driven Model for Predicting RNA-Protein Interactions based on Gradient Boosting Machine

被引:10
|
作者
Jain, Dharm Skandh [1 ,3 ]
Gupte, Sanket Rajan [1 ]
Aduri, Raviprasad [2 ]
机构
[1] Birla Inst Technol & Sci Pilani, Dept Comp Sci & Informat Syst, KK Birla Goa Campus, South Goa, Goa, India
[2] Birla Inst Technol & Sci Pilani, Dept Biol Sci, KK Birla Goa Campus, South Goa 403726, Goa, India
[3] Warsaw Univ Technol, Fac Elect & Informat Technol, Warsaw, Poland
来源
SCIENTIFIC REPORTS | 2018年 / 8卷
关键词
LONG NONCODING RNA; BINDING PROTEINS; CLIP;
D O I
10.1038/s41598-018-27814-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA protein interactions (RPI) play a pivotal role in the regulation of various biological processes. Experimental validation of RPI has been time-consuming, paving the way for computational prediction methods. The major limiting factor of these methods has been the accuracy and confidence of the predictions, and our in-house experiments show that they fail to accurately predict RPI involving short RNA sequences such as TERRA RNA. Here, we present a data-driven model for RPI prediction using a gradient boosting classifier. Amino acids and nucleotides are classified based on the high-resolution structural data of RNA protein complexes. The minimum structural unit consisting of five residues is used as the descriptor. Comparative analysis of existing methods shows the consistently higher performance of our method irrespective of the length of RNA present in the RPI. The method has been successfully applied to map RPI networks involving both long noncoding RNA as well as TERRA RNA. The method is also shown to successfully predict RNA and protein hubs present in RPI networks of four different organisms. The robustness of this method will provide a way for predicting RPI networks of yet unknown interactions for both long noncoding RNA and microRNA.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] RNA-protein interactions in an unstructured context
    Zagrovic, Bojan
    Bartonek, Lukas
    Polyansky, Anton A.
    FEBS LETTERS, 2018, 592 (17): : 2901 - 2916
  • [32] Methods to study the RNA-protein interactions
    Popova, V. V.
    Kurshakova, M. M.
    Kopytova, D. V.
    MOLECULAR BIOLOGY, 2015, 49 (03) : 418 - 426
  • [33] Testing ancient RNA-protein interactions
    Landweber, LF
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (20) : 11067 - 11068
  • [34] Specificity and nonspecificity in RNA-protein interactions
    Jankowsky, Eckhard
    Harris, Michael E.
    NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2015, 16 (09) : 533 - 544
  • [35] Thermodynamics and mutations in RNA-protein interactions
    Hall, KB
    Kranz, JK
    ENERGETICS OF BIOLOGICAL MACROMOLECULES, 1995, 259 : 261 - 281
  • [36] Classification and function of RNA-protein interactions
    Liu, Shurong
    Li, Bin
    Liang, Qiaoxia
    Liu, Anrui
    Qu, Lianghu
    Yang, Jianhua
    WILEY INTERDISCIPLINARY REVIEWS-RNA, 2020, 11 (06)
  • [37] Human telomerase RNA-protein interactions
    Bachand, F
    Triki, F
    Autexier, C
    NUCLEIC ACIDS RESEARCH, 2001, 29 (16) : 3385 - 3393
  • [38] The Role of RNA Sequence and Structure in RNA-Protein Interactions
    Gupta, Aditi
    Gribskov, Michael
    JOURNAL OF MOLECULAR BIOLOGY, 2011, 409 (04) : 574 - 587
  • [39] RNA-Protein interactions in regulation of picornavirus RNA translation
    Belsham, GJ
    Sonenberg, N
    MICROBIOLOGICAL REVIEWS, 1996, 60 (03) : 499 - +
  • [40] A light gradient boosting machine learning-based approach for predicting clinical data breast cancer
    Wang Qiuqian
    Gao Min
    Zhang KeZhu
    Chen Chen
    MULTISCALE AND MULTIDISCIPLINARY MODELING EXPERIMENTS AND DESIGN, 2025, 8 (01)