Identifying the binding preferences of RNA-binding proteins (RBPs) is important in understanding their contribution to post-transcriptional regulation. Here, we review the current state-of the art of RNA motif identification tools for RBPs. New in vivo and in vitro data sets provide sufficient statistical power to enable detection of relatively long and complex sequence and sequence-structure binding preferences, and recent computational methods are geared towards quantitative identification of these patterns. We classify methods by their motif model's representational power and describe the underlying considerations for RNA-protein interactions. All classical motif identification algorithms apply physically motivated architectures, consisting of a motif and an occupancy model, we call these explicit motif models. Recent methods, such as convolutional neural networks and support vector machines, abandon the classical architecture and implicitly model RNA binding without defining a motif model. Although they achieve high accuracy on held-out data they may be unsuitable to solve the ultimate goal of the field, using motifs trained on in vitro data to predict in vivo binding sites. For this task methods need to separate intrinsic binding preferences from cellular effects from protein and RNA concentrations, cooperativity, and competition. To tackle this problem, we advocate for the use of a 'three-layer' architecture, consisting of motif model, occupancy model, and extrinsic factor model, which enables separation and adjustment to cellular conditions.
机构:
Max Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
Humboldt Univ, Dept Comp Sci, D-12489 Berlin, GermanyMax Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
Munteanu, Aline
Mukherjee, Neelanjan
论文数: 0引用数: 0
h-index: 0
机构:
Max Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, GermanyMax Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
Mukherjee, Neelanjan
Ohler, Uwe
论文数: 0引用数: 0
h-index: 0
机构:
Max Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
Humboldt Univ, Dept Comp Sci, D-12489 Berlin, GermanyMax Delbruck Ctr Mol Med Helmholtz Assoc, Berlin Inst Med Syst Biol, D-13125 Berlin, Germany
机构:
Univ New South Wales, Sch Chem, Sydney, NSW 2052, Australia
Univ New South Wales, Australian Ctr Astrobiol, Sydney, NSW 2052, AustraliaUniv New South Wales, Sch Chem, Sydney, NSW 2052, Australia
Marshall, Luke K.
Fahrenbach, Albert C.
论文数: 0引用数: 0
h-index: 0
机构:
Univ New South Wales, Australian Ctr Astrobiol, Sch Chem, Sydney, NSW 2052, Australia
Univ New South Wales, UNSW RNA Inst, Sydney, NSW 2052, AustraliaUniv New South Wales, Sch Chem, Sydney, NSW 2052, Australia
Fahrenbach, Albert C.
Thordarson, Pall
论文数: 0引用数: 0
h-index: 0
机构:
Univ New South Wales, Australian Ctr Astrobiol, Sch Chem, Sydney, NSW 2052, Australia
Univ New South Wales, UNSW RNA Inst, Sydney, NSW 2052, AustraliaUniv New South Wales, Sch Chem, Sydney, NSW 2052, Australia
机构:
Institute of Molecular Genetics, Russian Academy of Sciences, MoscowInstitute of Molecular Genetics, Russian Academy of Sciences, Moscow
Kotelnikov R.N.
Shpiz S.G.
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Molecular Genetics, Russian Academy of Sciences, Moscow
Moscow State University, MoscowInstitute of Molecular Genetics, Russian Academy of Sciences, Moscow
Shpiz S.G.
Kalmykova A.I.
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Molecular Genetics, Russian Academy of Sciences, MoscowInstitute of Molecular Genetics, Russian Academy of Sciences, Moscow
Kalmykova A.I.
Gvozdev V.A.
论文数: 0引用数: 0
h-index: 0
机构:
Institute of Molecular Genetics, Russian Academy of Sciences, Moscow
Moscow State University, MoscowInstitute of Molecular Genetics, Russian Academy of Sciences, Moscow
机构:
Guangdong Med Univ, Affiliated Hosp, Dept Med Res, Zhanjiang 524001, Peoples R China
Sorbonne Univ, Inst Biol Paris Seine IBPS, CNRS, UMR7622,Lab Dev Biol, F-75005 Paris, FranceGuangdong Med Univ, Affiliated Hosp, Dept Med Res, Zhanjiang 524001, Peoples R China
机构:
Univ Wisconsin, Dept Anaesthesiol, Madison, WI 53792 USAUniv Wisconsin, Dept Anaesthesiol, Madison, WI 53792 USA
Smith, Patrick R.
Campbell, Zachary T.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Wisconsin, Dept Anaesthesiol, Madison, WI 53792 USA
Univ Wisconsin, Dept Biomol Chem, Madison, WI 53792 USAUniv Wisconsin, Dept Anaesthesiol, Madison, WI 53792 USA