Learning the Protein Language Model of SARS-CoV-2 Spike Proteins

Cited by: 0
Authors
Llanes, Paul Vincent [1]
Solano, Geoffrey [1]
Pontiveros, Marc Jermaine [2]
Affiliations
[1] Univ Philippines Manila, Dept Phys Sci & Math, Manila, Philippines
[2] Univ Philippines Diliman, Dept Comp Sci, Quezon City, Philippines
Keywords
SARS-CoV-2; spike proteins; sequence mutations; COVID-19; language modelling; recurrent neural network; Leiden clustering algorithm; viral escape
DOI
10.1109/ICAIIC57133.2023.10067040
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
SARS-CoV-2 has continued to evolve, posing increased risks of infectivity and transmissibility and a greater impact on communities worldwide. With the surge in collected SARS-CoV-2 sequences, studies have found that most emerging variants are linked to increased mutations in the spike (S) protein, as observed in the Alpha, Beta, Gamma, and Delta variants. Multiple genomic surveillance approaches have been used to monitor the mutational status and spread of the virus; however, most depend heavily on labels attributed to these sequences. Hence, this study presents a system that learns the protein language model of SARS-CoV-2 spike proteins, based on a bidirectional long short-term memory (BiLSTM) recurrent neural network, using sequence data alone. Once sequence embeddings are obtained from the model, clusters are generated with the Leiden clustering algorithm and visualized to monitor similarities between variants in terms of grammatical probability and semantic change. Additionally, the system measures the validity of a user-generated next-generation sequence, capturing potential sequence mutations indicative of viral escape, particularly substitution mutations. Further studies on methods for uncovering the semantic rules that govern spike proteins are recommended to learn more about other viral characteristics that bear on the future of the COVID-19 pandemic.
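The abstract names two quantities for judging a substituted sequence, grammatical probability and semantic change, but gives no formulas. The sketch below is only an illustration of how such scores might be computed for substitution mutations, under loudly stated assumptions: a position-wise residue frequency profile stands in for the BiLSTM's predicted probabilities, an amino-acid composition vector stands in for the learned sequence embedding, and semantic change is taken to be the L1 distance between embeddings. None of these choices are confirmed by the abstract, every function name is hypothetical, and the Leiden clustering step is not shown.

```python
from collections import Counter
from typing import Dict, List, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def find_substitutions(reference: str, query: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, reference residue, query residue) per mismatch."""
    if len(reference) != len(query):
        raise ValueError("substitution-only comparison needs equal-length sequences")
    return [(i + 1, r, q) for i, (r, q) in enumerate(zip(reference, query)) if r != q]


def fit_profile(sequences: List[str], pseudocount: float = 1.0) -> List[Dict[str, float]]:
    """Toy stand-in for the language model (assumption): smoothed residue
    frequencies at each position of equal-length training sequences."""
    total = len(sequences) + pseudocount * len(AMINO_ACIDS)
    profile = []
    for i in range(len(sequences[0])):
        counts = Counter(seq[i] for seq in sequences)
        profile.append({a: (counts[a] + pseudocount) / total for a in AMINO_ACIDS})
    return profile


def embed(sequence: str) -> List[float]:
    """Toy stand-in for a learned embedding (assumption): residue composition."""
    counts = Counter(sequence)
    return [counts[a] / len(sequence) for a in AMINO_ACIDS]


def semantic_change(reference: str, query: str) -> float:
    """L1 distance between the two embeddings (assumed definition)."""
    return sum(abs(a - b) for a, b in zip(embed(reference), embed(query)))


def score_substitutions(reference: str, query: str,
                        profile: List[Dict[str, float]]) -> List[dict]:
    """Report each substitution with its grammaticality (probability of the new
    residue at that position) and the sequence-level semantic change."""
    change = semantic_change(reference, query)
    return [
        {"mutation": f"{ref}{pos}{alt}",
         "grammaticality": profile[pos - 1][alt],
         "semantic_change": change}
        for pos, ref, alt in find_substitutions(reference, query)
    ]


if __name__ == "__main__":
    # Made-up five-residue fragments, not real spike protein data.
    training = ["MFVFL", "MFVFL", "MFVYL"]
    for row in score_substitutions("MFVFL", "MFVYL", fit_profile(training)):
        print(row)
```

In a real pipeline the profile and the composition vector would be replaced by the trained network's output probabilities and hidden-state embeddings; the scoring loop itself would stay much the same.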
Pages: 429-434
Number of pages: 6