A predictive language model for SARS-CoV-2 evolution

Cited by: 0
Authors
Ma, Enhao [1]
Guo, Xuan [1, 2]
Hu, Mingda [3]
Wang, Penghua [4]
Wang, Xin [3]
Wei, Congwen [3]
Cheng, Gong [1, 2]
Affiliations
[1] Tsinghua Univ, Sch Basic Med Sci, 30 Shuangqing Rd, Beijing 100084, Peoples R China
[2] Inst Infect Dis, Shenzhen Bay Lab, Guangqiao Rd, Shenzhen 518000, Guangdong, Peoples R China
[3] Beijing Inst Biotechnol, 20 Dongdajie, Beijing 100071, Peoples R China
[4] Univ Connecticut Hlth Ctr, Sch Med, Dept Immunol, Farmington, CT 06030 USA
Funding
National Natural Science Foundation of China
Keywords
EVASION
DOI
10.1038/s41392-024-02066-x
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Modeling and predicting mutations are critical for preparedness against COVID-19 and similar pandemics. However, existing predictive models have yet to integrate the regularity and randomness of viral mutations while keeping data requirements minimal. Here, we develop a non-demanding language model that uses both regularity and randomness to predict candidate SARS-CoV-2 variants and mutations that might prevail. We constructed "grammatical frameworks" of the available S1 sequences for dimension reduction and semantic representation, allowing the model to capture latent regularity. The mutational profile, defined as the frequency of mutations, was introduced into the model to incorporate randomness. With this model, we identified several variants with significantly enhanced viral infectivity and immune evasion and validated them in wet-lab experiments. By inputting sequence data from three different time points, we detected circulating strains or vital mutations for the XBB.1.16, EG.5, JN.1, and BA.2.86 strains before their emergence. Our results also predicted previously unknown variants that may cause future epidemics. Supported by both data validation and experimental evidence, our study presents a fast-responding, concise, and promising language model, potentially generalizable to other viral pathogens, that forecasts viral evolution and detects crucial mutation hot spots, thereby providing early warning of emerging variants that might raise public health concern.
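The abstract defines the mutational profile as the frequency of mutations. As a rough illustration only, the sketch below counts per-position substitutions across pre-aligned sequences and divides by the number of sequences. The function name `mutational_profile`, the toy sequences, and the choice to compare against a single reference are all assumptions made for this example; the paper's actual definition and pipeline may differ.

```python
from collections import Counter


def mutational_profile(reference, sequences):
    """Map (position, ref_residue, alt_residue) to its frequency.

    Hypothetical helper: assumes every sequence is already aligned
    to the reference and has the same length (no indels handled).
    Positions are 1-based.
    """
    if not sequences:
        return {}
    counts = Counter()
    for seq in sequences:
        if len(seq) != len(reference):
            raise ValueError("sequences must be aligned to the reference length")
        for pos, (ref, alt) in enumerate(zip(reference, seq), start=1):
            if alt != ref:
                counts[(pos, ref, alt)] += 1
    total = len(sequences)
    return {mutation: n / total for mutation, n in counts.items()}


# Toy data, not real S1 sequences.
reference = "NLVKNK"
sequences = ["NLVKNK", "NLVRNK", "NLVRNK", "KLVKNK"]
profile = mutational_profile(reference, sequences)
```

Here the substitution at position 4 (K to R) appears in two of the four toy sequences, so its entry in `profile` carries the highest frequency.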
Pages: 17