Score-based generative modeling for de novo protein design

被引:28
|
作者
Lee, Jin Sub [1 ,2 ]
Kim, Jisun [2 ]
Kim, Philip M. [1 ,2 ,3 ]
机构
[1] Univ Toronto, Dept Mol Genet, Toronto, ON, Canada
[2] Univ Toronto, Donnelly Ctr Cellular & Biomol Res, Toronto, ON, Canada
[3] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
来源
NATURE COMPUTATIONAL SCIENCE | 2023年 / 3卷 / 05期
基金
加拿大自然科学与工程研究理事会;
关键词
We acknowledge the CIHR Project Grant (grant no. PJT-153279) and NSERC Discovery Grant (grant no. RGPIN-2017-064) for funding. The funders had no role in study design; data collection and analysis; decision to publish or preparation of the manuscript. We also thank the Digital Research Alliance of Canada for computing resources.We acknowledge the CIHR Project Grant (grant no. PJT-153279) and NSERC Discovery Grant (grant no. RGPIN-2017-064) for funding. The funders had no role in study design; decision to publish or preparation of the manuscript. We also thank the Digital Research Alliance of Canada for computing resources;
D O I
10.1038/s43588-023-00440-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study proposes a diffusion model, ProteinSGM, for the design of novel protein folds. The designed proteins are diverse, experimentally stable and structurally consistent with predicted models The generation of de novo protein structures with predefined functions and properties remains a challenging problem in protein design. Diffusion models, also known as score-based generative models (SGMs), have recently exhibited astounding empirical performance in image synthesis. Here we use image-based representations of protein structure to develop ProteinSGM, a score-based generative model that produces realistic de novo proteins. Through unconditional generation, we show that ProteinSGM can generate native-like protein structures, surpassing the performance of previously reported generative models. We experimentally validate some de novo designs and observe secondary structure compositions consistent with generated backbones. Finally, we apply conditional generation to de novo protein design by formulating it as an image inpainting problem, allowing precise and modular design of protein structure.
引用
收藏
页码:382 / 392
页数:11
相关论文
共 50 条
  • [1] Score-based generative modeling for de novo protein design
    Jin Sub Lee
    Jisun Kim
    Philip M. Kim
    Nature Computational Science, 2023, 3 : 382 - 392
  • [2] Score-based Generative Modeling in Latent Space
    Vahdat, Arash
    Kreis, Karsten
    Kautz, Jan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] Convergence for score-based generative modeling with polynomial complexity
    Lee, Holden
    Lu, Jianfeng
    Tan, Yixin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] Conditional generative modeling for de novo protein design with hierarchical functions
    Kucera, Tim
    Togninalli, Matteo
    Meng-Papaxanthos, Laetitia
    BIOINFORMATICS, 2022, 38 (13) : 3454 - 3461
  • [5] Diffusion Schrodinger Bridge with Applications to Score-Based Generative Modeling
    De Bortoli, Valentin
    Thornton, James
    Heng, Jeremy
    Doucet, Arnaud
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [6] Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance
    Kwon, Dohyun
    Fan, Ying
    Lee, Kangwook
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [7] Convergence of score-based generative modeling for general data distributions
    Lee, Holden
    Lu, Jianfeng
    Tan, Yixin
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 946 - 985
  • [8] Shape-Based Generative Modeling for de Novo Drug Design
    Skalic, Miha
    Jimenez, Jose
    Sabbadin, Davide
    De Fabritiis, Gianni
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (03) : 1205 - 1214
  • [9] Riemannian Score-Based Generative Modelling
    De Bortoli, Valentin
    Mathieu, Emile
    Hutchinson, Michael
    Thornton, James
    Teh, Yee Whye
    Doucet, Arnaud
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [10] Generative artificial intelligence for de novo protein design
    Winnifrith, Adam
    Outeiral, Carlos
    Hie, Brian L.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2024, 86