Machine-learning-guided directed evolution for protein engineering

被引:621
|
作者
Yang, Kevin K. [1 ]
Wu, Zachary [1 ]
Arnold, Frances H. [1 ]
机构
[1] CALTECH, Div Chem & Chem Engn, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
STABILITY CHANGES; SEQUENCE; MUTATIONS; PREDICTION; KERNEL; MODEL;
D O I
10.1038/s41592-019-0496-6
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein engineering through machine-learning-guided directed evolution enables the optimization of protein functions. Machine-learning approaches predict how sequence maps to function in a data-driven manner without requiring a detailed model of the underlying physics or biological pathways. Such methods accelerate directed evolution by learning from the properties of characterized variants and using that information to select sequences that are likely to exhibit improved properties. Here we introduce the steps required to build machine-learning sequence-function models and to use those models to guide engineering, making recommendations at each stage. This review covers basic concepts relevant to the use of machine learning for protein engineering, as well as the current literature and applications of this engineering paradigm. We illustrate the process with two case studies. Finally, we look to future opportunities for machine learning to enable the discovery of unknown protein functions and uncover the relationship between protein sequence and function.
引用
收藏
页码:687 / 694
页数:8
相关论文
共 50 条
  • [21] Challenges and opportunities in machine learning-guided plant protein engineering
    Shukla, Diwakar
    BIOPHYSICAL JOURNAL, 2024, 123 (03) : 455A - 455A
  • [22] Machine learning-assisted directed protein evolution with combinatorial libraries
    Wu, Zachary
    Kan, S. B. Jennifer
    Lewis, Russell D.
    Wittmann, Bruce J.
    Arnold, Frances H.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (18) : 8852 - 8858
  • [23] STAR: A Web Server for Assisting Directed Protein Evolution with Machine Learning
    Yang, Likun
    Liang, Xiaoli
    Zhang, Na
    Lu, Lu
    ACS OMEGA, 2023, 8 (47): : 44751 - 44756
  • [24] A fast machine-learning-guided primer design pipeline for selective whole genome amplification
    Dwivedi-Yu, Jane
    Oppler, Zachary
    Mitchell, Matthew
    Song, Yun
    Brisson, Dustin
    PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (04)
  • [25] Directed evolution approaches for protein engineering
    Farinas, ET
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2006, 9 (04) : 235 - 236
  • [26] Advances in machine learning for directed evolution
    Wittmann, Bruce J.
    Johnston, Kadina E.
    Wu, Zachary
    Arnold, Frances H.
    CURRENT OPINION IN STRUCTURAL BIOLOGY, 2021, 69 : 11 - 18
  • [27] Structural and Electronic Properties of Two-Dimensional Materials: A Machine-Learning-Guided Prediction
    Ramanathan, Eshwar S.
    Chowdhury, Chandra
    CHEMPHYSCHEM, 2023, 24 (21)
  • [28] Machine-Learning-Guided Typestate Analysis for Static Use-After-Free Detection
    Yan, Hua
    Sui, Yulei
    Chen, Shiping
    Xue, Jingling
    33RD ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2017), 2017, : 42 - 54
  • [29] Machine learning and genetic algorithm-guided directed evolution for the development of antimicrobial peptides
    Zhang, Heqian
    Wang, Yihan
    Zhu, Yanran
    Huang, Pengtao
    Gao, Qiandi
    Li, Xiaojie
    Chen, Zhaoying
    Liu, Yu
    Jiang, Jiakun
    Gao, Yuan
    Huang, Jiaquan
    Qin, Zhiwei
    JOURNAL OF ADVANCED RESEARCH, 2025, 68 : 415 - 428
  • [30] Machine-Learning-Guided Identification of Coordination Polymer Ligands for Crystallizing Separation of Cs/Sr
    Zhang, Zhiyuan
    Cheng, Min
    Xiao, Xinyi
    Bi, Kexin
    Song, Ting
    Hu, Kong-qiu
    Dai, Yiyang
    Zhou, Li
    Liu, Chong
    Ji, Xu
    Shi, Wei-qun
    ACS APPLIED MATERIALS & INTERFACES, 2022, 14 (29) : 33076 - 33084