Markov State Models: To Optimize or Not to Optimize

Cited by: 5
Authors
Arbon, Robert E. [1, 2]
Zhu, Yanchen [1]
Mey, Antonia S. J. S. [1]
Affiliations
[1] EaStCHEM Sch Chem, Edinburgh EH9 3FJ, Scotland
[2] Redesign Sci, New York, NY 10014 USA
Keywords
DYNAMICS; VALIDATION
DOI
10.1021/acs.jctc.3c01134
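Chinese Library Classification (CLC) number
O64 [Physical chemistry (theoretical chemistry), chemical physics]
Discipline classification code
070304; 081704
Abstract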
Markov state models (MSMs) are a popular statistical method for analyzing the conformational dynamics of proteins, including protein folding. As with all statistical and machine learning (ML) models, choices must be made about the modeling pipeline that cannot be directly learned from the data. These choices, or hyperparameters, are often evaluated by expert judgment or, in the case of MSMs, by maximizing variational scores such as the VAMP-2 score. Modern ML and statistical pipelines often use automatic hyperparameter selection techniques ranging from the simple (choosing the best score from a random selection of hyperparameters) to the complex (e.g., Bayesian optimization). In this work, we ask whether it is possible to automatically select MSMs this way by estimating and analyzing over 16,000,000 observations from over 280,000 estimated MSMs. We find that differences in hyperparameters can change the physical interpretation of the optimization objective, making automatic selection difficult. In addition, we find that enforcing conditions of equilibrium in the VAMP scores can result in inconsistent model selection. However, other parameters that specify the VAMP-2 score (lag time and number of relaxation processes scored) have only a negligible influence on model selection. We suggest that model observables and variational scores should be only a guide to model selection and that a full investigation of the MSM properties should be undertaken when selecting hyperparameters.
Pages: 977-988
Page count: 12
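The abstract refers to the VAMP-2 score and to its two parameters, the lag time and the number of relaxation processes scored. As a rough illustration only, and not the authors' pipeline, the sketch below computes a VAMP-2 score for a discrete-state trajectory using NumPy, following the standard definition (the sum of the k largest squared singular values of the whitened time-lagged count matrix). The function name `vamp2_score` and its arguments are hypothetical, and the sketch assumes a single trajectory already assigned to states, with no cross-validation.

```python
import numpy as np


def vamp2_score(dtraj, n_states, lag, k):
    """Illustrative VAMP-2 score for one discrete trajectory (hypothetical helper).

    dtraj    : sequence of integer state labels in [0, n_states)
    lag      : lag time in frames
    k        : number of singular values (relaxation processes) scored
    """
    dtraj = np.asarray(dtraj, dtype=int)
    x, y = dtraj[:-lag], dtraj[lag:]

    # Time-lagged count matrix, normalized to a joint probability C01.
    counts = np.zeros((n_states, n_states))
    np.add.at(counts, (x, y), 1.0)
    c01 = counts / counts.sum()

    # For indicator features, C00 and C11 are diagonal with these entries.
    p0 = c01.sum(axis=1)
    p1 = c01.sum(axis=0)

    # Inverse square roots, leaving unvisited states at zero.
    inv_sqrt0 = np.zeros_like(p0)
    inv_sqrt1 = np.zeros_like(p1)
    inv_sqrt0[p0 > 0] = p0[p0 > 0] ** -0.5
    inv_sqrt1[p1 > 0] = p1[p1 > 0] ** -0.5

    # Whitened matrix C00^{-1/2} C01 C11^{-1/2}; its singular values lie in [0, 1].
    whitened = inv_sqrt0[:, None] * c01 * inv_sqrt1[None, :]
    singular_values = np.linalg.svd(whitened, compute_uv=False)

    return float(np.sum(singular_values[:k] ** 2))
```

The leading singular value is always 1 (the stationary process), so the score lies between 1 and k. In a simple random search of the kind the abstract describes, one would evaluate such a score for each sampled hyperparameter set at a fixed `lag` and `k`. The abstract's caution is that the highest-scoring model still needs a full inspection before it is selected.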