The traditional tradeoff in simulations between accuracy and speed is predicated on the assumption that force fields are typically well-parameterized. We re-examine this trade-off and show that a properly formulated coarse-grain model trained using machine learning methods can rival all-atom models for de novo protein folding and dynamics simulations. We use the contrastive divergence technique to train the force field directly from short simulation trajectories of 450 proteins beginning near their native structures. This method is applied to our recently developed Upside model, where the free energy for side chains are rapidly calculated at every time-step, allowing for a smooth energy landscape without steric rattling of the side chains (1). After this contrastive divergence training, the model is able to fold proteins up to ~100 residues de novo on a single core in CPU core-days (2).
The Upside model's success argues that simpler models that can be globally parameterized can rival more detailed but slower models whose parameterization is more challenging - more complexity does not necessarily equate to higher accuracy! Upside’s ready generation of Boltzmann ensembles allows for a wide range of studies of protein folding, dynamics and binding. Additionally, in studies that incorporate experimental or bioinformatics data, including contact predictions, Upside provides an inexpensive Bayesian prior distribution that may be updated using experimental information.
1. Rapid calculation of side chain packing and free energy with applications to protein molecular dynamics. https://arxiv.org/abs/1610.07277
2. Trajectory-Based Parameterization of a Coarse-Grained Forcefield for High-Throughput Protein Simulation. http://www.biorxiv.org/content/early/2017/07/27/169326
J.M. Jumper, K.F. Freed, T.R. Sosnick, "Trajectory-Based Parameterization of a Coarse-Grained Forcefield for High-Throughput Protein Simulation" bioRxiv:169326.
J.M. Jumper, K.F. Freed, T.R. Sosnick, "Maximum-likelihood, self-consistent side chain free energies with applications to protein molecular dynamics" arXiv:1610.07277 [q-bio.BM].